Advantages of latent class over continuous mixture of Logit models
|
|
|
- Shon Stevenson
- 10 years ago
- Views:
Transcription
1 Advantages of latent class over continuous mixture of Logit models Stephane Hess Moshe Ben-Akiva Dinesh Gopinath Joan Walker May 16, 2011 Abstract This paper adds to a growing body of evidence highlighting the potential advantages of Latent Class Logit models over continuous mixture Logit models. In particular, we present formulae for correlation between coefficients and for elasticities, and show how these are a function of any sociodemographic attributes included in the class allocation model. An empirical analysis is then conducted which not only confirms these advantages in interpretation, but also shows that, even with a limited number of classes, the Latent Class Logit models achieves very similar model fit to that of its continuous mixture counterpart. 1 Introduction The recognition that there exist fundamental differences in preferences between individuals faced with the same choice tasks has been one of the cornerstones of work in the area of behavioural modelling. In a transportation context, the main emphasis in recent years has been on accommodating these variations through a random coefficients approach, with particular interest in continuous Logit mixture models. On the other hand, especially in the marketing literature, latent class approaches have come to dominate. Importantly, a number of past comparisons Institute for Transport Studies, University of Leeds, [email protected], Tel: +44 (0) , Fax: +44 (0) Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, [email protected] Profit Engineering Group, [email protected] Institute of Transportation Studies, University of California at Berkeley, Joan- [email protected] 1
2 between the two structures have highlighted the possible advantages of latent class structures, as for example in the work of Gopinath (1995), Greene and Hensher (2003) and Shen (2009). In the present paper, we extend on such past comparisons with a view to encouraging more widespread use of latent class choice models in a transportation context. Both types of models are based on the idea of using a mixture of a simple underlying model, typically Multinomial Logit, over the distribution of preferences. In the continuous Logit mixture model, this distribution is continuous, while in the latent class context, a finite number of classes are used to express the heterogeneity. The fact that latent class models rely on a limited number of support points in the distribution could arguably be seen as a shortcoming, and may be behind the slow uptake of this model in the field of transport research. Conversely, while continuous mixture models are very flexible (cf. McFadden and Train, 2000), there are also numerous pitfalls in addition to the well documented high computational costs. These relate primarily to the distributional assumptions (see e.g. Hensher and Greene, 2003; Hess et al., 2005; Fosgerau, 2006), and the implications thereof on the interpretation of results and the computation of important model outputs such as willingness-to-pay (WTP) indicators (cf. Daly et al., 2009). Finite mixture structures such as the latent class model are less affected by the computational burden and interpretation issues highlighted above for continuous mixtures. Independently of whether continuous or finite mixtures are used to accommodate the heterogeneity across respondents, there exists the possibility of linking this heterogeneity to covariates, i.e. moving away from a purely random treatment of the variations. In a continuous mixture context, this is done through expressing the parameters of the random distribution as a function of these covariates, while, in a latent class model, this is accommodated in the class allocation model. This paper provides some further insights into the differences between Latent Class Logit structures and continuous Logit mixture models, and especially the potential advantages of the former. We first derive equations for inter-coefficient correlation and elasticities, linking these measures directly to socio-demographic attributes used in the class allocation model. We then describe an application comparing Latent Class Logit models to Logit and continuous mixtures of Logit models. The remainder of this paper is organised as follows. Section 2 discusses modelling methodology, including an in-depth look at taste heterogeneity, intercoefficient correlation and elasticities in Logit, continuous mixture of Logit and Latent Class Logit models. Section 3 presents the results of an application on Stated Choice (SC) data for departure time and mode choice. Finally, Section 4 2
3 presents the conclusions of the research. 2 Methodology This section sets out the methodology used in the paper. We first discuss general methodology before looking at taste heterogeneity, correlation and elasticities. 2.1 Background methodology Let P n (i β) give the probability of respondent n choosing alternative i, conditional on a vector of taste coefficients β. In a Logit model, we have: P n (i β) = e V ni J, (1) j=1 ev nj where J is the total number of alternatives, and where the observed utility V ni is given by f (x ni, β), which is a function of the attributes of alternative i as faced by respondent n and the vector of taste coefficients β 1. In a continuous mixture model, the vector β follows a random distribution with parameters Ω, and the choice probabilities are given by: P n (i Ω) = P n (i β) f (β Ω) dβ, (2) β where P n (i β) once again gives the Logit choice probability from Equation 1 and where f (β Ω) gives the density function for the vector of taste coefficients β. In the case of multiple choices for each respondent, the assumption is generally made that the tastes vary across respondents but not across choices for the same respondent (cf. Revelt and Train 1998 and see Hess and Rose 2009 for a recent discussion of this issue), and the probability of the observed sequence of choices is used in the maximisation of the log-likelihood. This probability is given by: [ Tn ] L n (j n1,..., j ntn Ω) = P n (j nt β) f (β Ω) dβ, (3) β t=1 where j nt gives the alternative chosen by respondent n in choice situation t, with T n giving the total number of choices for respondent n. In a Latent Class Logit model, the heterogeneity in tastes across respondents is accommodated by making use of separate classes with different values for the 1 The inclusion of any alternative specific constants is not made explicit here. 3
4 vector of taste coefficients β. Specifically, in a Latent Class Logit model with S classes, we would have S instances of the vector β, say β 1 to β S, with a possibility of some of the elements in β staying constant across some of the classes. A Latent Class Logit model uses a probabilistic class allocation model, where respondent n belongs to class s with probability π ns, and where 0 π ns 1 s and S π ns = 1. Latent Class models are generally specified with an underlying Logit model, but can easily be adapted for more general underlying structures such as Nested Logit or Cross-Nested Logit. Let P n (i β s ) give the probability of respondent n choosing alternative i conditional on respondent n falling into class s. The unconditional (on s) choice probability for alternative i and respondent n is then given by: S P n (i β 1,..., β S ) = π ns P n (i β s ), (4) i.e. the weighted sum of choice probabilities across the S classes, with the class allocation probabilities being used as weights. Unlike with the continuous mixture model, no simulation is required in the estimation of Latent Class Logit models. This specification can easily be extended to a situation with multiple choices per respondent, where, when making the same assumption of intra-respondent homogeneity as in Equation 3, we obtain: ( S Tn ) L n (j n1,..., j ntn β 1... β S ) = π ns P n (j nt β s ). (5) In the most basic version of a Latent Class Logit model, the class allocation probabilities are constant across respondents such that π ns = π s n. This structure is often referred to as a discrete mixture model. The real flexibility however arises when the class allocation probabilities are not constant across respondents but when a class allocation model is used to link these probabilities to characteristics of the respondents. Typically, these characteristics would take the form of socio-demographic variables, such as income, age and employment status. With z n giving the concerned vector of characteristics for respondent n, and with the class allocation model taking on a Logit form, the probability of respondent n falling into class s would be given by: π ns = t=1 e δs+g(γs,zn) S l=1, eδ l+g(γ l,z n). (6) where δ s is a class-specific constant 2, γ s is a vector of parameters to be estimated and g ( ) gives the functional form of the utility function for the class allocation 2 In a discrete mixture model, only these constants would be estimated. 4
5 model. Here, a major difference arises between class allocation models and choice models. In a choice model, the attributes vary across alternatives while the estimated coefficients (with a few exceptions) stay constant across alternatives. In a class allocation model, the attributes normally stay constant across classes while the parameters vary across classes. This allows the model to probabilistically allocate respondents to different classes depending on their socio-demographic characteristics. For example, a situation where high income and low income respondents are allocated differently to two classes could be represented with a positive income coefficient for the first class and a negative income coefficient for the second class. Finally, it should also be said that it is possible to combine Latent Class Logit and continuous mixture structures, leading to latent class structures with some continuous elements, as for example done by Walker and Li (2006). 2.2 Taste heterogeneity Some major differences arise across the three model structures in terms of their treatment of taste heterogeneity. In a Logit model, any taste heterogeneity needs to be accommodated in a deterministic way by linking marginal utility coefficients to socio-demographic indicators. This can either be done by estimating separate coefficients for mutually exclusive subgroups of the sample population (e.g. trip purpose) or by continuous interaction between taste coefficients and socio-demographic attributes such as income or age. Although some taste heterogeneity may still be explained in a deterministic manner, the main characteristic of the continuous Logit mixture comes in its random representation of taste heterogeneity, with the vector β varying across respondents according to a pre-specified statistical distribution with estimated parameters. Here, an interesting development comes in the form of models linking the parameters of these distributions to socio-demographic indicators of the respondents (cf. Greene et al., 2006). In a Latent Class Logit model, the taste heterogeneity is accommodated as a mixture between a deterministic and a random approach. A probabilistic model is used to allocate respondents to the different classes that characterise different tastes in the sample. However, the class allocation is not purely random but is a function of socio-demographic characteristics of the respondents. Finally, unlike with continuous mixtures, no a priori assumptions are made about the shape of the distribution of tastes other than the number of support points which is equal to the number of classes. In the simple discrete mixture case, the class allocation is not linked to socio-demographic information, bringing the model 5
6 closer to a continuous mixture model, but the number of classes is still fixed and no assumptions are made about the shape of the distribution of tastes across classes. 2.3 Correlation between taste coefficients Further important differences arise between the three models when it comes to correlation between taste coefficients. In the Logit model, such correlation only arises in the case where taste coefficients interact with socio-demographic attributes and specifically where multiple taste coefficients interact with the same socio-demographic characteristics. As an example, one could imagine a situation where cost sensitivity decreases with income while time sensitivity increases with income, resulting in negative correlation between the time and cost coefficients across the sample. While the correlation can thus be linked to socio-demographics, it should be said that in the majority of Logit applications, the coefficients will be distributed independently across the sample. In a continuous mixture model, correlation can be accommodated by specifying a joint distribution for the taste coefficients. While most estimation packages allow users to specify multivariate Normal distributions, the vast majority of continuous mixture applications make use of independently distributed taste coefficients. Correlation is rarely introduced in models not based on the Normal distribution, one exception being given in Walker (2001). In a Latent Class Logit model, correlation between taste coefficients is an inherent characteristic of the model structure. Let us assume that our model has S classes. Let us further assume that across all alternatives, our model makes use of P attributes, such that each vector of taste coefficients β s similarly contains P individual coefficients. Combining these vectors across the S classes, we obtain a (P xs) matrix of taste coefficients given by: β = β 1,1 β 1,2... β 1,S β 2,1 β 2,2... β 2,S.... β P,1 β P,2... β P,S, (7) where each row corresponds to one marginal utility coefficient and each column corresponds to one class. From this, it can be seen that in a Latent Class Logit model, there is likely to be correlation between two coefficients as long as both coefficients take on more than one value across the S classes. In addition to the actual taste coefficients shown in Equation 7, a Latent Class Logit model is also characterised by an additional vector giving the class 6
7 allocation probabilities. This vector is respondent specific, with: π n = [π n1,..., π ns ], (8) and we have that: P (β n1 = β 1,s β n2 = β 2,s... β np = β P,s ) = π ns, (9) with β np giving the value for the p th marginal utility coefficient for respondent n. From this, it can be seen that the correlation between taste coefficients in a Latent Class Logit model is a function of the class allocation probabilities as well as the values of the individual taste coefficients. Indeed, we have that: cov (β n1, β n2 ) = E [(β n1 E (β n1 )) (β n2 E (β n2 ))] = E (β n1 β n2 ) E (β n1 ) E (β n2 ) ( S S ) ( S ) = π ns β 1,s β 2,s π ns β 1,s π ns β 2,s (10) For ease of notation, let α = β 1 and γ = β 2 in which case Equation 10 can be written as: ( S S ) ( S ) cov (α n, γ n ) = π ns α s γ s π ns α s π ns γ s (11) A special situation arises when S = 2, in which case the class allocation probabilities have no effect on the sign of the correlation. Indeed, with the notation from Equation 11, we then have: cov (α n, γ n ) = π n1 π n2 [α 1 (γ 1 γ 2 ) + α 2 (γ 2 γ 1 )] = π n1 π n2 [(α 1 α 2 ) (γ 1 γ 2 )], (12) where the sign of cov (α n, γ n ) only depends on the changes in the two elements in α and γ across the two classes. This discussion has already shown that in a Latent Class Logit model, the correlation between coefficients is implicit in the model structure, and that, unlike with continuous mixtures, no additional adaptation of the specification (such as the form of the distribution) is required to accommodate it. However, another important distinction has to be made. If a multivariate distribution is used in a continuous mixture model, then the correlation between two coefficients is constant across respondents, unless the cross-diagonal terms in the covariance matrix are themselves parameterised. However, it can be seen from Equation 10 7
8 that in a Latent Class Logit model, the covariance (and hence the correlation) between two coefficients depends on the class allocation probabilities. Except in the case of a Latent Class Logit model with purely random class allocation (i.e. a discrete mixture model), the correlation itself thus varies across respondents as a function of the socio-demographic attributes used in the class allocation probabilities. 2.4 Elasticities As a final step, we now look at the elasticities in the different models. The Logit elasticities are well known (see e.g. Ben-Akiva and Lerman 1985), with the direct elasticity given by: E i,xni = V ni x ni x ni (1 P n (i β)), (13) while the cross-elasticity is given by: E i,xnj = V nj x nj x nj P n (j β), (14) exhibiting the IIA characteristic. In the continuous Logit mixture, the direct elasticity (see e.g. Train 2009) is given by: V ni β x E i,xni = ni x ni (1 P n (i β)) P n (i β) f (β Ω) dβ β P, (15) n (i β) f (β Ω) dβ with the cross-elasticity being: E i,xnj = V nj β x nj x nj P n (j β) P n (i β) f (β Ω) dβ β P, (16) n (i β) f (β Ω) dβ where this varies across alternatives, such that the continuous Logit mixture does not exhibit the IIA property. Here, it can be seen that the elasticities are given by an integration of Logit elasticities. We will now derive the elasticities for the Latent Class Logit model. Starting 8
9 with the direct elasticity, we have: x ni E i,xni = P n (i β) x ni P n (i β) ( S ) P n (i β s ) = π ns x ni ( S = = S x ni P n (i β) π ns V nis x ni P n (i β s ) (1 P n (i β s )) π ns P n (i β s ) P n (i β) ) x ni P n (i β) [ ] Vnis x ni (1 P n (i β s )). (17) x ni It can be seen that the term in square brackets corresponds to a Logit direct elasticity for a specific class in our Latent Class Logit model. This means that the direct elasticities in a Latent Class Logit model are a weighted sum of Logit elasticities, with the weights being given by multiplying the class membership probability with the class specific conditional probability and by dividing this product by the marginal probability. It can similarly be seen that the Latent Class Logit cross-elasticities are given by a weighted sum of Logit cross-elasticities. Specifically, we have: x nj E i,xnj = P n (i β) x nj P n (i β) S ( = π ns V ) njs x nj P n (i β s ) P n (j β s ) x nj P n (i β) = S π ns P n (i β s ) P n (i β) [ V ] njs x nj P n (j β s ). (18) x nj Two main observations can be made. Firstly, the similarity with the continuous Logit mixture elasticities is clearly visible. However, weighted summation replaces integration, such that no simulation is required. Like in the continuous mixture model, the cross-elasticities vary across alternatives, such that the model does not exhibit the IIA assumption. The second observation relates to variations across individuals. The Logit and continuous Logit mixture elasticities can be seen to vary across respondents due to differences in the attribute levels of the alternatives and hence also probabilities. Additionally, any socio-demographic interactions will lead to further variations. However, the relationship between the socio-demographic attributes and the elasticities cannot always be easily determined, especially in the continuous mixture model. In the Latent Class Logit 9
10 model on the other hand, the elasticities depend directly on the class allocation probabilities and as such are also a function of any socio-demographic attributes that enter into the class allocation model. 3 Empirical application This section presents an empirical application that illustrates the theoretical points discussed in Section 2. We first look at data and model specification before discussing the main estimation results. Finally, we look in turn at taste heterogeneity and inter-coefficient correlation, where, with the use of unlabelled SP data, no discussion on elasticities is included. 3.1 Data Our analysis makes use of Stated Choice (SC) data collected for the DATIV study carried out in Denmark in 2004 (cf. Burge and Rohr, 2004). For this survey, a binomial unlabelled route choice experiment was used, with two attributes, namely travel time (TT) and travel cost (TC), describing the alternatives. The final sample used in our analysis makes use of 1, 919 observations collected from 241 commuters, with up to 8 choice situations per respondent. In the analysis presented in this paper, a number of socio-demographic variables were used as covariates in the specification of taste heterogeneity, namely age, gender, personal income, and the ability to regularly work from home. Attempts to make use of other socio-demographic attributes, notably working time flexibility, were not successful. 3.2 Model specification In the specification of the underlying utility function, an alternative specific constant (ASC) was included for the first alternative, with a view to capturing left to right reading effects. The two main marginal utility coefficients are β TT and β TC, representing the marginal utility of changes in travel time and travel cost respectively. A number of additional offset parameters were included to estimate deviations from these sensitivities in certain socio-demographic sectors. Here, β TC, low inc. and β TC, high inc. represent deviations in the cost sensitivity for low income (less than DKK300, 000) and high income (more than DKK700, 000) respondents, and β TC, homeworking captures changes in travel cost sensitivity for respondents who regularly work from home. Finally, attempts were made to capture age effects, where the only significant change was observed using a piecewise linear interaction between age and travel time sensitivity, with constant 10
11 sensitivity for respondents below 40 years of age, gradually changing sensitivity for respondents aged between 40 and 60, and once again constant sensitivity for respondents over 60 years of age. The interaction in this middle age group is represented by β TT, age pl. group 2 where the estimate represents the change in sensitivity between a respondent of 40 years of age and a respondent of 60 years of age. No effects were observed for gender in this specification, nor were the time sensitivities different for respondents who regularly work from home. Moving away from the simple Logit model with deterministic taste heterogeneity only, a continuous Logit mixture was estimated, allowing for random heterogeneity in the two main coefficients, β TT and β TC, and recognising the repeated choice nature of the data through using a Revelt and Train (1998) style specification of the log-likelihood function, i.e. carrying out the integration/simulation at the level of a respondent rather than individual choice observation. Here, the best performance was obtained by making use of a multivariate Lognormal distribution, where additional offset parameters were estimated to allow for a bound different from zero. Specifically, the following specification was used: β TC = a TC + e µ TC+σ 11 ξ 1 (19) β TT = a TT + e µ TT+σ 21 ξ 1 +σ 22 ξ 2, (20) where a TC and a TT represent the offset parameters, µ TC and µ TT represent the means of the underlying Normal distribution, σ 11, σ 21 and σ 22 represent the terms of the Cholesky matrix, and ξ 1 and ξ 2 are two standard Normal variates. The sign change on β TC and β TT is required due to the positive domain of the Lognormal distribution. The third structure estimated on the data was a Latent Class Logit model. In this model, δ 1, β TT, age pl. group 2, β TC, low inc., β TC, high inc., and β TC, homeworking were kept as class independent, and only β TT and β TC were allowed to vary across classes, much as in the continuous Logit mixture. In the class allocation model, a total of seven parameters were included for each class. Along with an intercept (δ), coefficients were included for female respondents (β female ) and respondents who regularly work from home (β homeworking ). Age effects were once again captured through a piecewise linear specification, with three segments, namely respondents aged between 23 and 40 (β age pl. group 1 ), respondents aged between 40 and 60 (β age pl. group 2 ), and respondents aged between 60 and 73 (β age pl. group 3 ). Finally, income was used as an explanator, where a linear specification was found to offer the best performance, with the coefficient β income interacting with the income in DKK100, 000s. A separate analysis showed that the optimal number of classes for the Latent Class Logit model in this context is 3. 11
12 3.3 Estimation results This section presents the estimation results for the different models calibrated on our data Main estimation results The detailed estimation results are reported in Table 1. To account for the repeated choice nature of the data in the computation of the standard errors, the panel specification of the sandwich matrix was used across all models (cf. Daly and Hess, 2010). Looking first at model fit, we can see that the continuous Logit mixture and Latent Class Logit models both easily outperform the Logit model, with highly significant increases in log-likelihood (LL) by and units respectively, coming at the cost of 5 respectively 18 additional parameters. While the Latent Class Logit model produces the best LL of the three models, the higher number of parameters when compared to the continuous Logit mixture gives it a marginally lower performance according to the adjusted ρ 2 measure. Overall, the differences in fit between the two models are very small. Turning next to the detailed estimation results, the Logit estimates show the expected negative marginal utilities for travel time and travel cost increases, where the cost sensitivity is higher in the low income group (coefficient significant at the 83% level) while it is lower in the high income group. The sensitivity to travel cost is also lower for respondents who regularly work from home (coefficient significant at the 76% level) while between the age of 40 and the age of 60, the marginal utility of travel time increases drops by around for each additional year in age. Finally, the estimate of the alternative specific constant is positive and significant, possibly suggesting the presence of left to right reading effects. In the estimates for the continuous Logit mixture, we can observe a drop in the significance of the interaction terms with the exception of β TC, homeworking. This arguably signals that accounting for random taste heterogeneity reduces the scope for deterministic heterogeneity in this model. The positive estimates for a TT and a TC do, in conjunction with the sign change, imply universally negative values for the travel time and travel cost coefficients, even when factoring in the interactions with the positive β TT, age pl. group 2, β TC, high inc., and β TC, homeworking parameters. In the Latent Class Logit model, we also observe reductions in the significance levels for interaction terms as witnessed in the continuous Logit mixture. Additionally, while the estimates for β TT and β TC are negative in all three classes (and remain so even when interacted with the positive β TT, age pl. group 2, β TC, high inc., and β TC, homeworking terms), problems with significance are observed in the third 12
13 Table 1: Detailed estimation results continuous Logit Latent Class Logit mixture model Logit model Observations 1,919 1,919 1,919 Respondents Final LL -1, , , par adj. ρ Class indep Class 1 Class 2 Class 3 est. t-rat. est. t-rat. est. t-rat. est. t-rat. est. t-rat. est. t-rat. Wald(=) p-value δ βtt, age pl. group βtc, low inc βtc, high inc βtc, homeworking att βtt µtt atc βtc µtc σ σ σ Class allocation model est. t-rat. est. t-rat. est. t-rat. Wald(=) p-value δ βfemale βhomeworking βage pl. group βage pl. group βage pl. group βincome
14 class. Indeed, the estimate for β TT is only significant around the 50% level, with a 65% level applying for the estimate of β TC. This result could in part be explained by earlier observations of some respondents who are largely indifferent between the two alternatives in this dataset (cf. Hess et al., 2010). In terms of differences across the three classes, the Wald test shows significant variations in both coefficients. The estimates for the various parameters used in the class allocation model show high degrees of variation in coefficient values across the three classes. While in the first of the three classes, only the income parameter attains a high level of statistical significance, the majority of parameters are significant for the remaining two classes Heterogeneity in the Latent Class Logit model Three different classes were identified in the Latent Class Logit model. Disregarding for the moment the presence of the additional interaction terms, the first class shows a valuation of travel time savings (VTTS) of 63.39DKK/hr, where this drops to 20.41DKK/hr in the second class, and 16.83DKK/hr in the third class, where problems with parameter significance were also observed. Alongside the differences in relative valuations across the three classes, we can also observe differences in absolute coefficient values, with visibly higher scale in the second class. These differences in scale across the classes are consistent with earlier observations of very substantial scale differences in this dataset by Hess et al. (2009). On the basis of the estimated class allocation probabilities for each respondent, and the values of the relevant explanatory variables, it is possible to work out an expected value for each of the six variables in each of three classes. As an example, the most likely value for the female dummy in class 1 would be obtained as: female class 1 = N n=1 (π n1 female n ) N n=1 π, (21) n1 where N is the number of respondents, female n is 1 if respondent n is female and 0 otherwise, and π n1 is the probability of respondent n falling into class 1, computed on the basis of the class allocation model. The results of this process are shown in Table 2, where in addition, the expected values for the piecewise linear terms were used to compute an expected age, and where the table also shows the VTTS in the different classes. From the results in Table 2, we can see that the low VTTS in the third class can most easily be linked to lower expected income. The expected income in the 14
15 Table 2: Expected values for explanatory variables in latent classes class 1 class 2 class 3 female homeworking age pl. group age pl. group age pl. group income age VTTS (DKK/hr) remaining two classes is higher, and although it is highest in the second class, the VTTS is much higher in the first class, where this can possibly be linked to a lower average age, a higher rate of regularly working from home (often linked to higher time sensitivity) and a lower representation of female respondents. It is also worth remembering that class 2, which captures more female respondents, as well as highly paid and slightly older respondents, shows the highest scale in Table Comparison of heterogeneity across models As a next step, we compare the retrieved heterogeneity patterns across the three models. We first look at how the three models represent the variation in the VTTS across different socio-economic subgroups. Table 3 takes 18 individuals, differentiated by age (three different ages), income (three different groups), and whether they regularly work from home (reg h-w). With gender only being used in the Latent Class Logit models, we use a purely male sample of respondents in this initial comparison. For the Logit models, the point value is computed on the basis of the main coefficients and the socio-demographic interaction coefficients. For the continuous Logit mixture, we have the additional continuous random component, while, in the Latent Class Logit model, we have a distribution across the three classes for each respondent. Hence, for the continuous Logit mixture and Latent Class Logit models, the standard deviation (for the given type of respondent) is presented alongside the mean. The mean values in the continuous Logit mixture model are higher across all 18 respondents when compared to the Logit point values. With only four exceptions (namely the oldest respondents in the high income group, and the medium and high income young respondents who do not regularly work from home), the Latent Class Logit mean values are lower than the continuous Logit 15
16 mixture values, and are evenly distributed around the Logit values (in terms of increases and decreases). Finally, without a single exception, the degree of heterogeneity for specific types of individual are lower with the Latent Class Logit model than with the continuous Logit mixture model. This is attributable in part to the finite mixture approach in the Latent Class Logit model as well as the use of the Lognormal distribution in the distribution in the continuous Logit mixture model. In the Logit and continuous Logit mixture models, the VTTS universally decreases with age, while this is not necessarily the case in the Latent Class Logit models, a result of the fact that greater heterogeneity in relation to age is accommodated through the inclusion of age as an explanator in the class allocation model. As expected, the VTTS increases with income across all models. Finally, while the VTTS is universally higher for respondents who regularly work from home in the Logit and continuous Logit mixture models, this is only the case for high income respondents in the Latent Class Logit models (and medium income respondents in the middle age group), where this distinction is once again a result of also incorporating this attribute in the class allocation model. So far, we have focussed solely on certain representative individuals, while it is clearly also of great interest to look at the sample level VTTS distribution. For the Logit model, this equates to working out the point value for each individual and looking at the distribution of these values across the sample. For the continuous Logit mixture and Latent Class Logit models, it is however important to additionally incorporate the respondent-level uncertainty in the calculation of the VTTS. In practical terms, for the Latent Class Logit model, a population level distribution is obtained by taking the three respondent-specific VTTS measures for each individual (i.e. for the three classes), and combining this into a set of values across the sample, with weights for each value given by dividing the individual-specific class allocation weights by the number of respondents. For the continuous Logit mixture case, a continuous analogue to this approach was employed, making use of 100, 000 random draws for each of the 241 respondents. The results of this process are summarised in Table 4, showing that while the mean VTTS is very similar in the Logit and Latent Class Logit models, the additional random component of the latter model leads to a greater degree of heterogeneity. For the continuous Logit mixture model, both the mean and especially the standard deviation are higher than in the Logit and Latent Class Logit models, where this is at least in part due to the long tail of the Lognormal distribution. 16
17 Table 3: Comparison of heterogeneity in VTTS across models and sociodemographic groups continuous Latent Class Logit mixture Logit gender age income reg h-w Logit mean s.dev. mean s.dev. male (31.5) low no male (31.5) low yes male (31.5) medium no male (31.5) medium yes male (31.5) high no male (31.5) high yes male (50) low no male (50) low yes male (50) medium no male (50) medium yes male (50) high no male (50) high yes male (66.5) low no male (66.5) low yes male (66.5) medium no male (66.5) medium yes male (66.5) high no male (66.5) high yes Correlation between travel time and travel cost coefficients In the Logit model, we have point estimates for the travel time and travel cost coefficients, and while these point estimates themselves may be correlated, there is no distribution of coefficients. In the continuous Logit mixture model, the two coefficients follow a multivariate random distribution, and while the mean in these distributions varies across respondents (through the incorporation of the interaction terms), the standard deviation stays constant, as does the correlation, which derives solely from the Cholesky transformation of the underlying multivariate Normal distribution. In the Latent Class Logit model, the actual distribution of the random component varies across individuals as the weights for the different classes are not constant across respondents. As a result, and using the formuale from Section 2.3, we can work out individual specific correlations, where this process is illustrated in Table 5 for ten representative individuals with different socio-demographic characteristics. Here, we note five respondents with negative correlation between 17
18 Table 4: Sample population level heterogeneity in VTTS measures mean std.dev. Logit continuous Logit mixture Latent Class Logit Table 5: Correlation between travel time and travel cost coefficients in Latent Class Logit model for ten representative individuals Class 1 Class 2 Class 3 age gender reg. w-h inc.gr. π β TT β TC π β TT β TC π β TT β TC corr 30 male no male no male no male yes female no female yes male no male yes male no male yes the two coefficients, with positive correlation for the remaining five respondents. The expectation would clearly be for negative correlation between time and cost sensitivities, but the presence of significant levels of scale heterogeneity can lead to positive correlation between the distribution of the individual coefficients. A study of the results in Table 5, along with a detailed analysis of the sample population results 3 reveals a number of relationships. Firstly, correlation is higher for female respondents, while it is lower for respondents who regularly work from home. Correlation increases with age up to 60 years, when it starts decreasing. Finally, correlation rises with income. As a final step in our analysis of the correlation patterns, we look at the distribution at the sample population level, with results reported in Table 6. These results were obtained in the same way as the sample population level distribution statistics in Table 4, and show positive correlation between the travel time and travel cost coefficient in all three models, consistent with the earlier comment about high degrees of scale heterogeneity in the DATIV data. The correlation is highest in the Latent Class Logit model, most likely as a result of 3 Detailed results available on request. 18
19 Table 6: Correlation between travel time and travel cost coefficient at sample population level corr (β TT, β TT ) Logit 0.22 continuous Logit mixture 0.60 Latent Class Logit 0.95 the major scale differences between class 2 and the remaining classes. 4 Summary and conclusions This paper has presented a comparison between the commonly used continuous Logit mixture model and the more rarely used Latent Class approach in their respective approaches to dealing with heterogeneity across respondents. The paper has first presented formulae for the inter-coefficient correlation and elasticities in Latent Class Logit models and has shown how these measures are a function of the socio-demographic attributes used in the class allocation model. Our empirical example has then further illustrated the differences between the two types of models, making use of stated choice data take from a value of time study. The results from the empirical application show that both the continuous Logit mixture and Latent Class Logit model produce significant gains in performance when compared to the Logit model by allowing for random taste heterogeneity on top of the already incorporated deterministic variations. The actual fit of the two advanced models is comparable, but significant differences arise between the models in terms of substantive results. Firstly, it is clear that the Latent Class Logit models are able to retrieve richer patterns of heterogeneity by linking the class allocation to socio-demographic indicators. This allows this model to move away from some of the monotonic interactions seen in the Logit and continuous Logit mixture models, such as a strict decreasing relationship between age and VTTS. Incorporating such patterns in the continuous Logit mixture models would require a parameterisation of the variance of the random distributions (cf. Greene et al., 2006), which however leads to additional difficulties in estimation. Secondly, and crucially in the context of the present paper, the analysis has shown how the heterogeneity in VTTS measures and the correlation between taste coefficients can be easily linked to socio-demographic characteristics in the Latent Class Logit model. Also, while both the continuous Logit mixture and Latent Class Logit models allow for uncertainty in the distribution of tastes for individual respondents, only the Latent Class Logit model allows for additional 19
20 variation in the correlation across respondents. The results in this paper provide an illustration of the potential benefits of Latent Class Logit models for applied research in the area of travel behaviour. It remains up to the analyst to make an informed choice between continuous Logit mixture and Latent Class Logit models on a case by case basis, but the Latent Class Logit model should at the very least be regarded as a viable alternative to the continuous Logit mixture model. Further work should be conducted using other datasets, especially with a view to the computation of elasticities. Finally, it is worth adding a brief note on the specification of latent class models. The form of the model we use here is particularly accessible as there are well-established estimation software programs to estimate such models. However, its major disadvantage is that it does not permit one to impose different a-priori restrictions on the specifications of the class membership models and on the class specific choice probabilities. Such restrictions would be needed if the latent classes were based on a priori behavioural hypotheses. For this reason, the model specification process we followed was exploratory in that we allowed the number of classes and the structure of the classes to be inferred from the data. A stronger case for the latent class model could be made by using a confirmatory approach in which the classes and their socio-economic covariates were based on behavioural theory. For an example of such confirmatory approach see Gopinath (1995), where the formulae presented in this paper would also apply to such a model. Further, the causal factors for the latent classes could include other latent factors (such as attitudes) that should be explicitly captured in the model specification. In order to develop such a complex model, additional measurement indicators would likely be needed resulting in a more complex model with multiple equations. For an example, see Walker and Ben-Akiva (2002). Nonetheless, the model as presented here does provide evidence for improved statistical fit, easier interpretation, and greater policy relevance. Acknowledgements This paper is partly based on work conducted during a stay as a visiting research scholar by the first author in the Department of Civil & Environmental Engineering at the Massachusetts Institute of Technology. The first author also acknowledges the support by the Leverhulme Trust, in the form of a Leverhulme Early Career Fellowship. 20
21 References Ben-Akiva, M., Lerman, S. R., Discrete Choice Analysis: Theory and Application to Travel Demand. MIT Press, Cambridge, MA. Burge, P., Rohr, C., DATIV: SP Design: Proposed approach for pilot survey. Tetra-Plan in cooperation with RAND Europe and Gallup A/S. Daly, A., Hess, S., Simple methods for panel data analysis. paper presented at the 12 th World Conference on Transport Research, Lisbon, Portugal. Daly, A., Hess, S., Train, K., Assuring finite moments for willingness to pay in random coefficient models. paper presented at the European Transport Conference, Noordwijkerhout. Fosgerau, M., Investigating the distribution of the value of travel time savings. Transportation Research Part B 40 (8), Gopinath, D., Modeling heterogeneity in discrete choice processes: Application to travel demand. Ph.D. thesis, MIT, Cambridge, MA. Greene, W. H., Hensher, D. A., A latent class model for discrete choice analysis: contrasts with mixed logit. Transportation Research Part B 37 (8), Greene, W. H., Hensher, D. A., Rose, J. M., Accounting for heterogeneity in the variance of unobserved effects in mixed logit models. Transportation Research Part B 40 (1), Hensher, D. A., Greene, W. H., The Mixed Logit Model: The State of Practice. Transportation 30 (2), Hess, S., Bierlaire, M., Polak, J. W., Estimation of value of travel-time savings using mixed logit models. Transportation Research Part A 39 (2-3), Hess, S., Rose, J. M., Allowing for intra-respondent variations in coefficients estimated on repeated choice data. Transportation Research Part B 43 (6), Hess, S., Rose, J. M., Bain, S., Random scale heterogeneity in discrete choice models. ITS working paper. Institute for Transport Studies, University of Leeds. 21
22 Hess, S., Rose, J. M., Polak, J. W., Non-trading, lexicographic and inconsistent behaviour in sp choice data. Transportation Research Part D 15 (7), McFadden, D., Train, K., Mixed MNL Models for discrete response. Journal of Applied Econometrics 15 (5), Revelt, D., Train, K., Mixed Logit with repeated choices: households choices of appliance efficiency level. Review of Economics and Statistics 80 (4), Shen, J., Latent class model or mixed logit model? a comparison by transport mode choice data. Applied Economics 41 (22), Train, K., Discrete Choice Methods with Simulation, second edition Edition. Cambridge University Press, Cambridge, MA. Walker, J., Extended discrete choice models: Integrated framework, flexible error structures, and latent variables. Ph.D. thesis, MIT, Cambridge, MA. Walker, J., Ben-Akiva, M., Generalized random utility model. Mathematical Social Sciences 43 (3), Walker, J., Li, J., Latent lifestyle preferences and household location decisions. Journal of Geographical Systems 9 (1),
Discrete Choice Analysis II
Discrete Choice Analysis II Moshe Ben-Akiva 1.201 / 11.545 / ESD.210 Transportation Systems Analysis: Demand & Economics Fall 2008 Review Last Lecture Introduction to Discrete Choice Analysis A simple
SYSTEMS OF REGRESSION EQUATIONS
SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
Comparing the Latent Class Model with the Random Parameters. Logit - A Choice Experiment analysis of highly heterogeneous
Comparing the Latent Class Model with the Random Parameters Logit - A Choice Experiment analysis of highly heterogeneous electricity consumers in Hyderabad, India Julian Sagebiel Department for Agricultural
Keep It Simple: Easy Ways To Estimate Choice Models For Single Consumers
Keep It Simple: Easy Ways To Estimate Choice Models For Single Consumers Christine Ebling, University of Technology Sydney, [email protected] Bart Frischknecht, University of Technology Sydney,
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
Yew May Martin Maureen Maclachlan Tom Karmel Higher Education Division, Department of Education, Training and Youth Affairs.
How is Australia s Higher Education Performing? An analysis of completion rates of a cohort of Australian Post Graduate Research Students in the 1990s. Yew May Martin Maureen Maclachlan Tom Karmel Higher
Marketing Mix Modelling and Big Data P. M Cain
1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored
STATISTICAL ANALYSIS OF UBC FACULTY SALARIES: INVESTIGATION OF
STATISTICAL ANALYSIS OF UBC FACULTY SALARIES: INVESTIGATION OF DIFFERENCES DUE TO SEX OR VISIBLE MINORITY STATUS. Oxana Marmer and Walter Sudmant, UBC Planning and Institutional Research SUMMARY This paper
Simple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
BINOMIAL OPTIONS PRICING MODEL. Mark Ioffe. Abstract
BINOMIAL OPTIONS PRICING MODEL Mark Ioffe Abstract Binomial option pricing model is a widespread numerical method of calculating price of American options. In terms of applied mathematics this is simple
How to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88)
Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88) Introduction The National Educational Longitudinal Survey (NELS:88) followed students from 8 th grade in 1988 to 10 th grade in
Multiple Choice Models II
Multiple Choice Models II Laura Magazzini University of Verona [email protected] http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Multiple Choice Models II 1 / 28 Categorical data Categorical
Multivariate Analysis of Ecological Data
Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology
Chapter 4: Vector Autoregressive Models
Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...
4. Work and retirement
4. Work and retirement James Banks Institute for Fiscal Studies and University College London María Casanova Institute for Fiscal Studies and University College London Amongst other things, the analysis
Leaving the parental home in Poland Kamil Sienkiewicz
Leaving the parental home in Poland Kamil Sienkiewicz Short abstract This study compares trends in the process of leaving parental home before and after the breakdown of the Communist regime in Poland.
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010
Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different
How To Understand And Solve A Linear Programming Problem
At the end of the lesson, you should be able to: Chapter 2: Systems of Linear Equations and Matrices: 2.1: Solutions of Linear Systems by the Echelon Method Define linear systems, unique solution, inconsistent,
Uncovering Consumer Decision Rules under Complex Dynamic Environments: The Case of Coalition Loyalty Programs
Uncovering Consumer Decision Rules under Complex Dynamic Environments: The Case of Coalition Loyalty Programs Andrew Ching Masakazu Ishihara Very Preliminary and Incomplete April 18, 2014 Abstract We propose
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
Calculating the Probability of Returning a Loan with Binary Probability Models
Calculating the Probability of Returning a Loan with Binary Probability Models Associate Professor PhD Julian VASILEV (e-mail: [email protected]) Varna University of Economics, Bulgaria ABSTRACT The
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
Clustering in the Linear Model
Short Guides to Microeconometrics Fall 2014 Kurt Schmidheiny Universität Basel Clustering in the Linear Model 2 1 Introduction Clustering in the Linear Model This handout extends the handout on The Multiple
Introduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
Simple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
The primary goal of this thesis was to understand how the spatial dependence of
5 General discussion 5.1 Introduction The primary goal of this thesis was to understand how the spatial dependence of consumer attitudes can be modeled, what additional benefits the recovering of spatial
Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1. Page 1 of 11. EduPristine CMA - Part I
Index Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1 EduPristine CMA - Part I Page 1 of 11 Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting
Chapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
The Impact of the Medicare Rural Hospital Flexibility Program on Patient Choice
The Impact of the Medicare Rural Hospital Flexibility Program on Patient Choice Gautam Gowrisankaran Claudio Lucarelli Philipp Schmidt-Dengler Robert Town January 24, 2011 Abstract This paper seeks to
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided
On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information
Finance 400 A. Penati - G. Pennacchi Notes on On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information by Sanford Grossman This model shows how the heterogeneous information
Schools Value-added Information System Technical Manual
Schools Value-added Information System Technical Manual Quality Assurance & School-based Support Division Education Bureau 2015 Contents Unit 1 Overview... 1 Unit 2 The Concept of VA... 2 Unit 3 Control
Markups and Firm-Level Export Status: Appendix
Markups and Firm-Level Export Status: Appendix De Loecker Jan - Warzynski Frederic Princeton University, NBER and CEPR - Aarhus School of Business Forthcoming American Economic Review Abstract This is
Department of Economics
Department of Economics On Testing for Diagonality of Large Dimensional Covariance Matrices George Kapetanios Working Paper No. 526 October 2004 ISSN 1473-0278 On Testing for Diagonality of Large Dimensional
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038
Earnings Announcement and Abnormal Return of S&P 500 Companies. Luke Qiu Washington University in St. Louis Economics Department Honors Thesis
Earnings Announcement and Abnormal Return of S&P 500 Companies Luke Qiu Washington University in St. Louis Economics Department Honors Thesis March 18, 2014 Abstract In this paper, I investigate the extent
Citi Volatility Balanced Beta (VIBE) Equity Eurozone Net Total Return Index Index Methodology. Citi Investment Strategies
Citi Volatility Balanced Beta (VIBE) Equity Eurozone Net Total Return Index Citi Investment Strategies 21 November 2011 Table of Contents Citi Investment Strategies Part A: Introduction 1 Part B: Key Information
The frequency of visiting a doctor: is the decision to go independent of the frequency?
Discussion Paper: 2009/04 The frequency of visiting a doctor: is the decision to go independent of the frequency? Hans van Ophem www.feb.uva.nl/ke/uva-econometrics Amsterdam School of Economics Department
Student Aid, Repayment Obligations and Enrolment into Higher Education in Germany Evidence from a Natural Experiment
Student Aid, Repayment Obligations and Enrolment into Higher Education in Germany Evidence from a Natural Experiment Hans J. Baumgartner *) Viktor Steiner **) *) DIW Berlin **) Free University of Berlin,
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Introduction to time series analysis
Introduction to time series analysis Margherita Gerolimetto November 3, 2010 1 What is a time series? A time series is a collection of observations ordered following a parameter that for us is time. Examples
NON-PROBABILITY SAMPLING TECHNIQUES
NON-PROBABILITY SAMPLING TECHNIQUES PRESENTED BY Name: WINNIE MUGERA Reg No: L50/62004/2013 RESEARCH METHODS LDP 603 UNIVERSITY OF NAIROBI Date: APRIL 2013 SAMPLING Sampling is the use of a subset of the
Fare Planning for Public Transport
Konrad-Zuse-Zentrum für Informationstechnik Berlin Takustraße 7 D-14195 Berlin-Dahlem Germany MARIKA NEUMANN Fare Planning for Public Transport Supported by the DFG Research Center Matheon Mathematics
COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
Income Distribution Database (http://oe.cd/idd)
Income Distribution Database (http://oe.cd/idd) TERMS OF REFERENCE OECD PROJECT ON THE DISTRIBUTION OF HOUSEHOLD INCOMES 2014/15 COLLECTION October 2014 The OECD income distribution questionnaire aims
Mortgage Loan Approvals and Government Intervention Policy
Mortgage Loan Approvals and Government Intervention Policy Dr. William Chow 18 March, 214 Executive Summary This paper introduces an empirical framework to explore the impact of the government s various
It is important to bear in mind that one of the first three subscripts is redundant since k = i -j +3.
IDENTIFICATION AND ESTIMATION OF AGE, PERIOD AND COHORT EFFECTS IN THE ANALYSIS OF DISCRETE ARCHIVAL DATA Stephen E. Fienberg, University of Minnesota William M. Mason, University of Michigan 1. INTRODUCTION
2 Voluntary retirement module specification
2 Voluntary retirement module specification As part of its research on Superannuation Policy for Post-Retirement the Commission has developed a model referred to as the Productivity Commission Retirement
The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.
The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables Kathleen M. Lang* Boston College and Peter Gottschalk Boston College Abstract We derive the efficiency loss
Credit Card Market Study Interim Report: Annex 4 Switching Analysis
MS14/6.2: Annex 4 Market Study Interim Report: Annex 4 November 2015 This annex describes data analysis we carried out to improve our understanding of switching and shopping around behaviour in the UK
A Primer on Forecasting Business Performance
A Primer on Forecasting Business Performance There are two common approaches to forecasting: qualitative and quantitative. Qualitative forecasting methods are important when historical data is not available.
RELEVANT TO ACCA QUALIFICATION PAPER P3. Studying Paper P3? Performance objectives 7, 8 and 9 are relevant to this exam
RELEVANT TO ACCA QUALIFICATION PAPER P3 Studying Paper P3? Performance objectives 7, 8 and 9 are relevant to this exam Business forecasting and strategic planning Quantitative data has always been supplied
STA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! [email protected]! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
Master of Mathematical Finance: Course Descriptions
Master of Mathematical Finance: Course Descriptions CS 522 Data Mining Computer Science This course provides continued exploration of data mining algorithms. More sophisticated algorithms such as support
LOGNORMAL MODEL FOR STOCK PRICES
LOGNORMAL MODEL FOR STOCK PRICES MICHAEL J. SHARPE MATHEMATICS DEPARTMENT, UCSD 1. INTRODUCTION What follows is a simple but important model that will be the basis for a later study of stock prices as
Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*
Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?* Jennjou Chen and Tsui-Fang Lin Abstract With the increasing popularity of information technology in higher education, it has
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL
FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL STATIsTICs 4 IV. RANDOm VECTORs 1. JOINTLY DIsTRIBUTED RANDOm VARIABLEs If are two rom variables defined on the same sample space we define the joint
Machine Learning and Pattern Recognition Logistic Regression
Machine Learning and Pattern Recognition Logistic Regression Course Lecturer:Amos J Storkey Institute for Adaptive and Neural Computation School of Informatics University of Edinburgh Crichton Street,
Life Cycle Asset Allocation A Suitable Approach for Defined Contribution Pension Plans
Life Cycle Asset Allocation A Suitable Approach for Defined Contribution Pension Plans Challenges for defined contribution plans While Eastern Europe is a prominent example of the importance of defined
PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA
PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA ABSTRACT The decision of whether to use PLS instead of a covariance
Statistical Rules of Thumb
Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN
Multiple regression - Matrices
Multiple regression - Matrices This handout will present various matrices which are substantively interesting and/or provide useful means of summarizing the data for analytical purposes. As we will see,
A Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
Chapter 6: The Information Function 129. CHAPTER 7 Test Calibration
Chapter 6: The Information Function 129 CHAPTER 7 Test Calibration 130 Chapter 7: Test Calibration CHAPTER 7 Test Calibration For didactic purposes, all of the preceding chapters have assumed that the
Introduction to Principal Components and FactorAnalysis
Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a
Module 4 - Multiple Logistic Regression
Module 4 - Multiple Logistic Regression Objectives Understand the principles and theory underlying logistic regression Understand proportions, probabilities, odds, odds ratios, logits and exponents Be
Longitudinal Meta-analysis
Quality & Quantity 38: 381 389, 2004. 2004 Kluwer Academic Publishers. Printed in the Netherlands. 381 Longitudinal Meta-analysis CORA J. M. MAAS, JOOP J. HOX and GERTY J. L. M. LENSVELT-MULDERS Department
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Residential Demand for Access to the Internet 1
Residential Demand for Access to the Internet 1 Paul Rappoport, Temple University Donald J. Kridel, University of Missouri at St. Louis Lester D. Taylor, University of Arizona James Alleman, University
A Simple Model of Price Dispersion *
Federal Reserve Bank of Dallas Globalization and Monetary Policy Institute Working Paper No. 112 http://www.dallasfed.org/assets/documents/institute/wpapers/2012/0112.pdf A Simple Model of Price Dispersion
GLM I An Introduction to Generalized Linear Models
GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial
Multivariate Analysis. Overview
Multivariate Analysis Overview Introduction Multivariate thinking Body of thought processes that illuminate the interrelatedness between and within sets of variables. The essence of multivariate thinking
Labour Income Dynamics and the Insurance from Taxes, Transfers, and the Family
Labour Income Dynamics and the Insurance from Taxes, Transfers, and the Family Richard Blundell 1 Michael Graber 1,2 Magne Mogstad 1,2 1 University College London and Institute for Fiscal Studies 2 Statistics
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
The Probit Link Function in Generalized Linear Models for Data Mining Applications
Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications
Statistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
Regression III: Advanced Methods
Lecture 4: Transformations Regression III: Advanced Methods William G. Jacoby Michigan State University Goals of the lecture The Ladder of Roots and Powers Changing the shape of distributions Transforming
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Stochastic Analysis of Long-Term Multiple-Decrement Contracts
Stochastic Analysis of Long-Term Multiple-Decrement Contracts Matthew Clark, FSA, MAAA, and Chad Runchey, FSA, MAAA Ernst & Young LLP Published in the July 2008 issue of the Actuarial Practice Forum Copyright
