Comparing the Latent Class Model with the Random Parameters Logit - A Choice Experiment analysis of highly heterogeneous electricity consumers in Hyderabad, India Julian Sagebiel Department for Agricultural Economics, Humboldt-Universität zu Berlin julian.sagebiel@hu-berlin.de Abstract The increased application of the Stated Choice methods led researchers to develop several econometric models that relax the strict assumptions of the frequently applied Conditional Logit model. Especially the question of how to incorporate preference heterogeneity into the analysis is subject to current research. This paper contributes to the discussion by comparing two of the most commonly used models, the Latent Class Logit model and the Random Parameters Logit model. Both models have in common that they introduce heterogeneity in the systematic part of utility but differ in their assumptions of the distribution of preferences. For comparison, data from a choice experiment on electricity quality in India will be analyzed. Thereby, measures of fit, willingness to pay values and choice probabilities of both models will be contrasted. Apart from the statistical comparison, I discuss further issues that contribute to an adequate choice of the model.
Contents 1 Introduction 2 2 Choice Models 4 2.1 The Random Parameters Logit Model.............. 5 2.2 The Latent Class Model..................... 6 3 Data 8 4 Results 9 4.1 Random Parameters Logit Estimation............. 12 4.2 Latent Class Estimation..................... 15 5 Model Comparison 15 5.1 Measures of Fit.......................... 19 5.2 Conditional WTP Values..................... 19 5.3 Choice Probabilities........................ 21 6 Discussion 24 7 Conclusion 29 1
1 Introduction The steady increase in applications of discrete choice models has led to a variety of econometric estimation techniques. The conditional logit model (CL), made popular by [McFadden, 1974], provides an easy-to-handle estimation process but is limited to several assumptions. While the CL can model heterogeneity by incorporating interaction terms of case-specific variables with alternative-specific constants or alternative-specific variables (attributes), other models do the same in a more sophisticated way. Worth mentioning are the random parameters logit model (RPL) and the latent class model (LCM). The RPL is characterized by accommodating heterogeneity as a continuous function of the parameters i.e. the parameters are random underlying some ex-ante specified distribution. In contrast, the LCM can be interpreted as a semi-parametric version of the RPL, which derives heterogeneity from different classes, each having its own parameters. As [Greene and Hensher, 2003] point out, the RPL is more flexible as it can induce nearly any behavioral assumption in terms of preference distribution, while the LCM benefits from its semi-parametric structure which does not require any assumption on the distribution of the parameters. The literature provides rather ambiguous results from a comparison of the models. It is hardly found that one model clearly outperforms the other by statistical performance. In this paper, I will briefly discuss the two models and compare results from choice experiment (CE) data generated to elicit willingness to 2
pay (WTP) measures for several attributes of electricity quality. The data is derived from a household survey conducted in February 2010 in Hyderabad, India. It covers 800 private households which are stratified into slum, middle class and high class. While income and electricity consumption as well as lifestyles vary significantly over the classes, household characteristics appear homogeneous within the classes as often witnessed in India. From this observation, my interest in the LCM arises. Regardless of the statistical fit, the LCM appears to describe the context better than the RPL. This is, the LCM could model three classes which are hypothesized to differ significantly in preferences but have homogeneous within-class preferences. If the above mentioned assumption is true, a three class model is most suitable. It remains the question whether this context driven approach is supported by the statistical performance. The statistical comparative analysis will be executed adjacent to previous studies, where the RPL was contrasted to the LCM. The model selection criteria include kernel density plots and ordinary least square regressions of individual WTP values and choice probabilities of the RPL against the LCM, simple comparisons of measures of fit and finally statistical testing with methods for non-nested models. In case both models perform rather similarly, I will argue that model selection should depend on the context and the aims of the researcher. My impression up to now is that the statistical performance of models is frequently overstated in the literature while the theoretical implications of the different models and its consequences on behavioral assumptions play a minor role when arguing in 3
favor or against a model. The paper is structured as follows. Section 2 summarizes the LCM and RPL. Section 3 details the survey and section 4 presents the estimation results. Section 5 is concerned with the statistical comparison of the two models while section 6 discusses issues that go beyond the statistics. Section 7 concludes. 2 Choice Models This section describes the two models at stake, the RPL and the LCM. The models are similar in a way that they both incorporate heterogeneity in respondents preferences on attributes. While the RPL assumes a continuous distribution of the parameters to introduce heterogeneity, the LC uses discrete classes to reach the same. In a sense, the LC is a special case of the RPL with parameters being distributed discretely and hence can be referred to as a semi-parametric sister of the RPL. The analysis throughout the paper will be based on some conventions and definitions which will be explained in the following. We assume a randomly selected individual i which chooses repeatedly in t situations between several alternatives n. Each alternative accommodates attributes k with levels A kn which vary over alternatives. For simplicity we assume indirect utility functions U in for each alternative n and individual i to be linear with respect to attribute levels A kn and price p. For each alternative there are utility sensitive elements e int that cannot be 4
observed by the researcher but are known to the individual. This very simple formulation can be written as U in = V in + e il = β 1 A i1n + β 2 A i2n +... + β k A ikn + e in (1) where A ikn is the level of attribute k for alternative n and β k the corresponding utility coefficient. In a CE, the CL probability for individual i to choose alternative m is given as P r im = exp(β 1A i1m + β 2 A i2m +... + β k A ikm ) N n=1 exp(β 1A i1n + β 2 A i2n +... + β k A ikn ) (2) In the following I will expand the CL specification to derive the RPL and the LCM. 2.1 The Random Parameters Logit Model The RPL is characterized by randomness in parameters. In the CL the parameters are fixed and take the same value for all respondents. The RPL specification introduces a random component in the parameters such as β ik = β k + η ki (3) where η ki is an error term with distribution f(η ki ) and mean 0 and variance φ 2. Hence β ik is a random variable with distribution f( β ik ) and mean 5
β k. The distribution function can be chosen by the researcher without further limitations 1. The unconditional PRL choice probability is then given as a weighted average of all possible β ik for the attribute parameters that are considered random. P r im =... P r im β i1 = β i2 = β ik = f( β i1 )f( β i2 )...f( β ik )d( β i1 )d( β i2 )...d( β ik ) (4) with P r im = exp( β i1 A i1m + β i2 A i2m +... + β ik A ikm ) N n=1 exp( β i1 A i1n + β i2 A i2n +... + β ik A ikn ) (5) The multidimensional Integral does not have a closed form, so that the probability can only be achieved with simulation. Most commonly, the maximum simulated likelihood method is applied. In this specification, each decision maker has his own parameters i.e. each decision maker has different preferences. 2.2 The Latent Class Model The LCM can be regarded as a special case of the RPL with β k taking a finite number S of values < β k 1, β k 2,..., β k s > with corresponding probabilities 1 The most common distribution functions are the normal, log normal and triangular. In case that f( β ik ) = 1 for β ik = b and f( β ik ) = 0 β ik b the model reduces to the CL. 6
< h 1, h 2,..., h s >. The unconditional probability to choose alternative m is the weighted average of the s β k s parameters. S P r im = h s P r im s (6) s=1 with P r im s being the CL probability to choose alternative m when belonging to class s. P r im s = exp(β i1 sa i1m + β i2 s A i2m +... + β ik s A ikm ) N n=1 exp(β i1 sa i1n + β i2 s A i2n +... + β ik s A ikn ) (7) h s are unknown but can be estimated with a multinomial logit model. h s = exp(ζ s X i ) S s=1 exp(ζ sx i ) (8) where X i is a vector of case-specific variables like income, age, or attitudes that have an effect on the class probability and ζ s is the corresponding parameter vector for class s. The vector X i can comprise only a constant if case-specific variables are not available or do not explain the class probability. In this case, the heterogeneity is unobserved. The number of classes can be chosen by the researcher but one has to keep in mind that the class probabilities are subject to a statistical procedure rather than behavioral assumptions. To identify the optimal number of classes statistically, measures of fit like CAIC or BIC are commonly used. However, one should also make sure that the parameters of the classes are valid in a behavioral sense. 7
3 Data The data I will use in the following analysis was generated within the research project Sustainable Hyderabad in January 2010 in Hyderabad, India. 2 The sample covers 798 private households from the Greater Hyderabad area, stratified by class (slum, middle class, high class). The study purpose was to investigate consumer behavior and consumption patterns of electricity use. The CE consisted of 27 choice sets and each choice set includes five attributes with three levels each as summarized in table 1. The selection of attributes and levels was based on expert interviews, focus group discussions, a preliminary WTP study [Hanisch et al., 2010] and pretests. From the 3 5 possible alternatives an orthogonal array with 54 alternatives was constructed. The study was blocked into three surveys, so that each respondent had to answer nine choice sets with each consisting of two alternatives. The attributes scheduled power cuts and unscheduled power cuts refer to the duration of daily power cuts, which are either pre-announced in newspapers and television or appear suddenly without warning. The latter ones are expected to be more severe, as people are not able to adjust to it beforehand. The worst case scenario means a total duration of one hour power cuts per day, which is not uncommon during summer. The share of renewable energy in the electricity mix, the third attribute, is currently two percent in Hyderabad, however the local electricity regulator specified the minimum percentage to five percent. 2 more details on the survey are found in [Rommel et al., 2010] 8
According to the expert interviews, the ten percent scenario is the maximum one can expect in the nearest future. The last attribute refers to the organizational form of the supply company. The current supplier of electricity in Hyderabad is the state owned Andhra Pradesh Central Power Distribution Company Limited, which has a monopoly on distribution. A possible reform of the power sector can lead to more competition and private companies entering the market. Another concept, which exists already in four districts in Andhra Pradesh, are cooperatively organized distribution companies, where the customer is member of the company. Some examples showed better service quality and more satisfied end users. These three options, Government (status quo), private distribution company and cooperative society, are included as dummy variables in the choice sets. Finally, the cost attribute was chosen in a way that it will not overburden the financial capacity of the respondents. As 50 percent of the respondents are slum dwellers, a bigger increase than 20 percent to the current electricity bill could be infeasible and hence could decrease the accuracy of the estimation. 4 Results In the following estimations, case-specific variables were not included in order to facilitate the understanding and interpretation of the model. The estimation was carried out with NLOGIT 4.0 and STATA 10.1. All models have the same utility specification and variables, while they differ by definition in 9
Table 1: Attribute Description Variable Description Levels Code 30 minutes 30 Scheduled power cuts SCHED 15 minutes 15 per day in minutes 0 minutes 0 UNSCH REN PRIV COOP COST Unscheduled power cuts per day in minutes Percentage of renewable energy in the electricity mix Whether supply is carried out by private company Whether supply is carried out by cooperative society Additional cost to monthly electricity bill in per cent 30 minutes 30 15 minutes 15 0 minutes 0 2% 2 5% 5 10% 10 Private company 1 COOP or Government 0 Cooperative 1 PRIV or Government 0 no additional costs 0 10% 1 20% 2 10
the distribution of parameters. 51 respondents were dropped as these were considered to be irrational, not picking a dominant choice set 3. The exclusion improved the estimation significantly. In a first step I estimated the simple CL specification, from which the results were used as starting values for the other models. The results are given in table 2. Table 2: Estimation results : Conditional Logit Variable Coefficient (Std. Err.) SCHED -0.027 (0.002) UNSCH -0.010 (0.002) REN 0.018 (0.006) PRIV 0.005 (0.053) COOP -0.160 (0.048) COST -8.324 (0.249) N 6723 Log-likelihood -3855.245 Log-likelihood (NULL) -4660.028 Pseudo R 2 16.54% Significance levels : : 10% : 5% : 1% All parameters are significant on a one percent significance level and the two dummy variables are jointly significant on a one percent significance level (Wald test, likelihood ratio test). All signs are as expected and the overall model is highly significant. An increase in scheduled and unscheduled power cuts and in costs reduce the probability of the alternative to be chosen. An increase in renewable energy increases the probability. There is 3 One choice set, where one alternative was dominating the other one was included. Respondents who picked the dominated choice set are likely to not having understood the CE and hence were dropped. 11
little difference of preferences between a governmental supplier and a private company but the probability to choose an alternative decreases when the organizational form is switched to a cooperative. 4.1 Random Parameters Logit Estimation The RPL model was estimated with all attributes being randomly and normally distributed. Although a normal distribution allows for positive and negative values, which may be misleading for costs and power cuts, there are several reasons to choose it. First, the normal distribution has been widely used and comprises some convenience features. Second, in case there are high parameter values, the probability that a value is on the wrong side is very low. Hence, the normal distribution can still be a good approximation [Meijer and Rouwendal, 2006, Sillano and de Dios Ortand, 2005]. Third, as the data were collected in a developing country and illiterate respondents made up some percentage of the observation, it is likely that due to limited understanding, the choices were made in an irrational way. Yet it is not possible to identify these wrong choices, it is likely that some respondents actually have positive parameters for cost and power cuts. At least the data says so. Hence, a wrong sign is a problem of data collection rather than of the statistical and behavioral assumptions. Fourth, as this paper aims to compare the RPL with the LCM, theoretical assumptions on the sign of the parameter do not play a major role. The LCM is also not restricted to one sided parameters, so why should the RPL 12
be? Fifth, after estimating several models with different parameter distributions, the model with all parameters being normally distributed gives the best fit. Sixth, using different distributions that force the parameter to have a positive sign only lead to further difficulties with interpretation and estimation. E.g. the log-normal distribution has a long thick tail, and the corresponding log-likelihood function tends to be extremely flat at its maximum [Sillano and de Dios Ortand, 2005]. Another unusual construction applied here is to allow the cost parameter to be random. In most studies, the cost parameter is nonrandom (e.g. [Morrison and Nalder, 2009, Carlsson and Martinsson, 2008, Revelt and Train, 1998] for applications in the electricity sector) due to several reasons (cf section 6). However, as a randomly distributed cost parameter increases the model fit significantly (likelihood ratio test), it seems inappropriate to follow this convenience assumption. Further there is no theoretical argument, why the cost parameter should be nonrandom. [Meijer and Rouwendal, 2006] (p.242) argue Treating the coefficient of the monetary variable as a fixed constant [...] gives markedly different distributions of the [WTP] and cannot be recommended. Lastly, we allow for correlation along the random parameters. This is also justified by a high improvement of the log likelihood at convergence. Further, this assumption can be very reasonable. For example might someone who has problems with scheduled power cuts also have problems with unscheduled power cuts. We used 1000 Halton draws for the simulation of the random parameters and 13
maximized the simulated log likelihood function using the BHHH Estimator. Our estimation implies similar results as the CL 4 but significant standard deviations of the parameters suggest heterogeneity in preferences. A likelihood ratio test rejected the null hypothesis that the CL and the RPL are the same. Table 3 gives the results of the RPL. Table 3: Estimation results : RPL Variable Coefficient (Std. Err.) SCHED -0.0509 (0.004) UNSCH -0.019 (0.003) REN 0.075 (0.012) PRIV -0.113 (0.109) COOP -0.346 (0.100) COST -16.467 (0.807) Standard deviations of the random parameters SCHED SD 0.047 (0.005) UNSCH SD 0.023 (0.005) REN SD 0.103 (0.020) PRIV SD 1.016 (0.182) COOP SD 0.801 (0.331) COST SD 15.789 (1.347) N 6723 Log-likelihood -3473.296 Log-likelihood (NULL) -4660.028 Pseudo R 2 25.466% Significance levels : : 10% : 5% : 1% 4 It is not by chance that the parameters in the RPL are larger than in the CL. [Sillano and de Dios Ortand, 2005] explain the confounding of the scale parameter, which is smaller in the RPL as parts of the random variation are incorporated in the systematic part of the utility function V in. 14
4.2 Latent Class Estimation The LC model has a reasonable fit with three classes. From these classes, two turned out to be dominating and the other one can be considered as a small outlier group with a class probability of less than five percent. The results of the LC model are given in table 4. The overall model is highly significant. Only REN is not significant in Class 3 which might reflect the ignorance towards renewable energy in the population. All parameters have the expected sign and are significant at least at a 10 percent significance level. The results imply the virtue of the model. While in Class 1 and 2 the parameters for PRIV and COOP are negative, they are positive in Class 3 suggesting antipodal preferences. Some respondents are highly in favor of the status quo (Governmental distribution company) and others prefer a reformation of the power sector with private of cooperative distributors. These antipodal differences can lead to insignificant parameters in simpler models like the CL. Averaging would not make sense in this case. Also a RPL struggles with these kind of preferences. A normally distributed parameter for example has the highest probability at its mean but with antipodal preferences the mean probability would be very low. 5 Model Comparison Comparing two statistical models is usually a task to be performed in the very initial stage of statistical analysis. The researcher aims to find the best 15
Table 4: Estimation results : LCM Variable Class 1 Class 2 Class 3-0.024 SCHED -0.075-0.016 (0.075) (0.004) (0.002) UNSCH REN PRIV COOP COST PROB -0.145-0.034-0.009 (0.068) (0.004) (0.002) 1.086 0.163 0.001 (0.322) (0.146) (0.006) -3.790-0.651 0.204 (1.540) (0.141) (0.048) -6.687-0.577 0.076 (2.217) (0.136) (0.043) -40.628-36.263-2.388 (12.654) (1.265) (0.202) 0.061 0.454 0.484 (0.026) (0.022) (0.021) N 6723 Log-likelihood -3433.515 Log-likelihood (NULL) -4660.028 Pseudo R 2 26.32% Significance levels : : 10% : 5% : 1% Standard errors in parentheses 16
statistical fit for his data, hence tries out different models and specifications, chooses one based on certain criteria and only then begins interpreting the estimation results. With discrete choice analysis, several models have been developed which each compete with each other. The more sophisticated models like the above described ones introduce behavioral assumptions that go beyond the distribution of the error term. These assumptions are meaningful to describe human behavior but mostly, as e.g. [Hensher and Greene, 2003] mention, there is no theoretical foundation for choosing any of the available distributions. As discussed in Subsection 4.1, there are some behavioral criteria to choose a certain distribution, i.e. a log-normal instead of a normal at a monetary attribute implies that subjects are rather willing to buy the same product at a lower price. However, why choosing a normal instead of any other similar distribution remains unclear but might have an impact on model performance. The same argument is true when comparing the LCM with the RPL. In the LCM we assume discrete distributions of the parameters, but do we have any reason based on economic theory for it? Basically no. Still there might be different reasons to choose a model with heterogeneity apart from statistical fit and basic assumptions from economic theory. Several authors addressed this concern before. Proposing different ways to compare the RPL with the LCM, results are ambiguous. [Greene and Hensher, 2003] analyzed WTP values and choice probabilities in detail and find small support for the LCM. The same conclusion is drawn by [Birol et al., 2006], who argue that apart from a better performance, the LCM is superior for welfare measures 17
and interpretation. [Colombo et al., 2009] contrasts three models, the RPL, LCM and the Covariance-Heterogeneity model. Their contribution focuses on the sources of heterogeneity. Relying on statistical tests and welfare analysis, they find a small dominance of the LCM. [Provencher and Bishop, 2004] find no model dominating but surprisingly high correct predictions of the CL. [Hynes et al., 2008] report similar results of the LCM and RPL in terms of welfare estimates but finally promote the LCM model as the more informative one. [Torres et al., 2011] compare the models using Monte Carlo simulations with incorporated heterogeneity. They simulate preference heterogeneity based on a RPL and apply the data to a LCM and vice versa. Their findings imply that in case the RPL is the true model, the errors by using a LCM are rather small. In the opposite case the errors are becoming larger. Overall, they find the performance of the RPL best. Summarized, most studies find a small dominance of the LCM, but no author argues strongly in favor of the LCM compared the the RPL. This chapter will compare the models from a statistical perspective. First, measures of fit of both models will be contrasted. Then graphical comparisons of individual WTP values and choice probabilities are presented. In the next chapter, based on the results of the statistical analysis, I will discuss the rationale to decide for a model beyond the scope of statistical analysis. 18
5.1 Measures of Fit Table 5 contrasts measures of fit of the three models. Likelihood ratio tests show that the CL is outperformed by the other two models. Testing the LCM against the RPL requires a test for non-nested models. I used a test proposed by [Ben-Akiva and Swait, 1986]. The result suggests that the LCM is significantly better than the RPL in terms of the log likelihood given a p- value that the likelihoods of both models are the same of 0.000. The AIC, BIC, and R 2 values indicate a small dominance of the LCM. All values are slightly better in the LCM. Also the correct predictions are highest in the LCM. Table 5: Measures of Fit Measure CL RPL LCM Log Likelihood -3855.245-3473.296-3433.515 McFadden R 2 0.165 0.254 0.263 Adj. McFadden R 2 0.162 0.252 0.261 AIC 1.149 1.041 1.027 BIC 1.155 1.069 1.048 Chi Squared 1527.76 2373.464 2453.027 Correct Predictions 0.612 0.675 0.785 Parameters 6 27 20 5.2 Conditional WTP Values In this section I will compare the conditional WTP values of both models. These are calculated as described in [Greene, 2007] (p.n17-36). The conditional mean incorporates all information of one individual gathered in the 19
CE including his choices. Still the inclusion of case-specific variables are neglected for simplicity. To get an overview of the results table 6 gives the mean and the standard deviations of the conditional WTP values in percent for the LCM and RPL and the unconditional WTP values of the CL. The latter ones are simply calculated as the ratio of the attribute at stake and the cost attribute β i β cost. The WTP is the marginal rate of substitution between an attribute and the cost attribute i.e. the WTP value gives necessary compensation in monetary terms for a one unit deterioration of an attribute to remain the same level of utility. In this case the WTP is in percent additional to the electricity bill. For example, an increase in scheduled power cuts (SCHED) by one minute has be be compensated by a decrease of the monthly electricity bill by 0.324 percent according to the CL and 0.418 according the the LCM. The conditional WTP values are calculated in the same manner but based on the individual parameter estimates. In the RPL, these estimates can lead to huge WTP values in case the individual cost parameter is small, while the attribute parameter is large. This is the reason why the standard deviations of the RPL WTP values is significantly higher than in the LCM. Comparing the values of the CL with the RPL or LCM is not meaningful as the values given for the RPL and LCM are conditional values based on individual estimates. These are not necessarily an unbiased estimator of the WTP as explained by [Greene, 2007]. Still it is useful to compare the values of the LCM and the RPL. 20
Table 6: Conditional WTP Values Variable CL RPL LCM Mean SD Mean SD SCHED -0.324-0.185 0.760-0.418 0.211 UNSCH -0.120-0.071 0.426-0.174 0.194 REN 0.216 0.384 1.031 0.446 0.623 PRIV 0.060 7.774 46.435 1.700 5.509 COOP -1.922 3.294 30.132-0.827 4.690 As expected the RPL shows a higher standard deviation than the LCM. Further, The mean values of the RPL are smaller than in the LCM except for PRIV and COOP. The latter ones are surprisingly high, which can be due to the antipodal preferences as explained in subsection 4.2. To get more insights on the distributions of the WTP values, kernel density estimates of the individual WTP values of the RPL and LCM are plotted in figure 1. It seems that the distributions peak quite similar, but the RPL values are more spread out. This reflects the high standard deviation. A closer look at the LCM distributions reveals different peaks. The RPL values however follow a rather normally shaped distribution. Both these shapes are implied by the model and highlight the differences between them. 5.3 Choice Probabilities Another way of comparison is contrasting the choice probabilities of the two models, i.e. the estimates on y for each individual based on the utility parameters of each model. As we have generic alternatives, choice probabilities 21
Density 0 100 200 300 Scheduled Power Cuts Density 0 100 200 300 400 Unscheduled Power Cuts.08.06.04.02 0.02 WTPSCHEDRPL kernel = epanechnikov, bandwidth = 0.0003 WTPSCHEDRPL WTPSCHEDLCM (a) Scheduled Power Cuts.06.04.02 0.02 WTP kernel = epanechnikov, bandwidth = 0.0002 WTPUNSCHRPL WTPUNSCHLCM (b) Unscheduled Power Cuts Density 0 50 100 150 200 Renewable Energy Density 0 5 10 15 Private Distributor.1.05 0.05.1 WTP kernel = epanechnikov, bandwidth = 0.0006 WTPRENRPL WTPRENLCM (c) Renewable Energy 8 6 4 2 0 WTP kernel = epanechnikov, bandwidth = 0.0136 WTPPRIVRPL WTPPRIVLCM (d) Private Distributor Cooperative Distributor Density 0 5 10 15 20 6 4 2 0 WTP kernel = epanechnikov, bandwidth = 0.0070 WTPCOOPRPL WTPCOOPLCM (e) Cooperative Distributor Figure 1: Kernel Density Estimates for WTP 22
do not have much meaning per se. 5 Table 7 gives the results of a linear regression of the choice probabilities of the RPL on the LCM and figure 2 and plots the choice probabilities of the two models. Table 7: OLS Regression Choice Probabilities Variable Coefficient (Std. Err.) LCMCP 0.597 (0.004) Intercept 0.214 (0.002) N 6723 R 2 0.802 F (1,6721) 27164.794 Significance levels : : 10% : 5% : 1% The regression suggests that the choice probabilities of the LCM and of the RPL are highly correlated. In fact, the variation of the choice probabilities of the LCM explain the variation of choice probabilities of the PRL by 80.2 per cent. [Greene and Hensher, 2003] perform the same analysis and found lower R 2 values for their data. While they conclude that each model is representing the choice responses quite differently for the majority of the sample (p.695), the results presented here indicate rather similar choice probabilities. This is a sign for the similarity of the models. This observation contrasts to our previous analysis. To analyzing the choice probabilities a bit further, figure 3 displays kernel density functions of the choice probabilities. For each choice probability, the graph shows the corresponding density. For example, in the RPL most 5 Only the choice probabilities for alternative 2 are presented. The choice probabilities of alternative 1 are simply P r 1nt = 1 P r 2nt. 23
Comparison of Choice Probabilities RPLCP.2.4.6.8 1 0.2.4.6.8 1 LCMCP Figure 2: Comparison of Choice Probabilities choice probabilities are around 0.8 while in the LCM, it is at about 0.95. The same pattern is true for the choice probabilities below 0.5. This observation indicates that choice probabilities of the LCM are more at the extremes, i.e. closer to zero or closer to one than in the RPL. This result is consistent with the findings from [Greene and Hensher, 2003]. 6 Discussion With the here applied data the LCM seems to have slight statistical advantages over the RPL, however there might be other reasons for choosing the LCM. In a way the choice of the model should depend on the purpose of research and behavioral assumptions. If, for example, it is assumed that each 24
Kernel Density Choice Probability Density 0 1 2 3 4 0.2.4.6.8 1 kernel = epanechnikov, bandwidth = 0.0545 LCMCP RPLCP Figure 3: Kernel Choice Probabilities individuals have each different preferences, which, as an example, could be the case with cultural goods, a RPL might be the appropriate model. In case the researcher assumes antipodal preferences of different groups that are homogeneous within, the structure of the LCM satisfies the context. For instance, the purpose of a study could be eliciting preferences for a policy scheme for wage rates. One could expect that members of a labor union will have rather similar preferences as a group and employers might have opposite preferences which are also homogeneous within the group. However, in many cases, these assumptions are not very robust and the researcher cannot draw any observations on heterogeneity a priori. This survey exemplifies the point. The study was performed in a developing country and lower class people show very similar living standards, habits, behavior, and struggles. 25
Though maybe differently affected by power cuts, the purpose of electricity is mainly similar - for cooling and lighting. Thus, it makes sense to find general conclusions for this group. This observation, I call within-class homogeneity. In the same country, middle class and high class people are expected to behave very differently. Electricity is used for a large number of different appliances. It is used to watch TV, listen to music, cleaning, and a variety of hobbies. Hence, these people are expected to show different preferences for better quality within their class. I call this preference distribution within-class heterogeneity. In case the researcher expects homogeneity over the whole sample (e.g. he samples only slum inhabitants), he can stick with the CL. When different groups within the sample are expected, which show within class homogeneity, the LCM should be the model of choice. Expecting overall heterogeneity leads to the RPL and and if we have within-class heterogeneity the decision is not clear. Here, the striking point is the assumption whether the differences in preferences are antipodal between the classes. If no, one can run the RPL but if yes, something like a RPL-LCM combination is needed. The latter one means that one has to account for the fact that some random parameter values are not present, at least there is no overall continuity. For example, the LCM shows bipolar preferences for the attribute representing the distribution company. While there are certain types of people preferring privatization, others are strongly against it. Only a few people are indifferent. These gaps in the distribution are well represented in figure 1 d and e. 26
Apart from the assumptions on heterogeneity, there are other advantages and disadvantages of both models. The RPL is more complex and the estimation process demands computational time and deep model understanding. Further, the flexibility of the RPL is not only a virtue but can also turn out to be a struggle for the researcher. Deciding on which parameters to be random and which distribution might be appropriate is a demanding task and hardly any model specification will show a clear dominance. Choosing the number of classes in the LCM is also challenging in a way. Mostly, the researcher has no prior information on the appropriate number of classes and relies on measures of fit in the selection. Still, the effort to identify the optimal model specification is by far less than in the RPL. Model interpretation might be easier with the LCM. [Hensher and Greene, 2003] warn that in the RPL, mean parameters are not to be interpreted as in the CL especially in more confounded specifications with correlation among attributes and alternatives. Calculating marginal effects and WTP values often requires thorough investigation of all correlations and a laborious dismantling of the Cholesky matrix. Making the RPL more operational, researchers often base their behavioral assumptions on technical feasibility. A frequently observed example is a constant cost parameter. [Revelt and Train, 1998] (p.650) provide technical reasons: We specify the price coefficient to be fixed while allowing the other coefficients to vary. The willingness to pay for each attribute is thereby distributed in the same way as the attribute s coefficient, 27
which is convenient for interpretation of the model. Apparently, several authors follow their recommendation by citing their paper or not even giving any reason. Yet the question remains on what behavioral or theoretical basis this assumptions is formulated. [Meijer and Rouwendal, 2000] (p.12) put is as: [Keeping the coefficient of the monetary variable nonrandom] is, however, not very satisfactory. A priori it appears at least as likely that the coefficients in the utility function that refer to the monetary variable are random variables, as that those referring to any other variable are. All these considerations are not necessary with the LCM, which is clearly an advantage. Apart from these technical challenges, the LCM is more straightforward in interpretation. Arguing with classes instead of distributions of the population gives more scope for policy recommendations. Often CEs are conducted to inform policy makers. A clustering approach as in the LCM is easier to understand and leads to more straight forward results. It also allows the researcher to give names to the classes and segment the population into multidimensional interest groups (see e.g. [Meyerhoff et al., 2010]). Conclusively, I suggest - neglecting any assumptions on heterogeneity - that the LCM is a model that should be used for demonstrative purposes. In case a broad audience with limited statistical and economic background or policy 28
makers are the target group, the LCM is clearly more accessible and coherent. If a deeper analysis is required and the research purpose aims also towards methodological issues, the RPL is a fruitful challenge. Statistic lovers have more leeway and are free to experiment with new specifications. In fact, the RPL s flexibility is astonishing and further research necessary. The most recent advances tend to semi parametric distributions of the random parameters. [Fosgerau and Hess, 2008] and [Rouwendal et al., 2010] propose methods where no a priori assumptions on the distribution of random parameters is necessary. 7 Conclusion In this paper, I aimed to compare two models for discrete choice that incorporate heterogeneity in preferences, the RPL and the LCM. I discussed both models and pointed out their characteristics. The data used for the analysis came from a CE survey in Hyderabad, India and investigated consumer preferences for better electricity quality. The data is appropriate as it covers highly heterogeneous respondents ranging from very poor and traditional to rich and modern. A statistical comparison of the models was carried out using measures of fit, a test for non-linear and non-nested models, conditional WTP values and choice probabilities. The LCM shows some advantages in terms of statistical fitness and a smaller variance in conditional WTP values, yet it is not possible to derive a clear statement on which model performs 29
better. I finally discussed the two models taking into account further criteria than statistical success and provided some suggestions on when to use which model. The choice of the model depends, first, on the a priori assumptions on the kind of heterogeneity and, second, on the purpose of the study. However, it remains to the finger tip feel of the researcher to decide on a model. 30
References [Ben-Akiva and Swait, 1986] Ben-Akiva, M. and Swait, J. (1986). The akaike likelihood ratio index. Transportation Science, 20(2):133 136. [Birol et al., 2006] Birol, E., Karousakis, K., and Koundouri, P. (2006). Using a choice experiment to account for preference heterogeneity in wetland attributes: The case of cheimaditida wetland in greece. Ecological Economics, 60(1):145 156. [Carlsson and Martinsson, 2008] Carlsson, F. and Martinsson, P. (2008). Does it matter when a power outage occurs? a choice experiment study on the willingness to pay to avoid power outages. Energy Economics, 30(3):1232 1245. [Colombo et al., 2009] Colombo, S., Hanley, N., and Louviere, J. J. (2009). Modeling preference heterogeneity in stated choice data: an analysis for public goods generated by agriculture. Agricultural Economics, 40(3):307 322. [Fosgerau and Hess, 2008] Fosgerau, M. and Hess, S. (2008). Competing methods for representing random taste heterogeneity in discrete choice models. Mpra paper, University Library of Munich, Germany. [Greene, 2007] Greene, W. H. (2007). NLogit Version 4.0: Reference Guide. Econometric Software INC., Plainview, 1 edition. 31
[Greene and Hensher, 2003] Greene, W. H. and Hensher, D. A. (2003). A latent class model for discrete choice analysis: contrasts with mixed logit. Transportation Research Part B: Methodological, 37(8):681 698. [Hanisch et al., 2010] Hanisch, M., Kimmich, C., Rommel, J., and Sagebiel, J. (2010). Coping with power scarcity in an emerging megacity: A consumers perspective from hyderabad. International Journal of Global Energy Issues, 33(3&4):189 204. [Hensher and Greene, 2003] Hensher, D. and Greene, W. (2003). The mixed logit model: The state of practice. Transportation, 30(2):133 176. [Hynes et al., 2008] Hynes, S., Hanley, N., and Scarpa, R. (2008). Effects on welfare measures of alternative means of accounting for preference heterogeneity in recreational demand models. American Journal of Agricultural Economics, 90(4):1011 1027. [McFadden, 1974] McFadden, D. (1974). Conditional logit analysis of qualitative choice behavior. In Zarembka, P., editor, Frontiers in econometrics, pages 105 142. Academic Press. [Meijer and Rouwendal, 2000] Meijer, E. and Rouwendal, J. (2000). Measuring welfare effects in models with random coefficients. [Meijer and Rouwendal, 2006] Meijer, E. and Rouwendal, J. (2006). Measuring welfare effects in models with random coefficients. Journal of Applied Economics, 21(2):227 244. 32
[Meyerhoff et al., 2010] Meyerhoff, J., Ohl, C., and Hartje, V. (2010). Landscape externalities from onshore wind power. Energy Policy, 38(1):82 92. [Morrison and Nalder, 2009] Morrison, M. and Nalder, C. (2009). Willingness to pay for improved quality of electricity supply across business type and location. The Energy Journal, 30(2):117 133. [Provencher and Bishop, 2004] Provencher, B. and Bishop, R. C. (2004). Does accounting for preference heterogeneity improve the forecasting of a random utility model? a case study. Journal of Environmental Economics and Management, 48(1):793 810. [Revelt and Train, 1998] Revelt, D. and Train, K. (1998). Mixed logit with repeated choices: Households choices of appliance efficiency level. Review of Economics and Statistics, 80(4):647 657. [Rommel et al., 2010] Rommel, K., Hanisch, M., Deb, K., and Sagebiel, J. (2010). Consumer preferences for improvements of power supply quality: Results from a choice experiment in Hyderabad, India and implications for energy policy: Paper presented at the European conference of the IAEE, August 25-28, 2010 in Vilnius, Lithuania. [Rouwendal et al., 2010] Rouwendal, J., de Blaeij, A., Rietveld, P., and Verhoef, E. (2010). The information content of a stated choice experiment: A new method and its application to the value of a statistical life. Transportation Research Part B: Methodological, 44(1):136 151. 33
[Sillano and de Dios Ortand, 2005] Sillano, M. and de Dios Ortand, J. (2005). Willingness-to-pay estimation with mixed logit models: some new evidence. Environment and Planning A, 37(3):525 550. [Torres et al., 2011] Torres, C., Colombo, S., and Hanley, N. (2011). Incorrectly accounting for taste heterogeneity in choice experiments: Does it really matter for welfare measurement? Stirling Economics Discussion Papers 2011-02, University of Stirling, Division of Economics. 34