ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS
|
|
|
- Fay Dalton
- 10 years ago
- Views:
Transcription
1 ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS Jeffrey M. Wooldridge THE INSTITUTE FOR FISCAL STUDIES DEPARTMENT OF ECONOMICS, UCL cemmap working paper CWP04/04
2 ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI This version: June 2003 Abstract: I show that a class of fixed effects estimators is reasonably robust for estimating the population-averaged slope coefficients in panel data models with individual-specific slopes, where the slopes are allowed to be correlated with the covariates. In addition to including the usual fixed effects estimator, the results apply to estimators that eliminate individual-specific trends. Further, asymptotic variance matrices are straightforward to estimate. I apply the results, and propose alternative estimators, to estimation of average treatment in a general class of unobserved effects models. Keywords: Average Treatment Effect, Fixed Effects, Random Coefficient Thanks to Josh Angrist for reminding me about the example in Jin Hahn s JBES comment. 1
3 1. Introduction The standard fixed effects, or within, estimator is a workhorse in empirical studies that rely on linear panel data models. When the partial effects of interest are on time-varying covariates, fixed effects estimation is attractive because it allows for additive, unobserved heterogeneity that can be arbitrarily correlated with the time-varying covariates. (On the other hand, with random effects methods we assume that unobserved heterogeneity is uncorrelated with observed covariates.) Extensions of the standard linear model with an additive unobserved effect include the random trend model, where each cross-sectional unit is allowed to have its own linear trend (in addition to a separate level effect); a special case is the so-called random growth model, as in Heckman and Hotz (1989). Wooldridge (2002a, Section 11.2) provides an overview of these kinds of models. The properties of fixed effects estimators in general unobserved effects models have been derived assuming constant coefficients on the individual-specific, time-varying covariates. In Wooldridge (2003), I pointed out that the usual fixed effects estimator in the standard additive model is consistent in a model with individual specific slopes whenever the slopes are conditionally mean independent of the time-demeaned covariates. Importantly, this finding implies that the individual-specific slopes can be correlated with the time averages of the covariates, which we tend to think of as the major source of endogeneity in random coefficient panel data models. [With a small number of time periods, much more has been written about random coefficient models when the coefficients are assumed to be independent of the 2
4 covariates. See, for example, Hsiao (1986, Chapter 6). For most economic applications, the independence assumption is unrealistic.] In this paper, I extend the framework of Wooldridge (2003) to allow for general aggregate time effects. I show that the fixed effects estimator that sweeps away the individual-specific intercept and slopes on the aggregate variables is satisfyingly robust to the presence of individual-specific slopes on the individual-specific covariates. Section 2 contains the main result. In Section 3, I briefly consider estimation strategies based on first differencing. As a special case in Section 4, I treat the so-called random trend model, which has become popular in empirical studies [for example, Papke (1994), Hoxby (1996), and Friedberg (1998)]. In addition, I cover average treatment effect estimation in a general unobserved effects model, generalizing a simple example due to Hahn (2001). Because consistency of the fixed effects estimator generally rules out aggregate time effects in index models, I also propose modified estimators that can consistently estimate the average treatment effects when fixed effects does not. In Section 5, I show how the result extends to models where time-constant observable covariates are available and correlated with unobserved heterogeneity. 2. Linear Models and a Result for Fixed Effects For a random draw i from the population, the model is y it w t a i x it b i u it,t 1,...,T (2.1) where w t is a 1 J vector of aggregate time variables which we treat as nonrandom (without 3
5 consequence, since they are usually just time trends) a i is a J 1 vector of individual-specific slopes on the aggregate variables, x it is a 1 K vector of covariates that change across time (possibly including year dummies), b i is a K 1 vector of individual-specific slopes, and u it is an idiosyncratic error. In what follows, we view T as being relatively small, and so we keep it as fixed in the asymptotic analysis. We assume we have a sample of size N randomly drawn from the population. For simplicity, we assume a balanced panel. The object of interest is E b i,thek 1 vector of population-averaged partial effects. With small T, it is not possible to get precise estimates of each b i (when we treat them as parameters to estimate). Instead, we hope to estimate the average effects using standard estimators. Throughout we maintain the assumption E u it x i1,...,x it,a i,b i 0,t 1,...,T, (2.2) which follows under the conditional mean assumption E y it x i1,...,x it,a i,b i E y it x it,a i,b i w t a i x it b i,t 1,...,T. (2.3) Assumption (2.3) is a standard strict exogeneity condition in unobserved effects models: conditional on x it,a i,b i, the covariates from the other time periods do not affect the expected value of y it. While this rules out the possibility of lagged dependent variables, it does not restrict the correlation between a i,b i and x i1,...,x it. The possibility of correlation between b i and the x it makes (2.1) a correlated random coefficient model, to borrow a phrase from Heckman and Vytlacil (1998) for the cross-sectional case. The basic unobserved effects model is obtained with w t 1andb i. The random linear trend model also has b i but w t 1,t, sothata i a i1,a i2,wherea i2 is the 4
6 random trend for unit i. More flexible trends can be allowed with a sufficient number of time periods; for example, we can take w t 1,t,t 2. However, we cannot allow a full set of year dummies to interact with separate unobserved heterogeneity terms, since we then lose identification of. Generally, we must have J T; see, for example, Wooldridge (2002a, Section 11.2). One possibility for analyzing equation (2.1) is to treat the a i and b i as parameters to estimate for for i. Under (2.2) and an appropriate rank condition, we can obtain unbiased estimators of a i and b i,sayâ i and b i, by using ordinary least squares (OLS) on the time series for each i. Unfortunately, when T is small, the scope of such a strategy is limited. For one, we would need J K T to even implement the procedure. Unless T is fairly large, precise estimation of b i is not possible. Nevertheless, the average of the b i is generally consistent for (for fixed T as N and N -asymptotically normal. [See Wooldridge (2002a, Section 11.2) for verification in a closely related context.] Since this strategy is not available for large K, and since the covariance matrix of the resulting estimator is not easy to estimate, alternative methods for estimating the average effect are desirable. In this paper, we study estimators of that are motivated by the assumption that the slopes b i are constant, but we study the properties of these estimators in the context of model (2.1). Write b i d i, where E d i 0 by definition. Simple substitution into (2.1) gives y it w t a i x it x it d i u it w t a i x it v it, (2.4) (2.5) where v it x it d i u it. Whether any or all of the elements of a i are constant, we estimate in (2.1) allowing the entire vector a i to vary by i, and to be arbitrarily correlated with the x it.for the linear, additive effects model, this leads to the usual fixed effects estimator. More 5
7 generally, define y i to be the T 1 vector of y it,letw be the T J matrix with t th row w t,let X i be the T K matrix with t th row x it, and let v i be the vector of v it. Then we can write y i Wa i X i v i Wa i X i X i d i u i. (2.6) To eliminate a i,definethet T matrix M I T W W W 1 W, and premultipy (2.6) by M: My i MX i Mv i MX i MX i d i Mu i. We can write the equation in terms of residuals from individual-specific regressions as ÿ i X i v i ÿ i X i X i d i ü i (2.7) or ÿ it x it v it,t 1,...,T, (2.8) where, for instance, x it is the 1 K vector of residuals from the regression x it on w t,t 1,...,T. The fixed effects (FE) estimator of interpreted in the general sense of eliminating a i from (2.1) is just the pooled OLS estimator from (2.8). Rather than just restricting attention to time-demeaning, as in the usual fixed effects analysis, we allow for very general kinds of individual-specific detrending. Since the FE estimator,, is just a pooled OLS estimator, sufficient conditions for consistency are simple to obtain. In addition to the rank condition rank E X i X i K, (2.9) a sufficient condition is E X i v i E X i X i d i E X i ü i E X i X i d i E X i u i 0. Now, by (2.2), E u i X i 0, and so we must only worry about E X i X i d i. If E X i X i d i 0 (2.10) 6
8 then the FE estimator will be consistent. Since X i X i T t 1 x it x it,a sufficient condition is E x it x it d i 0,t 1,...,T. (2.11) Conditions (2.10) and (2.11) are a bit difficult to interpret. A simpler condition that is sufficient for (2.11) is E b i x it E b i,t 1,...,T, (2.12) which says that b i is mean independent of all of the detrended x it. [If we slightly strengthen (2.12) to E b i x i1,...x it E b i, then the fixed effects estimator can be shown to be unbiased, provided the expectation exists.] Condition (2.12) is notably weaker than the standard assumption assumed in a random effects environment, that b i is mean independent of each x it. Intuitively, condition (2.12) allows b i to be correlated with systematic components of x it. WegivesomespecificexamplesinSection3. Generally, (2.12) is more likely to hold the richer is w t. So, even of we do not think the term w t a i is necessary in (2.1), acting as if (2.1) contains individual-specific trends affords more robustness for estimating because more individual specific features are swept out of x it. Of course, the more that is included in w t, the less variation there is in x it : t 1,...,T,and so efficiency of can be adversely affected. In the limiting case J T, x it 0,t 1,...,T, and the fixed effects procedure cannot be carried out. Estimating the asymptotic variance of is straightforward with large N and small T. The usual, fully robust estimator for example, Wooldridge [2002a, equation (10.59)] is consistent: Avar N X i X i i 1 1 N X i û i û i X i i 1 N X i X i i 1 1, (2.13) 7
9 where û i ÿ i X i are the T 1 vectors of fixed effects residuals. Even if we assume homoskedasticity and serial independence of u it : t 1,...,T [conditional on X i,a i,b i ], a fully robust variance matrix is needed if b i : the presence of X i d i in the error terms induces both conditional heteroskedasticity and serial dependence. Fortunately, (2.13) is computed routinely by many regression packages, sometimes under the description of a cluster-robust variance matrix estimator. 3. Methods Based on First Differencing Often in empirical work, first differencing is used in place of the within transformation in order to eliminate an additive, unobserved effect. It is easy to see that the first difference (FD) estimator has robustness properties similar to the FE estimator. In the model with a single additive, unobserved effect in a i, first differencing gives y it x it x it d i u it,t 2,...,T. (3.1) Using an argument similar to the fixed effects case, under a standard rank condition and (2.2), a sufficient condition for consistency of the FD estimator is E b i x it E b i,t 2,...,T, (3.2) which explicitly allows b i to be correlated with the first-period covariates, x i1. When T 2 and, (2.12) and (3.2) are the same condition when x it are the time-demeaned covariates. When T 2, (3.2) differs from (2.12), but they are similar in flavor. A fully robust asymptotic variance matrix estimator for the FD estimator can be routinely computed after pooled OLS 8
10 estimation in first differences. See Wooldridge (2002a, Section ). For more complicated models, first differencing can be followed by a fixed effects type analysis to eliminate additional unobserved heterogeneity, in which case the model in first differences can be analyzed as in Section 2. We explicitly cover the random trend model in the next section. 4. Some Examples 4.1. The Basic Additive Model As mentioned earlier, a special case of the setup in Section 2 is the usual unobserved T effects model estimated by fixed effects. Then, x it x it x i,wherex i r 1 x ir. Condition (2.12) means that b i can be correlated with x i provided that b i is conditionally mean independent of the deviations from the means, x it. For example, if x it f i r it,t 1,...,T, then (2.12) allows for arbitrary correlation between f i and b i, provided E b i r i1,...,r it E b i (4.1) Similarly, the first differencing estimator is also consistent under (4.1); it, too, allows arbitrary correlation between b i and f i Random Trend Models 9
11 If we specify (2.1) as a random trend model, there are two popular approaches to estimation. ThepurefixedeffectsapproachistofollowtheprocedurefromSection2 sothat the x it are the detrended values from the regression x it on 1,t,t 1,...,T, for each i. Then, we can allow even more dependence between b i and time-constant features of x it. For example, suppose we can write x it f i g i t r it,t 1,...,T, (4.2) so that each element of x it is allowed to have an individual-specific trend. Then, for each i, x it depends only on r i1,...,r it, and so (4.1) is again sufficient. In applications of (2.1), we are usually worried that b i is correlated with time-constant components of x it f i and g i in the case of (4.2) in which case (4.1) seems reasonable. The process in (4.2) includes the case where x it is an integrated of order one process with individual-specific drift, as in x it g i x i,t 1 q it,t 1,...,T, (4.3) where q it : t 1,...,T can have arbitrary serial correlation. Repeated substitution shows t that (4.2) holds with f i x i0 and r it s 1 q is.since r it : t 1,...,T is a function of q it : t 1,...,T, (4.1) holds if E b i q i1,...,q it, which seems reasonable since we can allow b i to be arbitrarily correlated with the vector of initial conditions, x i0,aswellasthe vector of drifts, g i. An alternative estimation approach is to first difference to eliminate the additive effect, and then to use the within transformation to account for the random trend. First differencing is more attractive than the pure fixed effects approach from Section 2 when u it : t 1,...,T contains substantial positive serial correlation. Since we are applying the within 10
12 transformation to the first differenced equation, we see that a sufficient condition for consistency is E b i x it E b i,t 2,...,T, (4.4) where x it denotes the time-demeaned first differences. If x it : t 1,...,T follows (4.2), then first differencing x it eliminates f i while the within transformation applied to the first differences eliminates g i. In other words, (4.1) is still sufficient for consistency. Similar conclusions hold for both FE and strategies based on differencing if we take w t 1,t,t 2 (provided T 4). Then, x it can have an individual-specific quadratic trend, provided b i is mean independent of the idiosyncratic part of x it. And so on Estimating Average Treatment Effects with Unobserved Heterogeneity The results in Sections 2 and 3 have interesting implications for estimating average treatment effects (ATEs) in a class of nonlinear unobserved effects panel data models. For motivation, consider an example due to Hahn (2001), who was commenting on Angrist (2001). Hahn (2001) considered an unobserved effects probit model with two periods of panel data, and a single binary treatment indicator, x it : P y it 1 x i1,x i2,c i c i x it,t 1,2, (4.5) where is the standard normal cumulative distribution function. Hahn also assumed that y i1 and y i2 are independent conditional on x i1,x i2,c i, and that no units are treated in the first time period while all are treated in the second: x i1,x i2 0,1. The last assumption implies 11
13 that x i1,x i2 is independent of c i, which would seem to be ideal for estimating the only parameter in the model,. Hahn points out that, even with all of the assumptions he imposes, is not known to be identified. On the other hand, the average treatment effect, E c i c i, is identified, and a simple, consistent estimator is N 1 N i 1 y i2 y i1. It is easy to see that is the usual fixed effects estimator in the simple linear model y it a i x it u it,t 1,2. (Recall that FE is identical to FD when T 2, and the FD estimator is easily seen to be because x i2 x i1 1 for all i.). Hahn (2001) uses this example to show that ATEs can be identified even when underlying parameters are probably not. But he also uses the special structure of x it : t 1,2 to argue that the success of Angrist s (2001) strategy of eschewing nonlinear models in favor of linear methods even when y it is a limited dependent variable hinges on the structure of treatment assignment. Here, I use the results from Section 2 to determine assumptions under which simple panel data strategies do recover average treatment effects. We can identify average treatment effects in a very general class of unobserved effects models, provided we make assumptions of the kind in Section 2, and assume no time heterogeneity. Consider E y it x i1,...,x it,c i h x it,c i,t 1,...,T, (4.6) where h, is an unknown function, c i is a vector of unobserved heterogeneity, and x it is a 1 K vector of mutually exclusive binary treatment indicators. This structure for x it is very common in the treatment effect literature, where the base group (in time period t) is characterized by x it x it1,x it2,...,x itk 0. Other units in the population are subjected to one, and only one, of K treatments. For example, in the population of people with at least a 12
14 high school education, the base group could be people with no additional schooling. The treatment indicators can denote different amounts of college. Or, perhaps people participate in a job training program at different levels, with x it 0 indicating no job training. The leading case is K 1, where x it is a binary treatment indicator. There are only two assumptions in (4.6). The first is strict exogeneity of the treatment indicators, x it, conditional on c i. We have maintained strict exogeneity throughout, and it is very difficult to relax in general unobserved effects models. Second, (4.6) implies that the treatment effects are constant across time. For cross sectional unit i, the treatment effect of treatment level j (relative to no treatment) is b ij h e j,c i h 0,c i, (4.7) where e j is the vector with one in its j th entry and zeros elsewhere. Therefore, the ATEs are j E h e j,c i h 0,c i E b ij,j 1,...,K. (4.8) The goal is to determine when the usual fixed effects estimator, applied to a linear model, consistently estimates the ATEs. This is simple in the pure treatment effects setup because we can write E y it X i,c i a i x it b i,t 1,...,T (4.9) where a i h 0,c i and b i is the K 1 vector of individual-specific treatment effects, b ij. Equation (4.9) holds because each cross-sectional unit falls into one, and only one, treatment class at time t. Given (4.9), we can apply the results for the fixed effects estimator from Section 2. If c i is independent of the time-demeaned covariates, x it : t 1,...,T, thensois b i, and condition (2.12) holds. It follows that, regardless of the nature of y it,foranypatternof serial dependence, and for general treatment patterns over time even some that induce 13
15 correlation between x it and c i the FE estimator consistently estimates the average treatment effects. Similar comments hold for the first differencing estimator. Unfortunately, model (4.6) is not as general as we would like. For one, it does not allow other individual-specific covariates to affect y it. Perhaps most importantly, (4.6) excludes aggregate time effects, which generally allow ATEs to vary with time, and can be important in policy analysis with panel data. It turns out that we can identify, and easily estimate, time-varying ATEs in a general model, provided we change the assumption about the relationship between the unobserved heterogeneity and x it : t 1,...,T. For simplicity, let x it be a binary treatment indicator, and replace (4.6) with E y it x i1,...,x it,c i h t x it,c i,t 1,...,T, (4.10) so that h t, is allowed to vary with time. The average treatment effect now depends on t: t E h t 1,c i h t 0,c i,t 1,...,T, (4.11) Now, rather than assuming that c i is independent of x it : t 1,...,T, we assume independence conditional on x i: D c i x i1,...,x it D c i x i or D c i x i,x i1,...,x it D c i x i. (4.12) Assumption (4.12) is a nonparametric version of Mundlak s (1978) conditional mean assumption in the linear case; see also Chamberlain (1984) and Wooldridge (2002a). It states that the distribution of the unobserved effect, given the observed history of treatments, depends only on the fraction of periods treated. Condition (4.12) is similar in spirit to (2.12), but it is not the same, even if (4.12) could be stated in terms of conditional expectations. For example, if x it f i r it,t 1,...,T and c i b i, (4.1) is sufficient for (2.12), but (4.1) does not imply E c i x i1,...,x it E c i x i. 14
16 Under (4.10) and (4.12) we have E y it X i h t x it,c dg c X i h t x it,c dg c x i m t x it,x i (4.13) E y it x it,x i,t 1,...,T. The key is that E y it X i does not depend on x i1,...,x it in an unrestricted fashion. If x it were continuous, or took on numerous values, we could use nonparametric methods to estimate m t,. In the treatment effect case, estimation is very simple because x it,x i can take on only 2 T 1 different values (since x it takes on only two values and x i.takes on the values 0,1/T,..., T 1 /T,1 ). Let s i1 1 x i 1/T, s i2 1 x i 2/T,..., and s it 1 x i 1. Then we can write E y it X i t t x it s i t x it s i s t,t 1,...,T (4.14) where s i is the 1 T vector of s it and s E s i. The coefficient on x it is the average treatment effect. [Generally, iterated expectations implies that t E E h t 1,c i h t 0,c i x i E m t 1,x i m t 0,x i ; see Wooldridge (2002b, Lemma 2.2) for a general treatment.] Subtracting s from s i before forming the interactions ensures t is the treatment effect. In practice, s would be replaced with s N 1 N i 1 s i. In other words, for each period t, weruntheregression y it on 1,x it,s i1,...,s it,x it s i1 s 1,...,x it s it s T,i 1,...,N, (4.15) where the coefficient t on x it is the estimated ATE for period t. If we made the random effects assumption D c i X i D c i then, of course, the simple regression of y it on 1,x it,i 1,...N would consistently estimate t. If we pool across t (as well as i) and run the regression y it on 1,d2 t,...,dt t,x it,x i,t 1,...,T;i 1,...,N, wheredr t is a period r dummy variable, then the common coefficient on x it, which is identical to the 15
17 fixed effects estimate, would be the estimate of the ATE (assumed constant across t). The regression in (4.15) is more flexible because it allows ATEs to change over time while allowing E y it x it,x i to depend on x it,x i in a completely general way. Provided x it : t 1,...,T has some time variation, x it and x i will have independent variation for any t, which is all we need to identify t under (4.12). Condition (4.12) is hardly general, but it can be relaxed with T 2. For example, if x i T 1 1 T t 2 x it is the average change in treatment over the T periods, we might replace (4.12) with D c i X i D c i x i, x i or D c i x i1,...,x it D c i x i,x it x i1, (4.16) where equivalence follows because x i x it x i1 / T 1. [This assumption is in the spirit of assuming b i, the vector of slopes in a linear model, is independent of r i1,...,r it in (4.2); but it is not the same condition.] Then, E y it X i E y it x it,x i, x i m t x it,x i, x i. (4.17) Except for special treatment patterns, the ATE for each time period is identified from the population regression of y it on x it,x i, x i provided T 3. Generally, the regressors in each time period can take on 2 T T 1 different values because x i takes on values in 1/ T 1,0,1/ T 1. We can estimate a saturated regression model by defining two dummy variables, say, w i0,w i1,for x i taking on the values 0 and 1/ T 1, respectively. For each time period t, the regression would contain an overall intercept, x it,s i,w i, interactions s ij w ik, and interactions x it s ij s j,x it w ik w k, andx it s ij w ik s j w k for all j and k. Demeaning all of the indicators, including s ij w ik, before forming the interactions with x it, ensures that the coefficient on x it is the average treatment effect. As in many cases, it makes 16
18 sense to obtain heteroskedasticity-robust standard errors for the ATEs. The procedure described in the previous paragraph is costly in terms of degrees of freedom. For one, T different cross-sectional regressions are used; there is not pooling across t. So, for estimating t, one has N 6 T 1 degrees of freedom. The panel structure of the data is used only in obtaining the time-constant controls, s ij and w ik. One could use as regressors 1,x it,x i,x it x i1,x it x i x,x it x i x,where x is the cross-sectional average of x i and x is the average of x i. Still, there is something to say for the general procedure, as it may properly reflect the uncertainty in estimating ATEs under nonparametric assumptions. While further embellishments are possible with large T, identification of the ATEs in every time period hinges on the functions of x it : t 1,...,T assumedtoappearin D c i x i1,...,x it. We cannot allow D c i x i1,...,x it to be entirely unrestricted. How do the above procedures compare with more common approaches? A general comparison is not possible because (4.10) puts very little structure on E y it X i,c i [at the cost of (4.12) or (4.16)]. But suppose y it is a binary response: P y it 1 X i,c i F t x it c i,t 1,...,T, (4.18) where F is a cumulative distribution function. If we take F to be the logistic function, and the y it are conditionally independent across time, then the fixed effects logit estimator is consistent for (and the aggregate time effect coefficients). Unfortunately, ATEs are not identified since we make no distributional assumption for c i. Essentially by construction, methods that take no stand concerning the unconditional distribution of c i, or the conditional distribution D c i X i, have little hope of identifying ATES. If F is the standard normal cdf, Chamberlain s (1980) random effects probit model can be used, provided we assume c i X i Normal 0 1 x i1... T x it, 2. [In principle, F could be 17
19 the logit function, but then implementation of Chamberlain s method is much more difficult.] Chamberlain s approach identifies as well as the ATEs see Chamberlain (1984) or Wooldridge (2002a, Chapter 15) the latter of which vary over time because of the presence of t. Compared with the procedure discussed above, Chamberlain s method allows unrestricted weights on the x it in E c i X i, at the cost of homoskedasticity and normality. The regression procedure outlined above replaces Chamberlain s parametric assumptions with (4.12) or (4.16). The two approaches are complementary, since they work under different sets of assumptions, neither of which nests the other. All of the methods described above can be extended to the case of K 1 treatment levels, but degrees of freedom could be an issue. Then, each of the K elements in x i can take on T 1 different values, and so K T 1 dummy variables are needed to saturate the model, and these each need to be interacted with the elements of x it. A large cross-sectional sample would be needed to implement a fully nonparametric analysis under (4.12), and the extension in (4.16) would require even more data. 5. Other Extensions Sometimes, we want to allow b i to vary with observed, time-constant covariates, say z i,a 1 L vector: b i z i d i, (5.1) where is K 1and is K L. (One possibility is to include the time averages, x i,inz i.) Under (4.1), we can write 18
20 y it w t a i x it z i x it v it,t 1,...,T, (5.2) where vec and v it x it d i u it, as before. Equation (5.2) is just the formal way of writing that we add to the original model interactions between the elements of z i and x it.if E d i x it E d i 0, the fixed effects estimator applied to (5.2) would consistently estimate and, in which case the average effects E z i are consistently estimated by z, where z is the sample average across i. The methods from Section 4.3 can also be extended when time constant covariates, z i,are available. For example, in (4.10), we could replace (4.12) with D c i X i,z i D c i x i,z i, (5.3) in which case (4.13) becomes E y it X i,z i E y it x it,x i,z i,t 1,...,T. An estimate of the ATE at time t is obtained as N 1 N i 1 Ê y it 1,x i,z i Ê y it 0,x i,z i, for a suitable estimator of the conditional expectation. By including in z i observables such as family background, education, pre-training earnings, and so on, the assumption that heterogeneity depends only on the average treatment level may be more plausible. A thorough study that considers estimating E y it x it,x i,z i when the dimension of z i is large or contains continuous variables is left for future study. References Angrist, J.D. (2001), Estimation of Limited Dependent Variable Models With Dummy Endogenous Regressors: Simple Strategies for Empirical Practice, Journal of Business and 19
21 Economic Statistics 19, Chamberlain, G. (1980), Analysis of Covariance with Qualitative Data, Review of Economic Studies 47, Chamberlain, G. (1984), Panel Data, in Handbook of Econometrics, Volume 2, ed. Z. Griliches and M.D. Intriligator. Amsterdam: North Holland, Friedberg, L. (1998), Did Unilateral Divorce Raise Divorce Rates? American Economic Review 88, Hahn, J. (2001), Comment: Binary Regressors in Nonlinear Panel-Data Models with Fixed Effects, Journal of Business and Economic Statistics 19, Heckman, J.J. and V.J. Hotz (1989), Choosing among Alternative Nonexperimental Methods for Estimating the Impact of Social Programs: The Case of Manpower Training, Journal of the American Statistical Association 84, Heckman, J.J. and E. Vytlacil (1998), Instrumental Variables Methods for the Correlated Random Coefficient Model, Journal of Human Resources 33, Hoxby, C.M. (1996), How Teachers Unions Affect Education Production, Quarterly Journal of Economics 111, Hsiao, C. (1986), Analysis of Panel Data. Cambridge: Cambridge University Press. Mundlak, Y. (1978), On the Pooling of Time Series and Cross Section Data, Econometrica 46, Papke, L.E. (1994), Tax Policy and Urban Development: Evidence from the Indiana Enterprise Zone Program, Journal of Public Economics 54, Wooldridge, J.M. (2002a), Econometric Analysis of Cross Section and Panel Data. Cambridge, MA: MIT Press. 20
22 Wooldridge, J.M. (2002b), "Unobserved Heterogeneity and Estimation of Average Partial Effects," mimeo, Michigan State University Department of Economics. Wooldridge, J.M. (2003), Fixed Effects Estimation of the Population-Averaged Slopes in a Panel Data Random Coefficient Model, forthcoming, Problem, Econometric Theory
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:
Chapter 10: Basic Linear Unobserved Effects Panel Data Models: Microeconomic Econometrics I Spring 2010 10.1 Motivation: The Omitted Variables Problem We are interested in the partial effects of the observable
Clustering in the Linear Model
Short Guides to Microeconometrics Fall 2014 Kurt Schmidheiny Universität Basel Clustering in the Linear Model 2 1 Introduction Clustering in the Linear Model This handout extends the handout on The Multiple
SYSTEMS OF REGRESSION EQUATIONS
SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations
Correlated Random Effects Panel Data Models
INTRODUCTION AND LINEAR MODELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. The Linear
1. THE LINEAR MODEL WITH CLUSTER EFFECTS
What s New in Econometrics? NBER, Summer 2007 Lecture 8, Tuesday, July 31st, 2.00-3.00 pm Cluster and Stratified Sampling These notes consider estimation and inference with cluster samples and samples
Models for Longitudinal and Clustered Data
Models for Longitudinal and Clustered Data Germán Rodríguez December 9, 2008, revised December 6, 2012 1 Introduction The most important assumption we have made in this course is that the observations
Marketing Mix Modelling and Big Data P. M Cain
1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored
Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors
Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors Arthur Lewbel, Yingying Dong, and Thomas Tao Yang Boston College, University of California Irvine, and Boston
1. Review of the Basic Methodology
What s New in Econometrics? NBER, Summer 2007 Lecture 10, Tuesday, July 31st, 4.30-5.30 pm Difference-in-Differences Estimation These notes provide an overview of standard difference-in-differences methods
Panel Data: Linear Models
Panel Data: Linear Models Laura Magazzini University of Verona [email protected] http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Panel Data: Linear Models 1 / 45 Introduction Outline What
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
1. When Can Missing Data be Ignored?
What s ew in Econometrics? BER, Summer 2007 Lecture 12, Wednesday, August 1st, 11-12.30 pm Missing Data These notes discuss various aspects of missing data in both pure cross section and panel data settings.
Regression III: Advanced Methods
Lecture 16: Generalized Additive Models Regression III: Advanced Methods Bill Jacoby Michigan State University http://polisci.msu.edu/jacoby/icpsr/regress3 Goals of the Lecture Introduce Additive Models
On Marginal Effects in Semiparametric Censored Regression Models
On Marginal Effects in Semiparametric Censored Regression Models Bo E. Honoré September 3, 2008 Introduction It is often argued that estimation of semiparametric censored regression models such as the
CAPM, Arbitrage, and Linear Factor Models
CAPM, Arbitrage, and Linear Factor Models CAPM, Arbitrage, Linear Factor Models 1/ 41 Introduction We now assume all investors actually choose mean-variance e cient portfolios. By equating these investors
Introduction to Regression Models for Panel Data Analysis. Indiana University Workshop in Methods October 7, 2011. Professor Patricia A.
Introduction to Regression Models for Panel Data Analysis Indiana University Workshop in Methods October 7, 2011 Professor Patricia A. McManus Panel Data Analysis October 2011 What are Panel Data? Panel
Chapter 4: Vector Autoregressive Models
Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...
Econometric Methods for Panel Data
Based on the books by Baltagi: Econometric Analysis of Panel Data and by Hsiao: Analysis of Panel Data Robert M. Kunst [email protected] University of Vienna and Institute for Advanced Studies
Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*
Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?* Jennjou Chen and Tsui-Fang Lin Abstract With the increasing popularity of information technology in higher education, it has
Examining the effects of exchange rates on Australian domestic tourism demand: A panel generalized least squares approach
19th International Congress on Modelling and Simulation, Perth, Australia, 12 16 December 2011 http://mssanz.org.au/modsim2011 Examining the effects of exchange rates on Australian domestic tourism demand:
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
Multiple Linear Regression in Data Mining
Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple
Panel Data Econometrics
Panel Data Econometrics Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans University of Orléans January 2010 De nition A longitudinal, or panel, data set is
ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Quantile Treatment Effects 2. Control Functions
State Space Time Series Analysis
State Space Time Series Analysis p. 1 State Space Time Series Analysis Siem Jan Koopman http://staff.feweb.vu.nl/koopman Department of Econometrics VU University Amsterdam Tinbergen Institute 2011 State
MODELS FOR PANEL DATA Q
Greene-2140242 book November 23, 2010 12:28 11 MODELS FOR PANEL DATA Q 11.1 INTRODUCTION Data sets that combine time series and cross sections are common in economics. The published statistics of the OECD
IDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS. Steven T. Berry and Philip A. Haile. March 2011 Revised April 2011
IDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS By Steven T. Berry and Philip A. Haile March 2011 Revised April 2011 COWLES FOUNDATION DISCUSSION PAPER NO. 1787R COWLES FOUNDATION
Chapter 2. Dynamic panel data models
Chapter 2. Dynamic panel data models Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans Université d Orléans April 2010 Introduction De nition We now consider
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS
Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS Philip Merrigan ESG-UQAM, CIRPÉE Using Big Data to Study Development and Social Change, Concordia University, November 2103 Intro Longitudinal
1 Introduction. 2 The Econometric Model. Panel Data: Fixed and Random Effects. Short Guides to Microeconometrics Fall 2015
Short Guides to Microeconometrics Fall 2015 Kurt Schmidheiny Unversität Basel Panel Data: Fixed and Random Effects 2 Panel Data: Fixed and Random Effects 1 Introduction In panel data, individuals (persons,
ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
Keep It Simple: Easy Ways To Estimate Choice Models For Single Consumers
Keep It Simple: Easy Ways To Estimate Choice Models For Single Consumers Christine Ebling, University of Technology Sydney, [email protected] Bart Frischknecht, University of Technology Sydney,
Econometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
Handling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
for an appointment, e-mail [email protected]
M.Sc. in Economics Department of Economics, University College London Econometric Theory and Methods (G023) 1 Autumn term 2007/2008: weeks 2-8 Jérôme Adda for an appointment, e-mail [email protected] Introduction
Implementing Panel-Corrected Standard Errors in R: The pcse Package
Implementing Panel-Corrected Standard Errors in R: The pcse Package Delia Bailey YouGov Polimetrix Jonathan N. Katz California Institute of Technology Abstract This introduction to the R package pcse is
Machine Learning Methods for Demand Estimation
Machine Learning Methods for Demand Estimation By Patrick Bajari, Denis Nekipelov, Stephen P. Ryan, and Miaoyu Yang Over the past decade, there has been a high level of interest in modeling consumer behavior
Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions
Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions What will happen if we violate the assumption that the errors are not serially
Cost implications of no-fault automobile insurance. By: Joseph E. Johnson, George B. Flanigan, and Daniel T. Winkler
Cost implications of no-fault automobile insurance By: Joseph E. Johnson, George B. Flanigan, and Daniel T. Winkler Johnson, J. E., G. B. Flanigan, and D. T. Winkler. "Cost Implications of No-Fault Automobile
It is important to bear in mind that one of the first three subscripts is redundant since k = i -j +3.
IDENTIFICATION AND ESTIMATION OF AGE, PERIOD AND COHORT EFFECTS IN THE ANALYSIS OF DISCRETE ARCHIVAL DATA Stephen E. Fienberg, University of Minnesota William M. Mason, University of Michigan 1. INTRODUCTION
Evaluating one-way and two-way cluster-robust covariance matrix estimates
Evaluating one-way and two-way cluster-robust covariance matrix estimates Christopher F Baum 1 Austin Nichols 2 Mark E Schaffer 3 1 Boston College and DIW Berlin 2 Urban Institute 3 Heriot Watt University
We shall turn our attention to solving linear systems of equations. Ax = b
59 Linear Algebra We shall turn our attention to solving linear systems of equations Ax = b where A R m n, x R n, and b R m. We already saw examples of methods that required the solution of a linear system
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
3.1 Least squares in matrix form
118 3 Multiple Regression 3.1 Least squares in matrix form E Uses Appendix A.2 A.4, A.6, A.7. 3.1.1 Introduction More than one explanatory variable In the foregoing chapter we considered the simple regression
Introduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. A General Formulation 3. Truncated Normal Hurdle Model 4. Lognormal
APPROXIMATING THE BIAS OF THE LSDV ESTIMATOR FOR DYNAMIC UNBALANCED PANEL DATA MODELS. Giovanni S.F. Bruno EEA 2004-1
ISTITUTO DI ECONOMIA POLITICA Studi e quaderni APPROXIMATING THE BIAS OF THE LSDV ESTIMATOR FOR DYNAMIC UNBALANCED PANEL DATA MODELS Giovanni S.F. Bruno EEA 2004-1 Serie di econometria ed economia applicata
The Method of Least Squares
Hervé Abdi 1 1 Introduction The least square methods (LSM) is probably the most popular technique in statistics. This is due to several factors. First, most common estimators can be casted within this
Note 2 to Computer class: Standard mis-specification tests
Note 2 to Computer class: Standard mis-specification tests Ragnar Nymoen September 2, 2013 1 Why mis-specification testing of econometric models? As econometricians we must relate to the fact that the
Chapter 3: The Multiple Linear Regression Model
Chapter 3: The Multiple Linear Regression Model Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans November 23, 2013 Christophe Hurlin (University of Orléans) Advanced Econometrics
FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits
Technical Paper Series Congressional Budget Office Washington, DC FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits Albert D. Metz Microeconomic and Financial Studies
Time Series Analysis
Time Series Analysis Identifying possible ARIMA models Andrés M. Alonso Carolina García-Martos Universidad Carlos III de Madrid Universidad Politécnica de Madrid June July, 2012 Alonso and García-Martos
COURSES: 1. Short Course in Econometrics for the Practitioner (P000500) 2. Short Course in Econometric Analysis of Cointegration (P000537)
Get the latest knowledge from leading global experts. Financial Science Economics Economics Short Courses Presented by the Department of Economics, University of Pretoria WITH 2015 DATES www.ce.up.ac.za
THE IMPACT OF 401(K) PARTICIPATION ON THE WEALTH DISTRIBUTION: AN INSTRUMENTAL QUANTILE REGRESSION ANALYSIS
THE IMPACT OF 41(K) PARTICIPATION ON THE WEALTH DISTRIBUTION: AN INSTRUMENTAL QUANTILE REGRESSION ANALYSIS VICTOR CHERNOZHUKOV AND CHRISTIAN HANSEN Abstract. In this paper, we use the instrumental quantile
Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets
Unraveling versus Unraveling: A Memo on Competitive Equilibriums and Trade in Insurance Markets Nathaniel Hendren January, 2014 Abstract Both Akerlof (1970) and Rothschild and Stiglitz (1976) show that
IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.
Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium
From the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group
MISSING DATA TECHNIQUES WITH SAS IDRE Statistical Consulting Group ROAD MAP FOR TODAY To discuss: 1. Commonly used techniques for handling missing data, focusing on multiple imputation 2. Issues that could
Do R&D or Capital Expenditures Impact Wage Inequality? Evidence from the IT Industry in Taiwan ROC
Lai, Journal of International and Global Economic Studies, 6(1), June 2013, 48-53 48 Do R&D or Capital Expenditures Impact Wage Inequality? Evidence from the IT Industry in Taiwan ROC Yu-Cheng Lai * Shih
Introduction to time series analysis
Introduction to time series analysis Margherita Gerolimetto November 3, 2010 1 What is a time series? A time series is a collection of observations ordered following a parameter that for us is time. Examples
The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.
The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables Kathleen M. Lang* Boston College and Peter Gottschalk Boston College Abstract We derive the efficiency loss
Statistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
Chapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
Department of Economics
Department of Economics On Testing for Diagonality of Large Dimensional Covariance Matrices George Kapetanios Working Paper No. 526 October 2004 ISSN 1473-0278 On Testing for Diagonality of Large Dimensional
1 Determinants and the Solvability of Linear Systems
1 Determinants and the Solvability of Linear Systems In the last section we learned how to use Gaussian elimination to solve linear systems of n equations in n unknowns The section completely side-stepped
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Penalized regression: Introduction
Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood
IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD
REPUBLIC OF SOUTH AFRICA GOVERNMENT-WIDE MONITORING & IMPACT EVALUATION SEMINAR IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD SHAHID KHANDKER World Bank June 2006 ORGANIZED BY THE WORLD BANK AFRICA IMPACT
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
From the help desk: Demand system estimation
The Stata Journal (2002) 2, Number 4, pp. 403 410 From the help desk: Demand system estimation Brian P. Poi Stata Corporation Abstract. This article provides an example illustrating how to use Stata to
Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University
Practical I conometrics data collection, analysis, and application Christiana E. Hilmer Michael J. Hilmer San Diego State University Mi Table of Contents PART ONE THE BASICS 1 Chapter 1 An Introduction
Power and sample size in multilevel modeling
Snijders, Tom A.B. Power and Sample Size in Multilevel Linear Models. In: B.S. Everitt and D.C. Howell (eds.), Encyclopedia of Statistics in Behavioral Science. Volume 3, 1570 1573. Chicester (etc.): Wiley,
Mgmt 469. Fixed Effects Models. Suppose you want to learn the effect of price on the demand for back massages. You
Mgmt 469 Fixed Effects Models Suppose you want to learn the effect of price on the demand for back massages. You have the following data from four Midwest locations: Table 1: A Single Cross-section of
1 Short Introduction to Time Series
ECONOMICS 7344, Spring 202 Bent E. Sørensen January 24, 202 Short Introduction to Time Series A time series is a collection of stochastic variables x,.., x t,.., x T indexed by an integer value t. The
Markups and Firm-Level Export Status: Appendix
Markups and Firm-Level Export Status: Appendix De Loecker Jan - Warzynski Frederic Princeton University, NBER and CEPR - Aarhus School of Business Forthcoming American Economic Review Abstract This is
Example G Cost of construction of nuclear power plants
1 Example G Cost of construction of nuclear power plants Description of data Table G.1 gives data, reproduced by permission of the Rand Corporation, from a report (Mooz, 1978) on 32 light water reactor
STOCK MARKET VOLATILITY AND REGIME SHIFTS IN RETURNS
STOCK MARKET VOLATILITY AND REGIME SHIFTS IN RETURNS Chia-Shang James Chu Department of Economics, MC 0253 University of Southern California Los Angles, CA 90089 Gary J. Santoni and Tung Liu Department
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided
1 Teaching notes on GMM 1.
Bent E. Sørensen January 23, 2007 1 Teaching notes on GMM 1. Generalized Method of Moment (GMM) estimation is one of two developments in econometrics in the 80ies that revolutionized empirical work in
Breast and cervical cancer screening in Great Britain: Dynamic interrelated processes
Labeit and Peinemann Health Economics Review (2015) 5:32 DOI 10.1186/s13561-015-0065-3 RESEARCH Open Access Breast and cervical cancer screening in Great Britain: Dynamic interrelated processes Alexander
Why High-Order Polynomials Should Not be Used in Regression Discontinuity Designs
Why High-Order Polynomials Should Not be Used in Regression Discontinuity Designs Andrew Gelman Guido Imbens 2 Aug 2014 Abstract It is common in regression discontinuity analysis to control for high order
Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem
Chapter Vector autoregressions We begin by taking a look at the data of macroeconomics. A way to summarize the dynamics of macroeconomic data is to make use of vector autoregressions. VAR models have become
