Binary Outcome Models: Endogeneity and Panel Data
|
|
- Marybeth Alexander
- 7 years ago
- Views:
Transcription
1 Binary Outcome Models: Endogeneity and Panel Data ECMT 676 (Econometric II) Lecture Notes TAMU April 14, 2014 ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
2 Topics Issues in binary response models: Endogeneity I I Continuous endogenous x F F Control function approach IV probit approach Binary endogenous x Panel data I I I RE CRE FE ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
3 Binary response model with endogeneity Endogeneity arises naturally in the latent variable model LPM (linear probability model) with 2SLS is a handy solution for binary response with endogeneity. But it does not allow for individual-speci c marginal e ects We will deal with probit/logit with endogeneity Why endogeneity causes problem in binary model? not as transparent as in the linear model ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
4 The binary outcome model: y i = I fyi > 0g yi = xi 0 i Now we allow Ex i u i 6= 0 by assuming u i jx i N(q(x i ), 1). Then P(y i = 1jx i ) = P(u i < x 0 i βjx i ) = P(u i q(x i ) < x 0 i β q(x i )jx i ) = Φ(x 0 i β q(x i )) Probit of y i on x i is not consistent. We will see general endogeneity (σ uv 6= 0) introduces not only non-constant mean but also non-unity variance. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
5 When the endogenous variable is continuous A control function (CF) approach The part is based on Blundell and Powell (2004 REStud) and also Wooldridge (15.7.2). Unanswered question: why not traditional 2 stages? The model y 1i = I fy1i > 0g y1i = xi u i = z1i 0 β + αy 2i + u i Now suppose part of x i (called y 2i, assuming as a scalar) is endogenous: x i = (z 0 1i, y 2i ) 0. The vector of IVs z i = (z 0 1i, z0 2i )0. The reduced form (RF) y 2i = z 0 i δ + v i = z 0 1i δ 1 + z 0 2i δ 2 + v i ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
6 The parameters (α, β) index the average structural function (ASF) (or, the response probability). Again, this is de ned by xing all observed explanatory variables and integrating out the unobservable, u: ASF (y 2, z 1 ) = E u f1[z1 0 β + αy 2 + u > 0]g = Φ(z1 0 β + αy 2) Thus, α and β are the parameters that appear in the APEs (derivatives of ASF). ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
7 Assume ui Endogeneity comes from σ uv 6= 0. v i 1 jz N 0, σ uv σuv σ 2 v Normality of v is not realistic if y 2i is binary. So we are assuming for now the endogenous variable is continuous. Probit has some advantage here over logit, because we can then assume joint normality ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
8 Rivers and Vuong (1998) proposed a "control function (CF)" approach: "which introduces residuals from the reduced form for the regressors as covariates in the binary response model to account for endogeneity" (BP, 2004). Linear projection u = θv + e where θ = σ uv /σ 2 v and e is also normal. Having the linear projection, we get We will need D(ejz 1, y 2, v). I (y 1 = 1) = I (z1i 0 β + αy 2i + u i > 0) = I (z1i 0 β + αy 2i + θv i + e i > 0) the mean part ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
9 First we have e v = u θv 1 θ = v ρ 2 0 N(0, 0 σ 2 v u v ), given z, where ρ = corr(u, v) = σ uv /σ v. Then e is independent of v : uncorrelatedness of e and v (which also follows from the linear projection), implies independence, since they are jointly normal. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
10 Let D(j) mean conditional distribution D(ejz 1, y 2, v) = D(ejz, v) (since y 2 is generated by z and v) = D(ejz) (since e is independent of v) N(0, 1 ρ 2 ). So standard assumptions hold: conditional normality, homoskedasticity But the identi cation condition fails: Var(e) 6= 1. The outcome equation becomes (CF) y 1i = z 0 1i β + αy 2i + θv i + e i ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
11 Then P(y 1 = 1jx, v) = P(y 1 = 1jz 1, y 2, v) = P(z1i 0 β + αy 2i + θv i + e i > 0jz 1, y 2, v) = P(e i > (z1i 0 β + αy 2i + θv i )jz 1, y 2, v) = Φ (z1i 0 β + αy 2i + θv i )/ p 1 ρ 2 (1) v i can be estimated from RF, as bv i A probit of y 1 on z 1, y 2 and bv gives the estimates, called eβ, eα and eθ. So q eβ! β = β/ 1 ρ 2 eα! q α = α/ 1 ρ 2 q eθ! θ = θ/ 1 ρ 2 ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
12 Recall we are interested in β and α (especially α). The problem is now how to estimate ρ. Note that q θ = θ/ 1 ρ 2 θ = σ uv /σ 2 v = ρσ v /σ 2 v = ρ/σ v Solve the system for ρ and θ, since σ v and θ can be estimated. So p 1 ρ 2 can be estimated: p q 1 ρ 2 = 1 + θ 2 σ 2 v. So q bβ = eβ/ 1 + eθ 2 bσ 2 v q bα = eα/ 1 + eθ 2 bσ 2 v ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
13 An alternative derivation of (1) By Fact 1 (see below), u i jv i N(θv i, 1 ρ 2 ). This conditional normality is essentially all we need (we don t really need joint normality) So E (y 1i jx i, v i ) = P(u i > xi 0 β 0 jx i, v i ) = 1 P(u i < xi 0 β 0 jx i, v i ) = 1 P( u i θv p i 1 ρ 2 < x i 0β 0 θv p i 1 ρ 2 jx i, v i ) = 1 Φ( θv p i ) 1 ρ 2 xi 0β 0 = Φ( x 0 i β 0 + θv i p 1 ρ 2 ) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
14 Fact 1 This result is also crucial in Kalman lter. If then where u N v mu m v Suu, S vu ujv N(m ujv, S ujv ) S uv S vv m ujv = m u + S uv S 1 vv (v m v ) S ujv = S uu S uv S 1 vv S vu ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
15 Since v i has to be estimated, so CF regression involves a generated regressor. So getting the standard error is hard. An alternative approach: MLE (it is also called IV probit) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
16 Control function approach: in the linear model, it leads to the same estimate as 2SLS Wooldridge, Section 6.2, p. 127 & Problem 5.1 Exercise: show CF=2SLS (using OLS algebra) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
17 When the endogenous variable is continuous IV probit approach IV probit: the key is to obtain f (y 1, y 2 jz) f (y 1, y 2 jz) = f (y 1 jy 2, z) f (y 2 jz) First, f (y 2 jz) = f (zi 0δ + v i jz) N(zi 0δ, σ2 v ) = ϕ((y 2 zi 0δ)/σ v )/σ v Second, to get f (y 1 jy 2, z), essentially P(y 1 = 1jy 2, z), we need the distribution of u given y 2, z, where y1i = z1i 0 β + αy 2i + u i ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
18 By Fact 1, given z f (ujy 2, z) = f (ujz, v) N(σ uv σ 2 v v, 1 σ 2 uv σ 2 v ) This can be compared with the exogenous probit, in which case f (ujy 2, z) = f (ujz, v) u?v = f (ujz) N(0, 1) So endogeneity not only introduces non-zero mean but also non-unit variance in the conditional distribution of u. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
19 So P(y 1 = 1jy 2, z) = = P(z1i 0 β + αy 2i + u i > 0) = P(u i > q (z1i 0 β + αy 2i )) = P(u i σ uv σv 2 v/ 1 ρ 2 q > ( z1i 0 β αy 2i σ uv σv 2 v)/ q 1 ρ 2 ) = Φ((z1i 0 β + αy 2i + ρσv 1 v)/ 1 ρ 2 ) q = Φ((z1i 0 β + αy 2i + ρσv 1 (y 2 {z } z 0 δ))/ 1 ρ 2 ) =θ Φ(w) The likelihood for the i th observation: l i = Φ(w) y 1 [1 Φ(w)] 1 y 1 ϕ((y 2 zi 0δ)/σ v )/σ v Log likelihood for the sample L(β, α, ρ, σ v, δ)= n i =0 log l i Maximization over β, α, ρ, σ v, δ. Standard error: the estimated Hessian, or the outer product of the score. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
20 A simple test of exogeneity: ρ = 0. Control function approach: only focuses on f (y 1 jy 2, z), is thus a limited information procedure. CF approach replaces v by bv. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
21 When the endogenous variable is binary Binary exogenous variable needs special treatment because the linear model in the rst stage is not reasonable any more. The model y 1i = I fy1i > 0g y1i = xi u i = z1i 0 β + y 2i 0 α + u i y 2 = I fzi 0 i > 0g Like before, we assume (u, v) is jointly normal given z But here Var(v) = 1 for identi cation Assume, given z ui v i 1 ρ jz i N 0, ρ 1 ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
22 The likelihood of observation i : l i = f (y 1i, y 2i jz i ) = f (y 1i jy 2i, z i ) f (y 2i jz i ). The second factor is easy to obtain: f (y 2 jz) = P(y 2 = 1) y 2 [1 P(y 2 = 1)] 1 y 2 The rst factor is one of four cases: = Φ(z 0 δ) y 2 [1 Φ(z 0 δ)] 1 y 2 P(y 1 = 1jy 2 = 1, z) P(y 1 = 0jy 2 = 1, z) [=1-P(y 1 = 1jy 2 = 1, z)] P(y 1 = 1jy 2 = 0, z) P(y 1 = 0jy 2 = 0, z) [=1-P(y 1 = 1jy 2 = 0, z)] Thus MLE is obtained: L(β, α, ρ, δ) = n i =0 log l i We will calculate these four cases now. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
23 Fact 2 If v is standard normal, PDF (vjv > a) ϕ(v)/p(v > a) = ϕ(v)/φ( a) And PDF (vjv < a) ϕ(v)/p(v < a) = ϕ(v)/[1 Φ( a)] (proof?) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
24 When the endogenous variable is binary P(y 1 = 1jy 2 = 1, z) = E (I (y 1 = 1)jy 2 = 1, z) = E fe [I (y 1 = 1)jv, z]jy 2 = 1, zg (smaller conditional set dominates) = E (P(y 1 = 1jv, z)jy 2 = 1, z) q = E (Φ((z1i 0 β + αy 2i + ρv)/ 1 q ρ 2 )j y2 =1,z ) = E (Φ((z1i 0 β + αy 2i + ρv)/ 1 ρ 2 )jv i > zi 0 δ, z) = Z q support of v Φ((z0 1i β + αy 2i + ρv)/ Z by Fact 2 q = where (*) uses a projection. z 0 i δ Φ((z 0 1i β + αy 2i + ρv)/ 1 ρ 2 )f (vjv i > z 0 i δ, z)dv 1 ρ 2 )ϕ(v)/φ(z 0 i δ)dv, ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
25 Similarly P(y 1 = 1jy 2 = 0, z) Z zi 0 = δ q Φ((z1i 0 β + αy 2i + ρv)/ 1 ρ 2 )ϕ(v)/[1 Φ(zi 0 δ)]dv Similarly we can compute P(y 1 = 0jy 2 = 1, z) and P(y 1 = 0jy 2 = 0, z) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
26 EXAMPLE: E ects of Children on Labor Force Participation, LABSUP.txt y 1 = worked, y 2 = morekids (a dummy for having more than two children). Population is women with at least two children. worked = 1[α morekids + β 0 + β 1 nonmomi + β 2 educ +β 3 age + β 4 age 2 + β 5 black + β 6 hispan + u > 0] The binary variable samesex is the IV for morekids. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
27 Binary panel data model Pooled probit/logit Panel model without unobserved e ects: P(y it = 1jx it ) = Φ(x 0 it β) Strict exogeneity (SE): D(y it jx i ) = D(y it jx it ) for t = 1,, T, where x i = (x i1,, x it ) 0. (D means distribution) Stronger than the one in the linear model: in terms of expectation there. One di culty of using MLE (which is necessary for binary models) is y it is not independent over t, which is inconvenient in constructing the likelihood function Conditional independence (CI): y i1, y i2,, y it are independent conditional on x i. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
28 Under these two assumptions: f (y i1, y i2,, y it jx i ) CI = Π T t=1 f (y it jx i ) SE = Π T t=1 f (y it jx it ), where f (y it jx it ) = Φ(x it β) y it [1 Φ(x it β)] 1 y it. Pooled likelihood n T [y it log Φ(xit 0 β) + (1 y it ) log(1 Φ(x 0 i =1 t=1 Maximizer: pooled probit estimator. it β))] Partial likelihood theory says it is still consistent if CI doesn t hold. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
29 Binary panel data model RE model Now we add unobserved e ects Response probability: P(y it = 1jx it, c i ) = Φ(x 0 it β + c i ) SE: D(y it jx i, c i ) = D(y it jx it, c i ) for t = 1,, T. CI: y i1, y i2,, y it are independent conditional on x i, c i y i1, y i2,, y it are unconditionally dependent because of c i. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
30 CI implies: f (y i1, y i2,, y it jx i, c i ) CI = Π T t=1 f (y it jx i, c i ) SE = Π T t=1 f (y it jx it, c i ), where f (y it jx it, c i ) = Φ(x it β + c i ) y it [1 Φ(x it β + c i )] 1 y it. But c i is unobservable, so c i shouldn t appear in the likelihood function Viewing c i as parameters along with β leads to an incidental parameters problem. It means that MLEs bc i and bβ are inconsistent. (unlike in the linear case, in which we have p n consistency) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
31 Hallmark of FE analysis (in nonlinear models): no speci cation of a distribution for c i given x i. RE (random e ects) probit: assuming (The RE assumption) c i jx i N(0, σ 2 c ) Since the rst element of x i is 1, it implies c i N(0, σ 2 c ). Then a conditional ML can be applied to β and σ 2 c as follows. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
32 Dealing with c i : integrate out c i f (y i1, y i2,, y it jx i ) = Z Π T t=1 f (y it jx it, c i )ϕ(c/σ c )/σ c dc. The likelihood function for the entire sample is then straightforward: production over i It is called RE probit estimator. The estimate is also called population averaged. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
33 As shown before, under RE assumption, viewing c i + u it as the composite error in the latent variable model and applying pooled probit will give the attenuation bias This is in contrast to the linear model. Latent variable model yi = xit 0 i + u it xit 0 it y it = I (yi > 0) Var(v it jx it ) = Var(c i + u it jx it ) u it?c = i Var(uit jx it ) + σ 2 c Var(u it jx it, c i ) + σ 2 c = 1 + σ 2 c Response probability: u it?c = i P(y it = 1jx it ) = Φ(x 0 it β/ q1 + σ 2 c ) So bβ p! β/ p 1 + σ 2 c This result also motivates the necessity to specify the distribution of c (in contrast to linear RE model) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
34 Binary panel data model CRE model RE and CI might be too strong. Correlated RE (CRE) probit (Chamberlain s): A relaxation of RE: c i jx i N(x 0 i ξ, σ2 c ) allowing dependence between c i and x i Estimation is straightforward: only needs a modi cation of density of c f (y i1, y i2,, y it jx i ) = Z Π T t=1 f (y it jx it, c i )ϕ((c x 0 i ξ)/σ c )/σ c dc. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
35 Binary panel data model FE logit Panel logit model: where G (x) = e x /(1 + e x ). P(y it = 1jx it, c i ) = G (x 0 it β + c i ), An important advantage of panel logit is that we can obtain p N consistent and asymp. normal estimator of β without any assumptions on D(c i jx i ), or achieving xed e ects (FE) estimation. Assumptions: SE, CI, and that each element of x it is time-varying. De ne N i = T t=1 y it. It turns out that D(y i jx i, c i, N i ) does not depend on c i. The functional form of logit helps. We can t do this in probit. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
36 Consider T = 2. Then N i 2 f0, 1, 2g. Note that neither of which is informative for β. So we only consider N i = 1. P(y i1 = 1jx i, c i, N i = 0) = 0 P(y i2 = 1jx i, c i, N i = 0) = 0 P(y i1 = 1jx i, c i, N i = 2) = 1 P(y i2 = 1jx i, c i, N i = 2) = 1 As we will show, consistent estimation of β can be obtained by a standard logit of y i2 on x i2 x i1 using the observations for which N i = 1. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
37 where P(y i2 = 1jx i, c i, N i = 1) = P(y i2 = 1, N i = 1jx i, c i ) P(N i = 1jx i, c i ) Top = P(y i2 = 1, y i1 = 0jx i, c i ) CI = P(y i2 = 1jx i, c i )P(y i1 = 0jx i, c i ) SE = G (x 0 i2 β + c i )[1 G (x 0 i1 β + c i )] = e x 0 i2 β+c i (1 + e x 0 i2 β+c i )(1 + e x 0 i1 β+c i ), Bottom = P(y i2 = 1, y i1 = 0jx i, c i ) + P(y i2 = 0, y i1 = 1jx i, c i ) = G (xi2 0 β + c i )[1 G (xi1 0 β + c i )] + [1 G (xi2 0 β + c i )]G (xi1 0 β + c i ) e x i1 0 β+c i + e x i2 0 β+c i = (1 + e x 0 i2 β+c i )(1 + e x 0 i1 β+c i ). ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
38 So P(y i2 = 1jx i, c i, N i = 1) = = G ((x i2 x i1 ) 0 β) e x 0 i2 β+c i e x 0 i1 β+c i + e x 0 i2 β+c i = e(x i2 x i1 ) 0 β 1 + e (x i2 x i1 ) 0 β c i is canceled! On the other hand P(y i1 = 1jx i, c i, N i = 1) = P(y i2 = 0jx i, c i, N i = 1) = 1 G ((x i2 x i1 ) 0 β) ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
39 f (y i1, y i2 jx i, c i, N i = 1) = P(y i1 = 1jx i, c i, N i = 1) or P(y i2 = 1jx i, c i, N i = 1) = P(y i1 = 1jx i, c i, N i = 1) y i1 P(y i2 = 1jx i, c i, N i = 1) y i2 So the conditional log likelihood function for observation i is y i1 =1 y i2 l i (β) = I (N i = 1)[y i1 log(1 G ((x i2 x i1 ) 0 β)) +y i2 log G ((x i2 x i1 ) 0 β)] = I (Ni = 1)[(1 y i2 ) log(1 G ((x i2 x i1 ) 0 β)) +y i2 log G ((x i2 x i1 ) 0 β)] We select out the observations for which N i = 1. Computationally, it is just a standard logit of y i2 on x i2 observations for which N i = 1. x i1 using the ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
40 It is called conditional MLE (CMLE), as it is di erent for a full MLE. To get a full MLE: P(y i1, y i2 jx i, c i ) = P(y i1, y i2 jx i, c i, N i = 0) P(N {z } i = 0jx i, c i ) deterministic +P(y i1, y i2 jx i, c i, N i = 1) P(N {z } i = 1jx i, c i ) CMLE +P(y i1, y i2 jx i, c i, N i = 2) P(N {z } i = 2jx i, c i ). deterministic Here we used the rule of total probability: P(A)=P(AjB 1 )P(B 1 )+P(AjB 2 )P(B 2 ) if B 1 and B 2 forms a partition of the sample space. A side: this rule was used to show the selection bias, which mistakenly assumes P(A)=P(AjB 1 ). CMLE doesn t consider P(N i = 0jx i, c i ), P(N i = 1jx i, c i ) or P(N i = 2jx i, c i ) which may depend on β. ECMT 676 (TAMU) Binary Outcomes: Endogeneity and Panel April 14, / 40
ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Quantile Treatment Effects 2. Control Functions
More informationChapter 3: The Multiple Linear Regression Model
Chapter 3: The Multiple Linear Regression Model Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans November 23, 2013 Christophe Hurlin (University of Orléans) Advanced Econometrics
More informationStandard errors of marginal effects in the heteroskedastic probit model
Standard errors of marginal effects in the heteroskedastic probit model Thomas Cornelißen Discussion Paper No. 320 August 2005 ISSN: 0949 9962 Abstract In non-linear regression models, such as the heteroskedastic
More informationproblem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved
4 Data Issues 4.1 Truncated Regression population model y i = x i β + ε i, ε i N(0, σ 2 ) given a random sample, {y i, x i } N i=1, then OLS is consistent and efficient problem arises when only a non-random
More informationChapter 4: Statistical Hypothesis Testing
Chapter 4: Statistical Hypothesis Testing Christophe Hurlin November 20, 2015 Christophe Hurlin () Advanced Econometrics - Master ESA November 20, 2015 1 / 225 Section 1 Introduction Christophe Hurlin
More informationChapter 2. Dynamic panel data models
Chapter 2. Dynamic panel data models Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans Université d Orléans April 2010 Introduction De nition We now consider
More informationEconometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
More informationComparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors
Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors Arthur Lewbel, Yingying Dong, and Thomas Tao Yang Boston College, University of California Irvine, and Boston
More informationSolución del Examen Tipo: 1
Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical
More informationWhat s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
More information1 Another method of estimation: least squares
1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i
More informationIAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
More informationChapter 10: Basic Linear Unobserved Effects Panel Data. Models:
Chapter 10: Basic Linear Unobserved Effects Panel Data Models: Microeconomic Econometrics I Spring 2010 10.1 Motivation: The Omitted Variables Problem We are interested in the partial effects of the observable
More informationIDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS. Steven T. Berry and Philip A. Haile. March 2011 Revised April 2011
IDENTIFICATION IN A CLASS OF NONPARAMETRIC SIMULTANEOUS EQUATIONS MODELS By Steven T. Berry and Philip A. Haile March 2011 Revised April 2011 COWLES FOUNDATION DISCUSSION PAPER NO. 1787R COWLES FOUNDATION
More informationFIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038
More informationON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS
ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS Jeffrey M. Wooldridge THE INSTITUTE FOR FISCAL STUDIES DEPARTMENT OF ECONOMICS, UCL cemmap working
More informationThe Bivariate Normal Distribution
The Bivariate Normal Distribution This is Section 4.7 of the st edition (2002) of the book Introduction to Probability, by D. P. Bertsekas and J. N. Tsitsiklis. The material in this section was not included
More informationHURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. A General Formulation 3. Truncated Normal Hurdle Model 4. Lognormal
More informationMaximum Likelihood Estimation
Math 541: Statistical Theory II Lecturer: Songfeng Zheng Maximum Likelihood Estimation 1 Maximum Likelihood Estimation Maximum likelihood is a relatively simple method of constructing an estimator for
More informationCorrelated Random Effects Panel Data Models
INTRODUCTION AND LINEAR MODELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. The Linear
More informationCAPM, Arbitrage, and Linear Factor Models
CAPM, Arbitrage, and Linear Factor Models CAPM, Arbitrage, Linear Factor Models 1/ 41 Introduction We now assume all investors actually choose mean-variance e cient portfolios. By equating these investors
More informationLogistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression
Logistic Regression Department of Statistics The Pennsylvania State University Email: jiali@stat.psu.edu Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max
More information2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
More informationPanel Data: Linear Models
Panel Data: Linear Models Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Panel Data: Linear Models 1 / 45 Introduction Outline What
More informationClassification Problems
Classification Read Chapter 4 in the text by Bishop, except omit Sections 4.1.6, 4.1.7, 4.2.4, 4.3.3, 4.3.5, 4.3.6, 4.4, and 4.5. Also, review sections 1.5.1, 1.5.2, 1.5.3, and 1.5.4. Classification Problems
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationEfficient and Practical Econometric Methods for the SLID, NLSCY, NPHS
Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS Philip Merrigan ESG-UQAM, CIRPÉE Using Big Data to Study Development and Social Change, Concordia University, November 2103 Intro Longitudinal
More informationLecture 14: GLM Estimation and Logistic Regression
Lecture 14: GLM Estimation and Logistic Regression Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South
More informationLecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
More informationOn Marginal Effects in Semiparametric Censored Regression Models
On Marginal Effects in Semiparametric Censored Regression Models Bo E. Honoré September 3, 2008 Introduction It is often argued that estimation of semiparametric censored regression models such as the
More information1 Sufficient statistics
1 Sufficient statistics A statistic is a function T = rx 1, X 2,, X n of the random sample X 1, X 2,, X n. Examples are X n = 1 n s 2 = = X i, 1 n 1 the sample mean X i X n 2, the sample variance T 1 =
More informationLinda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
More informationWooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions
Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions What will happen if we violate the assumption that the errors are not serially
More informationLecture 19: Conditional Logistic Regression
Lecture 19: Conditional Logistic Regression Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina
More informationModels for Longitudinal and Clustered Data
Models for Longitudinal and Clustered Data Germán Rodríguez December 9, 2008, revised December 6, 2012 1 Introduction The most important assumption we have made in this course is that the observations
More informationStatistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationMarkov Chain Monte Carlo Simulation Made Simple
Markov Chain Monte Carlo Simulation Made Simple Alastair Smith Department of Politics New York University April2,2003 1 Markov Chain Monte Carlo (MCMC) simualtion is a powerful technique to perform numerical
More informationINDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
More informationMultiple Choice Models II
Multiple Choice Models II Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Multiple Choice Models II 1 / 28 Categorical data Categorical
More informationSales forecasting # 1
Sales forecasting # 1 Arthur Charpentier arthur.charpentier@univ-rennes1.fr 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting
More informationIMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD
REPUBLIC OF SOUTH AFRICA GOVERNMENT-WIDE MONITORING & IMPACT EVALUATION SEMINAR IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD SHAHID KHANDKER World Bank June 2006 ORGANIZED BY THE WORLD BANK AFRICA IMPACT
More informationThe Probit Link Function in Generalized Linear Models for Data Mining Applications
Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications
More informationEmployer-Provided Health Insurance and Labor Supply of Married Women
Upjohn Institute Working Papers Upjohn Research home page 2011 Employer-Provided Health Insurance and Labor Supply of Married Women Merve Cebi University of Massachusetts - Dartmouth and W.E. Upjohn Institute
More informationNote on the EM Algorithm in Linear Regression Model
International Mathematical Forum 4 2009 no. 38 1883-1889 Note on the M Algorithm in Linear Regression Model Ji-Xia Wang and Yu Miao College of Mathematics and Information Science Henan Normal University
More informationQuadratic forms Cochran s theorem, degrees of freedom, and all that
Quadratic forms Cochran s theorem, degrees of freedom, and all that Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 1, Slide 1 Why We Care Cochran s theorem tells us
More informationA General Approach to Variance Estimation under Imputation for Missing Survey Data
A General Approach to Variance Estimation under Imputation for Missing Survey Data J.N.K. Rao Carleton University Ottawa, Canada 1 2 1 Joint work with J.K. Kim at Iowa State University. 2 Workshop on Survey
More informationPanel Data Econometrics
Panel Data Econometrics Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans University of Orléans January 2010 De nition A longitudinal, or panel, data set is
More informationChapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem
Chapter Vector autoregressions We begin by taking a look at the data of macroeconomics. A way to summarize the dynamics of macroeconomic data is to make use of vector autoregressions. VAR models have become
More informationTopic 5: Stochastic Growth and Real Business Cycles
Topic 5: Stochastic Growth and Real Business Cycles Yulei Luo SEF of HKU October 1, 2015 Luo, Y. (SEF of HKU) Macro Theory October 1, 2015 1 / 45 Lag Operators The lag operator (L) is de ned as Similar
More information1 Prior Probability and Posterior Probability
Math 541: Statistical Theory II Bayesian Approach to Parameter Estimation Lecturer: Songfeng Zheng 1 Prior Probability and Posterior Probability Consider now a problem of statistical inference in which
More informationLogit and Probit. Brad Jones 1. April 21, 2009. University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science
Logit and Probit Brad 1 1 Department of Political Science University of California, Davis April 21, 2009 Logit, redux Logit resolves the functional form problem (in terms of the response function in the
More informationSF2940: Probability theory Lecture 8: Multivariate Normal Distribution
SF2940: Probability theory Lecture 8: Multivariate Normal Distribution Timo Koski 24.09.2015 Timo Koski Matematisk statistik 24.09.2015 1 / 1 Learning outcomes Random vectors, mean vector, covariance matrix,
More informationPS 271B: Quantitative Methods II. Lecture Notes
PS 271B: Quantitative Methods II Lecture Notes Langche Zeng zeng@ucsd.edu The Empirical Research Process; Fundamental Methodological Issues 2 Theory; Data; Models/model selection; Estimation; Inference.
More informationMultivariate Logistic Regression
1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation
More informationEconometric Methods for Panel Data
Based on the books by Baltagi: Econometric Analysis of Panel Data and by Hsiao: Analysis of Panel Data Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies
More informationLOGIT AND PROBIT ANALYSIS
LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y
More informationImputation of missing data under missing not at random assumption & sensitivity analysis
Imputation of missing data under missing not at random assumption & sensitivity analysis S. Jolani Department of Methodology and Statistics, Utrecht University, the Netherlands Advanced Multiple Imputation,
More informationOutline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
More informationThe Engle-Granger representation theorem
The Engle-Granger representation theorem Reference note to lecture 10 in ECON 5101/9101, Time Series Econometrics Ragnar Nymoen March 29 2011 1 Introduction The Granger-Engle representation theorem is
More informationRegression with a Binary Dependent Variable
Regression with a Binary Dependent Variable Chapter 9 Michael Ash CPPA Lecture 22 Course Notes Endgame Take-home final Distributed Friday 19 May Due Tuesday 23 May (Paper or emailed PDF ok; no Word, Excel,
More informationOverview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)
Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and
More informationWeb-based Supplementary Materials for Bayesian Effect Estimation. Accounting for Adjustment Uncertainty by Chi Wang, Giovanni
1 Web-based Supplementary Materials for Bayesian Effect Estimation Accounting for Adjustment Uncertainty by Chi Wang, Giovanni Parmigiani, and Francesca Dominici In Web Appendix A, we provide detailed
More informationMultilevel Models for Longitudinal Data. Fiona Steele
Multilevel Models for Longitudinal Data Fiona Steele Aims of Talk Overview of the application of multilevel (random effects) models in longitudinal research, with examples from social research Particular
More informationUnivariate Time Series Analysis; ARIMA Models
Econometrics 2 Spring 25 Univariate Time Series Analysis; ARIMA Models Heino Bohn Nielsen of4 Outline of the Lecture () Introduction to univariate time series analysis. (2) Stationarity. (3) Characterizing
More informationMonte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091)
Monte Carlo and Empirical Methods for Stochastic Inference (MASM11/FMS091) Magnus Wiktorsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February
More informationMulti-variable Calculus and Optimization
Multi-variable Calculus and Optimization Dudley Cooke Trinity College Dublin Dudley Cooke (Trinity College Dublin) Multi-variable Calculus and Optimization 1 / 51 EC2040 Topic 3 - Multi-variable Calculus
More informationMonte Carlo-based statistical methods (MASM11/FMS091)
Monte Carlo-based statistical methods (MASM11/FMS091) Jimmy Olsson Centre for Mathematical Sciences Lund University, Sweden Lecture 5 Sequential Monte Carlo methods I February 5, 2013 J. Olsson Monte Carlo-based
More informationSAMPLE SELECTION BIAS IN CREDIT SCORING MODELS
SAMPLE SELECTION BIAS IN CREDIT SCORING MODELS John Banasik, Jonathan Crook Credit Research Centre, University of Edinburgh Lyn Thomas University of Southampton ssm0 The Problem We wish to estimate an
More informationSpatial Statistics Chapter 3 Basics of areal data and areal data modeling
Spatial Statistics Chapter 3 Basics of areal data and areal data modeling Recall areal data also known as lattice data are data Y (s), s D where D is a discrete index set. This usually corresponds to data
More informationAnalyzing Structural Equation Models With Missing Data
Analyzing Structural Equation Models With Missing Data Craig Enders* Arizona State University cenders@asu.edu based on Enders, C. K. (006). Analyzing structural equation models with missing data. In G.
More informationSYSTEMS OF REGRESSION EQUATIONS
SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations
More informationLecture 3: Differences-in-Differences
Lecture 3: Differences-in-Differences Fabian Waldinger Waldinger () 1 / 55 Topics Covered in Lecture 1 Review of fixed effects regression models. 2 Differences-in-Differences Basics: Card & Krueger (1994).
More informationECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
More informationWes, Delaram, and Emily MA751. Exercise 4.5. 1 p(x; β) = [1 p(xi ; β)] = 1 p(x. y i [βx i ] log [1 + exp {βx i }].
Wes, Delaram, and Emily MA75 Exercise 4.5 Consider a two-class logistic regression problem with x R. Characterize the maximum-likelihood estimates of the slope and intercept parameter if the sample for
More informationExample: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
More informationLecture 8: More Continuous Random Variables
Lecture 8: More Continuous Random Variables 26 September 2005 Last time: the eponential. Going from saying the density e λ, to f() λe λ, to the CDF F () e λ. Pictures of the pdf and CDF. Today: the Gaussian
More informationReject Inference in Credit Scoring. Jie-Men Mok
Reject Inference in Credit Scoring Jie-Men Mok BMI paper January 2009 ii Preface In the Master programme of Business Mathematics and Informatics (BMI), it is required to perform research on a business
More informationSF2940: Probability theory Lecture 8: Multivariate Normal Distribution
SF2940: Probability theory Lecture 8: Multivariate Normal Distribution Timo Koski 24.09.2014 Timo Koski () Mathematisk statistik 24.09.2014 1 / 75 Learning outcomes Random vectors, mean vector, covariance
More informationFigure B.1: Optimal ownership as a function of investment interrelatedness. Figure C.1: Marginal effects at low interrelatedness
Online Appendix for: Lileeva, A. and J. Van Biesebroeck. Outsourcing when Investments are Specific and Interrelated, Journal of the European Economic Association Appendix A: Proofs Proof of Proposition
More informationThe Real Business Cycle Model
The Real Business Cycle Model Ester Faia Goethe University Frankfurt Nov 2015 Ester Faia (Goethe University Frankfurt) RBC Nov 2015 1 / 27 Introduction The RBC model explains the co-movements in the uctuations
More informationClustering in the Linear Model
Short Guides to Microeconometrics Fall 2014 Kurt Schmidheiny Universität Basel Clustering in the Linear Model 2 1 Introduction Clustering in the Linear Model This handout extends the handout on The Multiple
More information6.2 Permutations continued
6.2 Permutations continued Theorem A permutation on a finite set A is either a cycle or can be expressed as a product (composition of disjoint cycles. Proof is by (strong induction on the number, r, of
More informationSales forecasting # 2
Sales forecasting # 2 Arthur Charpentier arthur.charpentier@univ-rennes1.fr 1 Agenda Qualitative and quantitative methods, a very general introduction Series decomposition Short versus long term forecasting
More informationMULTIVARIATE PROBABILITY DISTRIBUTIONS
MULTIVARIATE PROBABILITY DISTRIBUTIONS. PRELIMINARIES.. Example. Consider an experiment that consists of tossing a die and a coin at the same time. We can consider a number of random variables defined
More informationUnivariate Time Series Analysis; ARIMA Models
Econometrics 2 Fall 25 Univariate Time Series Analysis; ARIMA Models Heino Bohn Nielsen of4 Univariate Time Series Analysis We consider a single time series, y,y 2,..., y T. We want to construct simple
More informationProblem of Missing Data
VASA Mission of VA Statisticians Association (VASA) Promote & disseminate statistical methodological research relevant to VA studies; Facilitate communication & collaboration among VA-affiliated statisticians;
More informationImbens/Wooldridge, Lecture Notes 5, Summer 07 1
Imbens/Wooldridge, Lecture Notes 5, Summer 07 1 What s New in Econometrics NBER, Summer 2007 Lecture 5, Monday, July 30th, 4.30-5.30pm Instrumental Variables with Treatment Effect Heterogeneity: Local
More informationPortfolio selection based on upper and lower exponential possibility distributions
European Journal of Operational Research 114 (1999) 115±126 Theory and Methodology Portfolio selection based on upper and lower exponential possibility distributions Hideo Tanaka *, Peijun Guo Department
More informationNonlinear Regression:
Zurich University of Applied Sciences School of Engineering IDP Institute of Data Analysis and Process Design Nonlinear Regression: A Powerful Tool With Considerable Complexity Half-Day : Improved Inference
More informationMultinomial and Ordinal Logistic Regression
Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,
More informationLecture 15. Endogeneity & Instrumental Variable Estimation
Lecture 15. Endogeneity & Instrumental Variable Estimation Saw that measurement error (on right hand side) means that OLS will be biased (biased toward zero) Potential solution to endogeneity instrumental
More informationSection 6.1 Joint Distribution Functions
Section 6.1 Joint Distribution Functions We often care about more than one random variable at a time. DEFINITION: For any two random variables X and Y the joint cumulative probability distribution function
More informationUsing the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes JunXuJ.ScottLong Indiana University August 22, 2005 The paper provides technical details on
More informationFor a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )
Probability Review 15.075 Cynthia Rudin A probability space, defined by Kolmogorov (1903-1987) consists of: A set of outcomes S, e.g., for the roll of a die, S = {1, 2, 3, 4, 5, 6}, 1 1 2 1 6 for the roll
More informationA Subset-Continuous-Updating Transformation on GMM Estimators for Dynamic Panel Data Models
Article A Subset-Continuous-Updating Transformation on GMM Estimators for Dynamic Panel Data Models Richard A. Ashley 1, and Xiaojin Sun 2,, 1 Department of Economics, Virginia Tech, Blacksburg, VA 24060;
More informationA Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
More informationCovariance and Correlation
Covariance and Correlation ( c Robert J. Serfling Not for reproduction or distribution) We have seen how to summarize a data-based relative frequency distribution by measures of location and spread, such
More information