Limited Dependent Variable Models
|
|
- Wendy Cain
- 7 years ago
- Views:
Transcription
1 Limited Dependent Variable Models Contents: Censored and truncated samples Sample selection bias and Mills ratio Truncated regression The Tobit I Model Interpretation, Tests, etc. The Selectivity Tobit II Model The Selectivity Tobit III Model The Double Hurdle Model Econometrics 2 (Summer 2008) 1 / 31
2 Censored and truncated samples Censored and Truncated Samples One may not always observe data on continuous dependent and explanatory variables over the entire population. Examples: Wages are only observed for people with job Same happens for working hours Individuals whose incomes fall short of some poverty line are assigned incomes equal to that poverty line for records there may also exist an upper bound (highest taxable income) Only people with children may need child care etc. We define samples as either truncated or censored depending on the nature of the limitations of the data. Econometrics 2 (Summer 2008) 2 / 31
3 Censored and Truncated Samples Censored and truncated samples Truncated samples: A sample is truncated if data is only available on a subset of the whole population. In many samples, only people having a job are recorded For truncated samples, data is simply not available to the researcher Censored samples: A sample is censored if data is re-coded for a subset of the population. Censored data can be characterized as a sample defect. Recall example of minimum wages Further Examples: A model of the demand for tickets to a concert. A model of desired labor supply, or the latent propensity to work. A model for labor supply (in hours) for labor force survey Econometrics 2 (Summer 2008) 3 / 31
4 Observability Criteria Observability Criteria from the dependent variable s perspective Consider an observed dependent variable y, a set of explanatory variables x, a latent variable y. Truncated Samples: T1: y = y if y > c, not observed otherwise, T2: y = y if y < d, not observed otherwise, T3: y = y if c < y < d, not observed otherwise. Censored Samples: C1: y = y 1I(y > c) + c 1I(y c), C2: y = y 1I(y < d) + d 1I(y d), C3: y = y 1I(c < y < d) + c 1I(y c) + d 1I(y d). Econometrics 2 (Summer 2008) 4 / 31
5 Sample Selection Bias Sample Selection Bias Consider the latent relationship y = x β + u with E(u x) = 0. Suppose that the observed dependent variable y is truncated from below at zero (T1 from above). Then, the conditional expectation E(y x, y > 0) = x β + E(u x, y > 0) Suppose we are interested in estimating = x β + E(u u > x β) (1) = x β + α ψ(x β). y i = x i β + u i for y i truncated from below at zero. Econometrics 2 (Summer 2008) 5 / 31
6 Sample Selection Bias Sample Selection Bias Problem: the parameters ˆβ c for the conditional model may not converge to the true parameters of the underlying latent relationship y = x β + u. In a sense, the model above suffers from omitted variables bias. The correct model should have been y i = x i β + α ψ(x i β) + u i. To estimate β from the correct model need to know: What form should ψ(x i β) take? if we know, least squares estimation is possible Econometrics 2 (Summer 2008) 6 / 31
7 Conditional Normal Density Preliminaries Let u i N(0, σ 2 ). Then, we may write the density f (u) as 1 u2 f (u) = exp( 2πσ 2 2σ 2 ) = 1 σ 1 exp( 1 2π 2 ( u σ )2 ) = 1 σ φ( u σ ) Also, P(u > c) = P( u σ > c σ ) = 1 Φ( c σ ) = Φ( c σ ) Hence, f (u u > c) = 1 σ φ( u σ ) 1 1 Φ( c σ ) = σ φ( u σ ) Φ( c σ ) Econometrics 2 (Summer 2008) 7 / 31
8 Preliminaries Conditional Expectation for a Normal Random Variable Let u i N(0, σ 2 ) (homoscedastic!). Then the conditional expectation E(u u > c) may be derived as E(u u > c) = = = c u f (u u > c) u = 1 1 Φ( c σ ) σ 1 Φ( c σ ) = σ φ( c σ ) 1 Φ( c σ ). c c/σ c u σ φ( u σ ) u z φ(z) z = u 1 σ φ( u σ ) 1 Φ( c σ ) u [ ] σ 1 Φ( c σ ) φ(z) c/σ Econometrics 2 (Summer 2008) 8 / 31
9 Inverse Mills Ratio Limited Dependent Variable Models Preliminaries To determine αψ need to assume distribution for u. Take normal distribution (φ, Φ) with variance σ 2. Then, with straight forward integration E[u y > 0] = E[u u > x β] = = x β σ { φ( u σ ) 1 Φ( x β σ φ(z) }du = σ ) Φ(z) where z = x β/σ Therefore α = σ and ψ(x β) = φ(z)/φ(z) = λ(z) called the inverse Mills ratio sometimes we find in the literature λ(x β) Econometrics 2 (Summer 2008) 9 / 31
10 Truncated Regression Model Truncated Regression Model Recall Conditional distribution / density The conditional density for a random variable u (recall, given x, we need only to consider distribution of error term) with unconditional density f (.) cumulative distribution F (.) for truncation from below at c can be written as f (u u > c) = = = f (u) P(u > c) f (u) 1 P(u c) f (u) 1 F (c). (2) Econometrics 2 (Summer 2008) 10 / 31
11 Truncated Regression Model Truncated Regression Model Recall: this is still a density. It is clearly 0, and integrating over the (limited) range gives c f (u) f (u u > c) u = c 1 F (c) u 1 = 1 F (c) f (u) u = c 1 (1 F (c)) = 1. 1 F (c) Consider a truncated sample where y = y if y > c, but y not observed otherwise. The model to be estimated is y i = x i β + u i. Estimating this model on the truncated sample implies that the distribution of u i is truncated, see eqn (1). Econometrics 2 (Summer 2008) 11 / 31
12 Truncated Regression Model Truncated Regression Model Assume the unconditional distribution of u is normal and homoscedastic: u i N(0, σ 2 ). if y i is truncated from below at 0, then we only observe u i over the limited range for which u i > x i β. Then, conditional density of u i is, compare eqn (2), f (u i u i > c) = 1 σ φ( u i σ ) 1 1 Φ( z i ) = σ φ( u i Φ(z i ) σ ) where z i = (x i β)/σ. To estimate the truncated model use of Maximum Likelihood techniques. Econometrics 2 (Summer 2008) 12 / 31
13 Truncated Regression Model Truncated Regression Model For each observation in the truncated sample, the likelihood contribution is precisely the conditional density L i (β, σ) = The sample likelihood is therefore L(β, σ) = 1 σ φ( y i x i β σ ) Φ(x i β/σ). N L i (β, σ). i=1 Maximization gives ML-parameters β and σ. Econometrics 2 (Summer 2008) 13 / 31
14 Truncated Regression Model Truncated Regression Model The marginal effect of a change in x i on the conditional expectation: is not simply β. E(y i x i, y i > 0) Comparing with eqn (1), it is rather E(y i x i, y i > 0) x i = β + α ψ(z i) x i Nevertheless, also β has a clear interpretation; actually Discuss! E(yi x i) = β x i Need to know ψ (inverse Mills ratio, commonly denoted by λ). Econometrics 2 (Summer 2008) 14 / 31
15 The Tobit Model (Tobit I) The Tobit Models Censored regression model or Tobit model (attributable to Tobin, 1958, gives Tobin-probit). The difference here is that data is available on the entire sample. But the dependent variable is censored at some value (say, C1). As before, the censoring (or truncation) is determined by the equation / model of interest. Consider again the latent relationship with a censored variable y i = y i y i = x i β + u i, u i N(0, σ 2 ) 1I(y i > 0). Will derive MLE making use what we have learned above (inverse Mills ration) Econometrics 2 (Summer 2008) 15 / 31
16 The Tobit I Model Limited Dependent Variable Models The Tobit Models Given normality for u i, the probability of observing a censored observation is where z i = x i β/σ. P(y i = 0 x i ) = P(y i 0 x i ) = P(u i x i β) = Φ( z i ) = 1 Φ(z i ) For uncensored observations, we write the density of y i in the normal way as f (y i ) = 1 σ φ(y i x i β ). σ Then, the sample likelihood function can be compiled as L(β, σ) = 1 Φ( x iβ σ ) 1 σ φ(y i x i β ). σ y i =0 y i >0 Econometrics 2 (Summer 2008) 16 / 31
17 The Tobit Models Interpreting the Tobit I Given a latent relationship of the form with an observability criterion we have that y i = x i β + u i, u i N(0, σ 2 ) y i = y i 1I(y i > 0) P(y i = 0 x i ) = 1 Φ(x i β/σ) P(y i > 0 x i ) = Φ(x i β/σ). The expected value of the observed dependent variable y i (censored at zero) is where λ(z) = φ(z)/φ(z). E(y i x i ) = P(y i > 0 x i ) E(y i x i, y i > 0) = Φ(x i β/σ) [x i β + σ λ(x i β/σ)] Econometrics 2 (Summer 2008) 17 / 31
18 Interpreting the Tobit The Tobit Models Marginal Effects On the latent variable: E(yi ) = β x i On the observed dependent variable, censored at zero, for the whole sample: E(y i x i ) = β Φ(x i β/σ) x i On the non-censored observed dependent variable: E(y i x i, y i > 0) x i = β + σ λ x i Econometrics 2 (Summer 2008) 18 / 31
19 The Tobit Models Limitations of the Tobit There may be problems with the Tobit model: Often, also for censored data we find missing explanatory data, What if the latent model is non-linear? (in literature one speaks of problems due to characterizing censored observations as corner solutions.) Econometrics 2 (Summer 2008) 19 / 31
20 The Tobit Models Testing and Miscellaneous Specification tests can be derived easily due to the use of maximum likelihood methods: think of LM-tests, Likelihood ratio test, etc. Extensions to models and estimators without the normality assumption are available though not trivial. For example, with logit, the inverse Mills ratio looks more complicated. Moreover, semiparametric methods to avoid distribution assumptions can get complicated. Can be constructed e.g. from Methods of Moments, see future chapters. If, as will happen on the next slides, the selection rule is based on a different model, then a two step estimator is thinkable, too. Econometrics 2 (Summer 2008) 20 / 31
21 Tobit II, III, etc Tobit II, III, etc. : Alternative Selection Rules Recall that so far we have assumed that censoring and truncation are driven by the model of interest. What happens if there is a simultaneous or sequential decision process behind? Consider a structural latent relationship: y i = x i β + u i and a latent, possibly censored, observability relationship for y i : I i = z i γ + v i. Disturbance terms u i and v i are (jointly) normal and homoscedastic with covariance cov(u i, v i ) = σ uv. Without loss of generality, for identification set σ v = 1 (discuss) Econometrics 2 (Summer 2008) 21 / 31
22 Tobit II, III, etc. Limited Dependent Variable Models Tobit II, III, etc Then, we get the following observability criteria Truncated Regression Tobit I Model Selectivity / Tobit II Model Selectivity / Tobit III Model Double Hurdle Model y i = yi if yi > 0 (not observed else) y i = yi 1I(yi > 0) y i = y i y i = y i y i = y i 1I(Ii > 0) 1I(Ii > 0) with I i censored 1I(Ii > 0 and yi > 0) (Further assumption: y i 0) Econometrics 2 (Summer 2008) 22 / 31
23 Selectivity Model Selectivity Tobit II Model For the Selectivity tobit II model, Observability criterion: y i = x i β + u i, I i = z i γ + v i. y i = y i 1I(I i > 0). General likelihood for the Selectivity model: L(β, γ, Σ) = y i =0 P(I i = 0 z i ) y i >0 f (y i x i, z i, I i = 1), (3) with P(I i = 0 z i ) = P(Ii < 0 z i ) = P(v i < z i γ) = Φ( z i γ) = 1 Φ(z i γ). Econometrics 2 (Summer 2008) 23 / 31
24 Selectivity Model Limited Dependent Variable Models Selectivity Model For the uncensored observations, need the conditional density f (y i x i, z i, I i = 1). Decompose into a product, f (y i x i, z i, I i = 1) = P(I i > 0 y i ) f (y i ). For bivariate normal disturbances u and v, Note that if σ uv = 0, P(Ii > 0 y i ) = P(v i > z i γ u i ) ( z i γ + σuv σ = Φ u (y i x i β) ). 1 ( σuv σ u ) 2 P(I i > 0 y i ) = Φ(z i γ). Econometrics 2 (Summer 2008) 24 / 31
25 Selectivity Model Two-Step Estimation of the Selectivity Model The Selectivity model may be is rather complex Can get consistent (but not efficient) estimates for Selectivity model using probit and OLS. The two-step estimation procedure derives from the fact that E(y i y i > 0) = x i β + E(u i y i > 0). Note that the conditional expectation for the Selectivity model is E(y i y i > 0) = E(y i I i > 0) = x i β + E(u i I i > 0) = x i β + E(u i v i > z i γ) = x i β + σ uv σ u λ(z i γ) Econometrics 2 (Summer 2008) 25 / 31
26 Selectivity Model Two-Step Estimation of the Selectivity Model Heckman (1979) proposes a two-stage procedure: 1 Use a Probit on the entire sample to estimate ˆγ. 2 Construct λ(z i ˆγ) for the non-censored observations in the sample. 3 Regress y i on x i and λ(z i ˆγ) by OLS for the subsample of non-censored observations. We must correct the standard errors of the second stage OLS for the inclusion of what is termed a generator regressor (ie. λ(z i ˆγ) in the structural regression). Even having done so, the estimates are inefficient (ie. not minimum variance). In the two-step procedure (indeed, for the full-information ML procedure also), one needs to worry about indentification of the hazard rate. Econometrics 2 (Summer 2008) 26 / 31
27 Selectivity Model Selectivity Tobit III Model Imagine the Selectivity model is a Tobit I might be even nested with main equation. For example, we have labor supply measured in hours. this, typically, happens only if also the selectivity model is of interest, else reduction to binary response would be fine. The three-step estimation procedure would work then as follows 1 Estimate the selectivity model via Tobit I on the entire sample to get ˆγ. 2 as above 3 as above for nested models some steps more to do. Econometrics 2 (Summer 2008) 27 / 31
28 The Double-Hurdle Model The Double-Hurdle Model An alternative to the selectivity model, proposed by Cragg (1971). So-called because of two hurdles to be overcome before observing a non-censored observation. Examples: 1 Labour Supply: 1 Do you want to work? 2 Given that you choose to look for work, can you find a job? 2 Credit constraints: 1 Do you want to buy the good? 2 Given that you want to buy the good, can you obtain credit? Econometrics 2 (Summer 2008) 28 / 31
29 The Double-Hurdle Model The Double-Hurdle Model For the Double Hurdle model, where (as before) y i = y i 1I(I i > 0 and y i > 0), y i = x i β + u i, I i = z i γ + v i. The probability of observing a non-censored observation is P(y i > 0 x i, z i ) = P(Ii > 0 and yi > 0) = P(v i > z i γ and u i > x i β) ( = Φ 2 z i γ, x i β ) ; ρ σ u Econometrics 2 (Summer 2008) 29 / 31
30 The Double-Hurdle Model The Double-Hurdle Model Clearly, for censored observations we write ( P(y i = 0 x i, z i ) = 1 Φ 2 z i γ, x i β ) ; ρ. σ u Further, for non-censored observations, the likelihood contribution is identical to the Selectivity Tobit II model. The general likelihood looks like eqn (3): L(β, γ, Σ) = y i =0 P(y i = 0 x i, z i ) y i >0 = P(y i = 0 x i, z i ) y i =0 y i >0 f (y i x i, z i, I i f (y i x i, z i, y i > 0) > 0, y i > 0) = P(y i = 0 x i, z i ) P(y i > 0 x i, z i ) f (y i ) y i =0 y i >0 Econometrics 2 (Summer 2008) 30 / 31
31 The Double-Hurdle Model The Double-Hurdle Model A simplification is assuming no correlation, so that L(β, γ, σ u ) = [1 Φ(z i γ) Φ( x i β )] Φ(z i γ) 1 ( yi x i φ β ) σ y i =0 u σ y i >0 u σ u discuss: scaling and identification problems (e.g. still works with σ v = 1) discuss: heteroscedasticity?! Econometrics 2 (Summer 2008) 31 / 31
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. A General Formulation 3. Truncated Normal Hurdle Model 4. Lognormal
More informationproblem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved
4 Data Issues 4.1 Truncated Regression population model y i = x i β + ε i, ε i N(0, σ 2 ) given a random sample, {y i, x i } N i=1, then OLS is consistent and efficient problem arises when only a non-random
More informationEconometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
More informationANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided
More informationLOGIT AND PROBIT ANALYSIS
LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y
More informationMarkov Chain Monte Carlo Simulation Made Simple
Markov Chain Monte Carlo Simulation Made Simple Alastair Smith Department of Politics New York University April2,2003 1 Markov Chain Monte Carlo (MCMC) simualtion is a powerful technique to perform numerical
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationOn Marginal Effects in Semiparametric Censored Regression Models
On Marginal Effects in Semiparametric Censored Regression Models Bo E. Honoré September 3, 2008 Introduction It is often argued that estimation of semiparametric censored regression models such as the
More informationFinancial Vulnerability Index (IMPACT)
Household s Life Insurance Demand - a Multivariate Two Part Model Edward (Jed) W. Frees School of Business, University of Wisconsin-Madison July 30, 1 / 19 Outline 1 2 3 4 2 / 19 Objective To understand
More informationComparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors
Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors Arthur Lewbel, Yingying Dong, and Thomas Tao Yang Boston College, University of California Irvine, and Boston
More informationFrom the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
More informationMultivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
More informationStatistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
More informationRegression with a Binary Dependent Variable
Regression with a Binary Dependent Variable Chapter 9 Michael Ash CPPA Lecture 22 Course Notes Endgame Take-home final Distributed Friday 19 May Due Tuesday 23 May (Paper or emailed PDF ok; no Word, Excel,
More informationSolución del Examen Tipo: 1
Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical
More informationReject Inference in Credit Scoring. Jie-Men Mok
Reject Inference in Credit Scoring Jie-Men Mok BMI paper January 2009 ii Preface In the Master programme of Business Mathematics and Informatics (BMI), it is required to perform research on a business
More informationWhat s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
More informationMultinomial and Ordinal Logistic Regression
Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,
More information1. When Can Missing Data be Ignored?
What s ew in Econometrics? BER, Summer 2007 Lecture 12, Wednesday, August 1st, 11-12.30 pm Missing Data These notes discuss various aspects of missing data in both pure cross section and panel data settings.
More informationPoisson Models for Count Data
Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the
More informationThe VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series.
Cointegration The VAR models discussed so fare are appropriate for modeling I(0) data, like asset returns or growth rates of macroeconomic time series. Economic theory, however, often implies equilibrium
More informationChapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
More informationMultiple Choice Models II
Multiple Choice Models II Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Multiple Choice Models II 1 / 28 Categorical data Categorical
More informationStatistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
More informationModels for Longitudinal and Clustered Data
Models for Longitudinal and Clustered Data Germán Rodríguez December 9, 2008, revised December 6, 2012 1 Introduction The most important assumption we have made in this course is that the observations
More informationStandard errors of marginal effects in the heteroskedastic probit model
Standard errors of marginal effects in the heteroskedastic probit model Thomas Cornelißen Discussion Paper No. 320 August 2005 ISSN: 0949 9962 Abstract In non-linear regression models, such as the heteroskedastic
More information2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
More informationAuxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
More informationFactor analysis. Angela Montanari
Factor analysis Angela Montanari 1 Introduction Factor analysis is a statistical model that allows to explain the correlations between a large number of observed correlated variables through a small number
More informationEarnings in private jobs after participation to post-doctoral programs : an assessment using a treatment effect model. Isabelle Recotillet
Earnings in private obs after participation to post-doctoral programs : an assessment using a treatment effect model Isabelle Recotillet Institute of Labor Economics and Industrial Sociology, UMR 6123,
More informationOverview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)
Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and
More informationThe frequency of visiting a doctor: is the decision to go independent of the frequency?
Discussion Paper: 2009/04 The frequency of visiting a doctor: is the decision to go independent of the frequency? Hans van Ophem www.feb.uva.nl/ke/uva-econometrics Amsterdam School of Economics Department
More informationFrom the help desk: hurdle models
The Stata Journal (2003) 3, Number 2, pp. 178 184 From the help desk: hurdle models Allen McDowell Stata Corporation Abstract. This article demonstrates that, although there is no command in Stata for
More informationPSTAT 120B Probability and Statistics
- Week University of California, Santa Barbara April 10, 013 Discussion section for 10B Information about TA: Fang-I CHU Office: South Hall 5431 T Office hour: TBA email: chu@pstat.ucsb.edu Slides will
More informationThe Bivariate Normal Distribution
The Bivariate Normal Distribution This is Section 4.7 of the st edition (2002) of the book Introduction to Probability, by D. P. Bertsekas and J. N. Tsitsiklis. The material in this section was not included
More informationINDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
More informationGamma Distribution Fitting
Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics
More information1 Teaching notes on GMM 1.
Bent E. Sørensen January 23, 2007 1 Teaching notes on GMM 1. Generalized Method of Moment (GMM) estimation is one of two developments in econometrics in the 80ies that revolutionized empirical work in
More informationBayesX - Software for Bayesian Inference in Structured Additive Regression
BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich
More informationHandling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
More informationAverage Redistributional Effects. IFAI/IZA Conference on Labor Market Policy Evaluation
Average Redistributional Effects IFAI/IZA Conference on Labor Market Policy Evaluation Geert Ridder, Department of Economics, University of Southern California. October 10, 2006 1 Motivation Most papers
More informationCHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
More informationEconomics Series. A Simple and Successful Shrinkage Method for Weighting Estimators of Treatment Effects
Economics Series Working Paper No. 304 A Simple and Successful Shrinkage Method for Weighting Estimators of Treatment Effects Winfried Pohlmeier 1, Ruben Seiberlich 1 and Selver Derya Uysal 2 1 University
More informationSAMPLE SELECTION BIAS IN CREDIT SCORING MODELS
SAMPLE SELECTION BIAS IN CREDIT SCORING MODELS John Banasik, Jonathan Crook Credit Research Centre, University of Edinburgh Lyn Thomas University of Southampton ssm0 The Problem We wish to estimate an
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES Contents 1. Random variables and measurable functions 2. Cumulative distribution functions 3. Discrete
More informationDiscussion Papers. Anzelika Zaiceva Klaus F. Zimmermann. Children, Kitchen, Church: Does Ethnicity Matter?
Discussion Papers Anzelika Zaiceva Klaus F. Zimmermann Children, Kitchen, Church: Does Ethnicity Matter? Berlin, September 2007 Opinions expressed in this paper are those of the author and do not necessarily
More informationStatistical Methods for research in International Relations and Comparative Politics
James Raymond Vreeland Dept. of Political Science Assistant Professor Yale University E-Mail: james.vreeland@yale.edu Room 300 Tel: 203-432-5252 124 Prospect Avenue Office hours: Wed. 10am to 12pm New
More informationThe Probit Link Function in Generalized Linear Models for Data Mining Applications
Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications
More informationLecture 15. Endogeneity & Instrumental Variable Estimation
Lecture 15. Endogeneity & Instrumental Variable Estimation Saw that measurement error (on right hand side) means that OLS will be biased (biased toward zero) Potential solution to endogeneity instrumental
More informationLecture 8: Signal Detection and Noise Assumption
ECE 83 Fall Statistical Signal Processing instructor: R. Nowak, scribe: Feng Ju Lecture 8: Signal Detection and Noise Assumption Signal Detection : X = W H : X = S + W where W N(, σ I n n and S = [s, s,...,
More informationi=1 In practice, the natural logarithm of the likelihood function, called the log-likelihood function and denoted by
Statistics 580 Maximum Likelihood Estimation Introduction Let y (y 1, y 2,..., y n be a vector of iid, random variables from one of a family of distributions on R n and indexed by a p-dimensional parameter
More informationIAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
More informationMaximum likelihood estimation of mean reverting processes
Maximum likelihood estimation of mean reverting processes José Carlos García Franco Onward, Inc. jcpollo@onwardinc.com Abstract Mean reverting processes are frequently used models in real options. For
More informationFDI as a source of finance in imperfect capital markets Firm-Level Evidence from Argentina
FDI as a source of finance in imperfect capital markets Firm-Level Evidence from Argentina Paula Bustos CREI and Universitat Pompeu Fabra September 2007 Abstract In this paper I analyze the financing and
More informationVI. Introduction to Logistic Regression
VI. Introduction to Logistic Regression We turn our attention now to the topic of modeling a categorical outcome as a function of (possibly) several factors. The framework of generalized linear models
More informationA Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
More informationStephen du Toit Mathilda du Toit Gerhard Mels Yan Cheng. LISREL for Windows: PRELIS User s Guide
Stephen du Toit Mathilda du Toit Gerhard Mels Yan Cheng LISREL for Windows: PRELIS User s Guide Table of contents INTRODUCTION... 1 GRAPHICAL USER INTERFACE... 2 The Data menu... 2 The Define Variables
More informationProblem of Missing Data
VASA Mission of VA Statisticians Association (VASA) Promote & disseminate statistical methodological research relevant to VA studies; Facilitate communication & collaboration among VA-affiliated statisticians;
More informationFIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038
More informationOrdinal Regression. Chapter
Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe
More informationLinear Classification. Volker Tresp Summer 2015
Linear Classification Volker Tresp Summer 2015 1 Classification Classification is the central task of pattern recognition Sensors supply information about an object: to which class do the object belong
More informationCohort, Age and Time Effects in Alcohol Consumption by Italian Households: a Double-Hurdle Approach
Cohort, Age and Time Effects in Alcohol Consumption by Italian Households: a Double-Hurdle Approach David Aristei *, Federico Perali * and Luca Pieroni ** Abstract: This paper describes the life-cycle
More informationMULTIVARIATE PROBABILITY DISTRIBUTIONS
MULTIVARIATE PROBABILITY DISTRIBUTIONS. PRELIMINARIES.. Example. Consider an experiment that consists of tossing a die and a coin at the same time. We can consider a number of random variables defined
More informationIntroduction to Fixed Effects Methods
Introduction to Fixed Effects Methods 1 1.1 The Promise of Fixed Effects for Nonexperimental Research... 1 1.2 The Paired-Comparisons t-test as a Fixed Effects Method... 2 1.3 Costs and Benefits of Fixed
More informationTiers, Preference Similarity, and the Limits on Stable Partners
Tiers, Preference Similarity, and the Limits on Stable Partners KANDORI, Michihiro, KOJIMA, Fuhito, and YASUDA, Yosuke February 7, 2010 Preliminary and incomplete. Do not circulate. Abstract We consider
More informationASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS
DATABASE MARKETING Fall 2015, max 24 credits Dead line 15.10. ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS PART A Gains chart with excel Prepare a gains chart from the data in \\work\courses\e\27\e20100\ass4b.xls.
More information1 Another method of estimation: least squares
1 Another method of estimation: least squares erm: -estim.tex, Dec8, 009: 6 p.m. (draft - typos/writos likely exist) Corrections, comments, suggestions welcome. 1.1 Least squares in general Assume Y i
More informationEconometric Models for Multi-Stage Audit Processes: An Application to the IRS National Research Program
Econometric Models for Multi-Stage Audit Processes: An Application to the IRS National Research Program Brian Erard B. Erard & Associates and Jonathan S. Feinstein Yale School of Management Abstract. We
More informationSession 9 Case 3: Utilizing Available Software Statistical Analysis
Session 9 Case 3: Utilizing Available Software Statistical Analysis Michelle Phillips Economist, PURC michelle.phillips@warrington.ufl.edu With material from Ted Kury Session Overview With Data from Cases
More informationECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE
ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE YUAN TIAN This synopsis is designed merely for keep a record of the materials covered in lectures. Please refer to your own lecture notes for all proofs.
More information1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability).
Examples of Questions on Regression Analysis: 1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability). Then,. When
More informationLinda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
More informationRegression III: Advanced Methods
Lecture 16: Generalized Additive Models Regression III: Advanced Methods Bill Jacoby Michigan State University http://polisci.msu.edu/jacoby/icpsr/regress3 Goals of the Lecture Introduce Additive Models
More informationEconomics of Strategy (ECON 4550) Maymester 2015 Applications of Regression Analysis
Economics of Strategy (ECON 4550) Maymester 015 Applications of Regression Analysis Reading: ACME Clinic (ECON 4550 Coursepak, Page 47) and Big Suzy s Snack Cakes (ECON 4550 Coursepak, Page 51) Definitions
More informationIMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD
REPUBLIC OF SOUTH AFRICA GOVERNMENT-WIDE MONITORING & IMPACT EVALUATION SEMINAR IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD SHAHID KHANDKER World Bank June 2006 ORGANIZED BY THE WORLD BANK AFRICA IMPACT
More informationCS229 Lecture notes. Andrew Ng
CS229 Lecture notes Andrew Ng Part X Factor analysis Whenwehavedatax (i) R n thatcomesfromamixtureofseveral Gaussians, the EM algorithm can be applied to fit a mixture model. In this setting, we usually
More informationDepartment of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March 2015. Due:-March 25, 2015.
Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment -3, Probability and Statistics, March 05. Due:-March 5, 05.. Show that the function 0 for x < x+ F (x) = 4 for x < for x
More informationPenalized regression: Introduction
Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood
More informationManagerial Economics Prof. Trupti Mishra S.J.M. School of Management Indian Institute of Technology, Bombay. Lecture - 13 Consumer Behaviour (Contd )
(Refer Slide Time: 00:28) Managerial Economics Prof. Trupti Mishra S.J.M. School of Management Indian Institute of Technology, Bombay Lecture - 13 Consumer Behaviour (Contd ) We will continue our discussion
More information16 : Demand Forecasting
16 : Demand Forecasting 1 Session Outline Demand Forecasting Subjective methods can be used only when past data is not available. When past data is available, it is advisable that firms should use statistical
More informationFitting Subject-specific Curves to Grouped Longitudinal Data
Fitting Subject-specific Curves to Grouped Longitudinal Data Djeundje, Viani Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh, EH14 4AS, UK E-mail: vad5@hw.ac.uk Currie,
More informationClustering in the Linear Model
Short Guides to Microeconometrics Fall 2014 Kurt Schmidheiny Universität Basel Clustering in the Linear Model 2 1 Introduction Clustering in the Linear Model This handout extends the handout on The Multiple
More informationLOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as
LOGISTIC REGRESSION Nitin R Patel Logistic regression extends the ideas of multiple linear regression to the situation where the dependent variable, y, is binary (for convenience we often code these values
More informationLogit and Probit. Brad Jones 1. April 21, 2009. University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science
Logit and Probit Brad 1 1 Department of Political Science University of California, Davis April 21, 2009 Logit, redux Logit resolves the functional form problem (in terms of the response function in the
More informationDeflator Selection and Generalized Linear Modelling in Market-based Accounting Research
Deflator Selection and Generalized Linear Modelling in Market-based Accounting Research Changbao Wu and Bixia Xu 1 Abstract The scale factor refers to an unknown size variable which affects some or all
More informationReview Jeopardy. Blue vs. Orange. Review Jeopardy
Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?
More informationUnivariate Time Series Analysis; ARIMA Models
Econometrics 2 Spring 25 Univariate Time Series Analysis; ARIMA Models Heino Bohn Nielsen of4 Outline of the Lecture () Introduction to univariate time series analysis. (2) Stationarity. (3) Characterizing
More information( ) is proportional to ( 10 + x)!2. Calculate the
PRACTICE EXAMINATION NUMBER 6. An insurance company eamines its pool of auto insurance customers and gathers the following information: i) All customers insure at least one car. ii) 64 of the customers
More informationLogistic Regression (1/24/13)
STA63/CBB540: Statistical methods in computational biology Logistic Regression (/24/3) Lecturer: Barbara Engelhardt Scribe: Dinesh Manandhar Introduction Logistic regression is model for regression used
More informationA Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
More informationTechnical Efficiency Accounting for Environmental Influence in the Japanese Gas Market
Technical Efficiency Accounting for Environmental Influence in the Japanese Gas Market Sumiko Asai Otsuma Women s University 2-7-1, Karakida, Tama City, Tokyo, 26-854, Japan asai@otsuma.ac.jp Abstract:
More informationEconometric analysis of the Belgian car market
Econometric analysis of the Belgian car market By: Prof. dr. D. Czarnitzki/ Ms. Céline Arts Tim Verheyden Introduction In contrast to typical examples from microeconomics textbooks on homogeneous goods
More informationChildren s Working Hours, School Enrolment and Human Capital Accumulation: Evidence from Pakistan and Nicaragua. F. Rosati M.
Children s Working Hours, School Enrolment and Human Capital Accumulation: Evidence from Pakistan and Nicaragua F. Rosati M. Rossi October, 2001 Children s Working Hours, School Enrolment and Human Capital
More informationQuantile Regression under misspecification, with an application to the U.S. wage structure
Quantile Regression under misspecification, with an application to the U.S. wage structure Angrist, Chernozhukov and Fernandez-Val Reading Group Econometrics November 2, 2010 Intro: initial problem The
More informationThe Graphical Method: An Example
The Graphical Method: An Example Consider the following linear program: Maximize 4x 1 +3x 2 Subject to: 2x 1 +3x 2 6 (1) 3x 1 +2x 2 3 (2) 2x 2 5 (3) 2x 1 +x 2 4 (4) x 1, x 2 0, where, for ease of reference,
More informationCovariance and Correlation
Covariance and Correlation ( c Robert J. Serfling Not for reproduction or distribution) We have seen how to summarize a data-based relative frequency distribution by measures of location and spread, such
More informationEnrollment Projections for Nebraska s Medicaid Insurance for Workers with Disabilities (Medicaid Buy-In Program) Mary G. McGarvey, Ph.D.
Enrollment Projections for Nebraska s Medicaid Insurance for Workers with Disabilities (Medicaid Buy-In Program) Mary G. McGarvey, Ph.D. March, 2003 1. Introduction Many persons with significant disabilities
More informationSYSTEMS OF REGRESSION EQUATIONS
SYSTEMS OF REGRESSION EQUATIONS 1. MULTIPLE EQUATIONS y nt = x nt n + u nt, n = 1,...,N, t = 1,...,T, x nt is 1 k, and n is k 1. This is a version of the standard regression model where the observations
More information