Overview and Introduction. Contact
|
|
- Jack Casey
- 7 years ago
- Views:
Transcription
1 Overview and Introduction Contact Lecturer: S. Sperlich (MZG 8.128) Lecture: Tue at 8-10am, MZG Contact: (Office hours: by appointment) Assistent: M. Dickel (MZG 8.135) Tutorial: Thu at 8-10am, MZG or WiSoRZ Start: Thu April 16 at 8am s.t. WiSoRZ Contact: (Office hours: by appointment) Econometrics 2 (Summer 2009) 1 / 35
2 Overview and Introduction Contents Binary Choice Models Multiple Diskrete Variables (Polychotome) Models Tobit 1, Tobit 2, Tobit 3 Hypothesis Tests (with recapitulation) GMM (including recapitulation of Endogeneity and IV-methods) Duration and Survival Models Paneldata Analysis (basics) Econometrics 2 (Summer 2009) 2 / 35
3 Overview and Introduction Literature Arellano, M. (2003) Panel Data Econometrics, Oxford University Press. Baltagi, B. (2001) Econometric Analysis of Panel Data, Wiley College Textbooks. Berndt, E. R. (1996) The practice of econometrics: classic and contemporary, Addison Wesley. Greene, W. (2003) Econometric Analysis, Prentice Hall. Hsiao, C. (2003) Analysis of Panel Data, Cambridge University Press. Judge, G., Hill, R., Griffiths, W., Lütkepohl, H. (1988) Introduction to the Theory and Practice of Econometrics, New York: Wiley. Maddala, G. S. (1986) Limited dependent and Qualitative Variables in Econometrics. Wooldridge, J. (2002) Econometric Analysis of Cross Section and Panel Data, MIT Press. Econometrics 2 (Summer 2009) 3 / 35
4 Overview and Introduction Organization Language (english - german) Tutorials (IT-room and/or seminarroom) Timetable for tutorials Exams Requirements: maths, statistics, profound knowledge of multivariate regression, Desired: Introduction of Econometrics or Econometrics 1 Econometrics 2 (Summer 2009) 4 / 35
5 Chapter 1 Binary and Multiple Choice Models Examples? Previous knowledge linear, logit, probit Differences between normal linear regression Econometrics 2 (Summer 2009) 5 / 35
6 Discrete Choice Models Classes of Discrete Variables binary multinomial Further classification of multinomial discrete variables: categorical y = 1, if income < 3000 e. y = 2, if income between 3000 and 5000 e. y = 3, if income > 5000 e. non categorical y = number of cars in the household. Further classification of categorical variables depends if they have a natural order or sequence. Econometrics 2 (Summer 2009) 6 / 35
7 Discrete Choice Models Classes of Discrete Variables nominal/ unordered categorical y = 1, if the mode of transport is by car. y = 2, if the mode of transport is by bus. y = 3, if the mode of transport is by train. ordered and/ or sequential y = 1, if an individual chooses not to work. y = 2, if an individual wants to work, but can t get a job. y = 3, if the individual works. The characteristics of any discrete variable dictate the methods available for model solution. Econometrics 2 (Summer 2009) 7 / 35
8 Binary Choice Models Theoretical Framework Consider a binary dependent variable y, which has only two possible outcomes (0 and 1), and a vector of explanatory variables x thought to influence the realization of y. The unconditional expectation of the binary variable y is by definition a probability: E(y) = P(y = 1). Further, let the set of explanatory variables x influence the outcome of y. Then, the conditional expectation of y given x is: E(y x) = P(y = 1 x). Econometrics 2 (Summer 2009) 8 / 35
9 Binary Choice Models Theoretical Framework Relate this term to the standard regression analysis: for the conditional expectation y = F (x, β) + u, E(y x) = E(F (X, β) + u x) = F (x, β) + E(u x) = F (x, β). Hence, the standard regression functional F (x, β) is a representation of the conditional expectation of y given x. If the dependent variable in a regression relationship is binary, then the regression functional F (x, β) equates directly to the conditional probability of observing y = 1. Thus, the characteristics of binary choice models crucially depend on the way we specify the regression functional F (x, β). Econometrics 2 (Summer 2009) 9 / 35
10 The Linear Probability Model The Linear Probability Model Consider a binary dependent variable y and a (k dim.) vector of explanatory variables x. We may specify the conditional probability directly as: P(y = 1 x) = F (x, β) = x β. Introducing random disturbances, we have y = x β + u, where u represents the stochastic disturbance term in the relationship, f (u) represents its density and E(u x) = 0 by definition. For a sample of n observations {y i, x i } drawn at random from a population, y i = x i β + u i. OLS-estimation procedures may be applied. Known as the Linear Probability Model (LPM) Econometrics 2 (Summer 2009) 10 / 35
11 The Linear Probability Model Econometrics 2 (Summer 2009) 11 / 35
12 The Linear Probability Model Problems with the LPM disturbance terms are non-normal u i = 1 x i β with probability f (u i) = x i β u i = x i β with probability f (u i) = 1 x i β disturbance terms are heteroskedastic Var(u i ) = E(u 2 i ) = ( x i β) 2 (1 x i β) + (1 x i β) 2 (x i β) = (x i β) (1 x i β) = P(y i = 1 x i ) P(y i = 0 x i ). the conditional expectation is not bounded between zero and one E(y i x i ) = P(y i = 1 x i ) = x i β, which is defined over the entire real line. Econometrics 2 (Summer 2009) 12 / 35
13 Possible Solutions The Linear Probability Model Weighted Least Squares to account for the heteroskedasticity, with weights w i = (x i ˆβ) (1 x i ˆβ), calculated from a first-stage estimation. The adjusted model then becomes y i = x i u i β +. w i w i w i This still does not return probabilities within the range [0, 1]. A better solution is to re-specify or to transform the regression model itself to constrain the probability outcome. Econometrics 2 (Summer 2009) 13 / 35
14 The Linear Probability Model Econometrics 2 (Summer 2009) 14 / 35
15 Probit and Logit Models Probit and Logit Models In general, For the Linear Probability Model, E(y i x i ) = P(y i = 1 x i ) = F (x i, β). F (x i, β) = x i β To solve the probability problem, constrain the outcome F (x i, β) to the interval [0, 1]. Which alternatives do we know? Econometrics 2 (Summer 2009) 15 / 35
16 Probit and Logit Models The Transformation Approach For the Probit F (x i, β) = Φ(x i β), where Φ represents the cumulative distribution of the standard normal density. For the Logit where Λ(z) = represents the Logistic function. F (x i, β) = Λ(x i β), exp(z) 1 + exp(z) = exp( z) Which effect has the choice of the linkfunction? First, some characteristics: Econometrics 2 (Summer 2009) 16 / 35
17 Probit and Logit Models The Transformation Approach Notice that the functions Φ(z) and Λ(z) are both monotone increasing functions of z. Moreover, in both cases, F (x i, β) 0 falls x i β, F (x i, β) 1 falls x i β +. So, Probit and Logit models both return well-defined probabilities. However, because the transformed regression function is non-linear in β, we can no longer use OLS and must move to ML techniques. The LPM might therefore be considered a first-order approximation to the arbitrary non-linear probability function F ( ). That is F (x, β) F (x 0, β) + (x x 0 ) F (x 0, β) β = x β 0. using a first-order Taylor series expansion around x = x 0. Econometrics 2 (Summer 2009) 17 / 35
18 Probit and Logit Models Latent Variable Assume that there is some underlying (and unobserved) latent propensity variable y where y (, ). Whilst we do not observe y directly, we do observe a binary outcome y such that y = 1I{y > 0}. where 1I is termed the indicator function, taking the value 1 if the condition within parentheses is satisfied, and 0 otherwise. Define the latent equation in linear form: y = x β + u, where u is random with symmetric density f and corresponding cumulative density function F. Econometrics 2 (Summer 2009) 18 / 35
19 Probit and Logit Models Latent Variable We now have that E(y x) = P(y = 1 x) = P(y > 0 x) = P(x β + u > 0) = P(u > x β) = 1 F ( x β) = F (x β). By specifying an appropriate distribution function for u, we can derive the Probit and Logit models. When u is assumed normally distributed, parameters must be scaled to force the variance of u to σ 2 = Var(u) = 1. Why? P(y = 1 x) = P(u > x β) = P(u/σ > x (β/σ)) = P(z > x (β/σ)) = Φ(x (β/σ)). Econometrics 2 (Summer 2009) 19 / 35
20 Probit and Logit Models Theoretical Foundations Suppose, y = 1 represents a person that works, and y = 0 one that doesn t. Consider state-specific utilities U y : U y=1 = x β 1 + u 1, U y=0 = x β 0 + u 0. Participation in the work force requires that Uy=1 > U y=0, such that y = 1I{U y=1 > U y=0} = 1I{x β 1 + u 1 > x β 0 + u 0 } = 1I{u 1 u 0 > x (β 1 β 0 )}. Identify the difference β 1 β 0. Hence, where y = 1I{y > 0}, y = x (β 1 β 0 ) + (u 1 u 0 ) = x β + u. Econometrics 2 (Summer 2009) 20 / 35
21 ML Estimation ML Estimation Consider a sample of n observations {y i, x i }, where y i is binary. Assume y i = 1I{y i > 0} for yi = x i β + u i. For any vector β, the probability of observing y i conditional on x i is: Taking logs, L(β x i ) = = n P(y i x i, β) i=1 n P(y i = 0 x i, β) 1 y i P(y i = 1 x i, β) y i. i=1 ln L(β x i ) = n {(1 y i ) ln P(y i = 0 x i, β) + y i ln P(y i = 1 x i, β)}. i=1 Econometrics 2 (Summer 2009) 21 / 35
22 ML Estimation For the Probit model, P(y i = 1 x i, β) = Φ(x i β), P(y i = 0 x i, β) = 1 Φ(x i β) giving a log-likelihood of the form n ln L(β x i ) = {(1 y i ) ln(1 Φ(x i β)) + y i ln Φ(x i β)}. i=1 For the Logit model, to give ln L(β x i ) = P(y i = 1 x i, β) = Λ(x i β) = exp(x i β) 1 + exp(x i β), P(y i = 0 x i, β) = 1 Λ(x i 1 β) = 1 + exp(x i β) n {(1 y i ) ln(1 Λ(x i β)) + y i ln Λ(x i β)}. i=1 Econometrics 2 (Summer 2009) 22 / 35
23 First Order Conditions ML Estimation Parameters which maximize the general log likelihood require that For the Probit, S(β) = ln L(β x i) β = 0. For the Logit, S(β) = n i=1 S(β) = y i Φ(x i β) Φ(x i β) (1 Φ(x i β)) φ(x i β) x i. n i=1 [ y i exp(x i β) ] 1 + exp(x i β) x i. Solution to ML is obtained by finding parameters for which S(β) = 0. Econometrics 2 (Summer 2009) 23 / 35
24 Interpretation Binary Choice Models Let s keep concentrating on the following Binary Choice models: LPM P(y i = 1 x i, β) = x i β Probit P(y i = 1 x i, β) = Φ(x i β) Logit P(y i = 1 x i, β) = Λ(x i β). If β j is positive (negative), then P(y i = 1 x i, β) = F (x i β) will increase (decrease) with an increase in x j. Econometrics 2 (Summer 2009) 24 / 35
25 Marginal Effects Interpretation LPM Probit Logit P(y i = 1 x i, β) x ij P(y i = 1 x i, β) x ij = β j P(y i = 1 x i, β) x ij = = φ(x i β) β j exp(x i β) (1 + exp(x i β))2 β j Implications Slope estimates are not directly comparable. E. g. variance of disturbances in Logit model and the Probit model are different. Hence the parameters are also scaled differently. Econometrics 2 (Summer 2009) 25 / 35
26 Interpretation Econometrics 2 (Summer 2009) 26 / 35
27 Marginal Effects Interpretation Notice also: The marginal effects in the LPM are constant (i. e. independent of the data). The marginal effects in the Probit and Logit models depend on x i. A popular transformation: β LPM 0.25 β L for the slopes, and β LPM 0.25 β L for the intercept. β P β L. Econometrics 2 (Summer 2009) 27 / 35
28 Interpretation An Empirical Example: childcare take-up estimates Parameter Estimates Variable LPM Probit Logit single woman other children aged woman works left school at attended college/uni youngest child aged youngest child aged receives maintenance constant Datasource: 1991/92 General Household Survey, from which a random sample of n = 1288 women was taken, which are responsible for at least one child in pre-school-age. Econometrics 2 (Summer 2009) 28 / 35
29 Interpretation An Empirical Example: childcare take-up estimates The dependent variable is 1, if the woman pays for childcare, else 0. The reference in all cases is a married woman who doesn t work, has left school at 16, has one child aged less than two and who receives no maintenance. For the reference household, all explanatory variables take a value of 0, which leads to probability estimates in each model of: LPM P(y i = 1 x i ) = x i β = 0.153, Probit P(y i = 1 x i ) = Φ(x i β) = Φ( 0.995) = 0.161, Logit P(y i = 1 x i ) = Λ(x i β) = exp( 1.645) 1 + exp( 1.645) = Econometrics 2 (Summer 2009) 29 / 35
30 Interpretation An Empirical Example: childcare take-up estimates How, for example, does the probability change for women who attend university? LPM P(y i = 1 x i ) = x i β = = 0.313, Probit P(y i = 1 x i ) = Φ(x i β) = Φ( ) = Φ( 0.537) = 0.296, Logit P(y i = 1 x i ) = Λ(x i β) = exp( ) 1 + exp( ) = Econometrics 2 (Summer 2009) 30 / 35
31 Statistical Inference Binary Choice Models: statistical inference For the LPM, estimated standard errors are easily derived and evaluated. But remember, LPM is heteroskedastic. For the Probit and Logit models, where I is the Fisher-Information. n ( β β ) asym. N(0, I ( β) 1 ), Computer software for ML estimation evaluates the variance-covariance matrix V ( β) directly. Hence, statistical inference and hypothesis testing can be carried out using standard inferential techniques. Econometrics 2 (Summer 2009) 31 / 35
32 Goodness of fit Statistical Inference Let L UR represent likelihoods for the full model. Let L R represent likelihoods for a restricted model estimated on an intercept alone. Then the formulation for two proposed measures are as follows: Cragg Uhler pseudo R 2 = L2/n UR L2/n R 1 L 2/n R McFadden pseudo R 2 = 1 ln L UR ln L R. Remember: the classical and adjusted R 2 can t be used. (Why?), Econometrics 2 (Summer 2009) 32 / 35
33 Goodness of fit Statistical Inference An alternative outcome-based measure: the proportion of correct predictions. For P i = P(y i = 1 x i ), eg. Φ(x i β) (Probit), let ỹ i = 1I{ P i > 0.5} Define the proportion of correct predictions as P = 1 n 1I{y i = ỹ i }. n i=1 In many statistic computer programs, you can see tables of predicted and observes binary values: predicted observed n 00 n 01 1 n 10 n 11 This measure should be avoided. It doesn t make sense if one of the two conditions is hardly represented in the random sample. Econometrics 2 (Summer 2009) 33 / 35
34 Statistical Inference Testing the Overall Significance of the Regression Let L UR represent likelihoods for the full model. Let L R represent likelihoods for a restricted model. r represents the number of restrictions imposed. Then: 2 ln(l R /L UR ) = 2(ln L UR ln L R ) χ 2 r. For example: H 0 : β 2 = β 3 = = β k = 0 H A : at least one β j 0, j = 2,..., k. Econometrics 2 (Summer 2009) 34 / 35
35 Transitions in Binary Choice models Transitions in Binary Choice models Let y i = x i β + u i. Imagine a change from x i to x R i x i. This clearly alters the latent variable from y i to Measure used often: Odds Ratio y R i = (x R i ) β + u i. R(x i ) = P(y i = 1 x i )/P(y i = 0 x i ) But how does this exogenous shock impact on the probability P i(j k) of transition from any state j to k, (j, k = 0, 1)? Hence, we need probabilities P i(0 1) = P(y R i > 0 y i < 0). Econometrics 2 (Summer 2009) 35 / 35
LOGIT AND PROBIT ANALYSIS
LOGIT AND PROBIT ANALYSIS A.K. Vasisht I.A.S.R.I., Library Avenue, New Delhi 110 012 amitvasisht@iasri.res.in In dummy regression variable models, it is assumed implicitly that the dependent variable Y
More informationMultiple Choice Models II
Multiple Choice Models II Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Multiple Choice Models II 1 / 28 Categorical data Categorical
More informationThe Probit Link Function in Generalized Linear Models for Data Mining Applications
Journal of Modern Applied Statistical Methods Copyright 2013 JMASM, Inc. May 2013, Vol. 12, No. 1, 164-169 1538 9472/13/$95.00 The Probit Link Function in Generalized Linear Models for Data Mining Applications
More informationStandard errors of marginal effects in the heteroskedastic probit model
Standard errors of marginal effects in the heteroskedastic probit model Thomas Cornelißen Discussion Paper No. 320 August 2005 ISSN: 0949 9962 Abstract In non-linear regression models, such as the heteroskedastic
More informationSAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
More informationOrdinal Regression. Chapter
Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationWhat s New in Econometrics? Lecture 8 Cluster and Stratified Sampling
What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling Jeff Wooldridge NBER Summer Institute, 2007 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of Groups and
More informationRegression with a Binary Dependent Variable
Regression with a Binary Dependent Variable Chapter 9 Michael Ash CPPA Lecture 22 Course Notes Endgame Take-home final Distributed Friday 19 May Due Tuesday 23 May (Paper or emailed PDF ok; no Word, Excel,
More informationMultinomial and Ordinal Logistic Regression
Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,
More informationLinear Threshold Units
Linear Threshold Units w x hx (... w n x n w We assume that each feature x j and each weight w j is a real number (we will relax this later) We will study three different algorithms for learning linear
More informationBayesX - Software for Bayesian Inference in Structured Additive Regression
BayesX - Software for Bayesian Inference in Structured Additive Regression Thomas Kneib Faculty of Mathematics and Economics, University of Ulm Department of Statistics, Ludwig-Maximilians-University Munich
More informationTeaching model: C1 a. General background: 50% b. Theory-into-practice/developmental 50% knowledge-building: c. Guided academic activities:
1. COURSE DESCRIPTION Degree: Double Degree: Derecho y Finanzas y Contabilidad (English teaching) Course: STATISTICAL AND ECONOMETRIC METHODS FOR FINANCE (Métodos Estadísticos y Econométricos en Finanzas
More informationUsing the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes JunXuJ.ScottLong Indiana University August 22, 2005 The paper provides technical details on
More informationStatistical Machine Learning
Statistical Machine Learning UoC Stats 37700, Winter quarter Lecture 4: classical linear and quadratic discriminants. 1 / 25 Linear separation For two classes in R d : simple idea: separate the classes
More informationHURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. A General Formulation 3. Truncated Normal Hurdle Model 4. Lognormal
More informationECON 523 Applied Econometrics I /Masters Level American University, Spring 2008. Description of the course
ECON 523 Applied Econometrics I /Masters Level American University, Spring 2008 Instructor: Maria Heracleous Lectures: M 8:10-10:40 p.m. WARD 202 Office: 221 Roper Phone: 202-885-3758 Office Hours: M W
More informationChapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
More informationLinear Classification. Volker Tresp Summer 2015
Linear Classification Volker Tresp Summer 2015 1 Classification Classification is the central task of pattern recognition Sensors supply information about an object: to which class do the object belong
More informationAuxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus
Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives
More informationLogit Models for Binary Data
Chapter 3 Logit Models for Binary Data We now turn our attention to regression models for dichotomous data, including logistic regression and probit analysis. These models are appropriate when the response
More informationSolución del Examen Tipo: 1
Solución del Examen Tipo: 1 Universidad Carlos III de Madrid ECONOMETRICS Academic year 2009/10 FINAL EXAM May 17, 2010 DURATION: 2 HOURS 1. Assume that model (III) verifies the assumptions of the classical
More informationFrom the help desk: Bootstrapped standard errors
The Stata Journal (2003) 3, Number 1, pp. 71 80 From the help desk: Bootstrapped standard errors Weihua Guan Stata Corporation Abstract. Bootstrapping is a nonparametric approach for evaluating the distribution
More informationLOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as
LOGISTIC REGRESSION Nitin R Patel Logistic regression extends the ideas of multiple linear regression to the situation where the dependent variable, y, is binary (for convenience we often code these values
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 6 Three Approaches to Classification Construct
More informationCHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES
Examples: Monte Carlo Simulation Studies CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES Monte Carlo simulation studies are often used for methodological investigations of the performance of statistical
More informationPoisson Models for Count Data
Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the
More informationLogit and Probit. Brad Jones 1. April 21, 2009. University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science
Logit and Probit Brad 1 1 Department of Political Science University of California, Davis April 21, 2009 Logit, redux Logit resolves the functional form problem (in terms of the response function in the
More informationfor an appointment, e-mail j.adda@ucl.ac.uk
M.Sc. in Economics Department of Economics, University College London Econometric Theory and Methods (G023) 1 Autumn term 2007/2008: weeks 2-8 Jérôme Adda for an appointment, e-mail j.adda@ucl.ac.uk Introduction
More informationCHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS
Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships
More informationESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Quantile Treatment Effects 2. Control Functions
More informationEconometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
More informationStatistical Methods for research in International Relations and Comparative Politics
James Raymond Vreeland Dept. of Political Science Assistant Professor Yale University E-Mail: james.vreeland@yale.edu Room 300 Tel: 203-432-5252 124 Prospect Avenue Office hours: Wed. 10am to 12pm New
More informationOn Marginal Effects in Semiparametric Censored Regression Models
On Marginal Effects in Semiparametric Censored Regression Models Bo E. Honoré September 3, 2008 Introduction It is often argued that estimation of semiparametric censored regression models such as the
More informationTechniques of Statistical Analysis II second group
Techniques of Statistical Analysis II second group Bruno Arpino Office: 20.182; Building: Jaume I bruno.arpino@upf.edu 1. Overview The course is aimed to provide advanced statistical knowledge for the
More informationFinancial Vulnerability Index (IMPACT)
Household s Life Insurance Demand - a Multivariate Two Part Model Edward (Jed) W. Frees School of Business, University of Wisconsin-Madison July 30, 1 / 19 Outline 1 2 3 4 2 / 19 Objective To understand
More informationA Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit non-response. In a survey, certain respondents may be unreachable or may refuse to participate. Item
More informationFIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS
FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038
More informationLogistic regression modeling the probability of success
Logistic regression modeling the probability of success Regression models are usually thought of as only being appropriate for target variables that are continuous Is there any situation where we might
More informationExample: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
More informationComparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors
Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors Arthur Lewbel, Yingying Dong, and Thomas Tao Yang Boston College, University of California Irvine, and Boston
More informationLAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE
LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-
More information1 Teaching notes on GMM 1.
Bent E. Sørensen January 23, 2007 1 Teaching notes on GMM 1. Generalized Method of Moment (GMM) estimation is one of two developments in econometrics in the 80ies that revolutionized empirical work in
More informationCIDE XXIV Corso Residenziale di Econometria
CIDE XXIV Corso Residenziale di Econometria 8-13 Settembre 2014 - Palermo Topics in Microeconometrics with Applications to Energy & Environmental Economics General Information Students requiring accommodation
More informationSAMPLE SELECTION BIAS IN CREDIT SCORING MODELS
SAMPLE SELECTION BIAS IN CREDIT SCORING MODELS John Banasik, Jonathan Crook Credit Research Centre, University of Edinburgh Lyn Thomas University of Southampton ssm0 The Problem We wish to estimate an
More informationIt is important to bear in mind that one of the first three subscripts is redundant since k = i -j +3.
IDENTIFICATION AND ESTIMATION OF AGE, PERIOD AND COHORT EFFECTS IN THE ANALYSIS OF DISCRETE ARCHIVAL DATA Stephen E. Fienberg, University of Minnesota William M. Mason, University of Michigan 1. INTRODUCTION
More informationANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION
ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? SAMUEL H. COX AND YIJIA LIN ABSTRACT. We devise an approach, using tobit models for modeling annuity lapse rates. The approach is based on data provided
More informationPanel Data Econometrics
Panel Data Econometrics Master of Science in Economics - University of Geneva Christophe Hurlin, Université d Orléans University of Orléans January 2010 De nition A longitudinal, or panel, data set is
More informationCREDIT SCORING MODEL APPLICATIONS:
Örebro University Örebro University School of Business Master in Applied Statistics Thomas Laitila Sune Karlsson May, 2014 CREDIT SCORING MODEL APPLICATIONS: TESTING MULTINOMIAL TARGETS Gabriela De Rossi
More informationStatistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
More informationVI. Introduction to Logistic Regression
VI. Introduction to Logistic Regression We turn our attention now to the topic of modeling a categorical outcome as a function of (possibly) several factors. The framework of generalized linear models
More informationLinda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
More informationStatistics in Retail Finance. Chapter 6: Behavioural models
Statistics in Retail Finance 1 Overview > So far we have focussed mainly on application scorecards. In this chapter we shall look at behavioural models. We shall cover the following topics:- Behavioural
More informationLogistic Regression (1/24/13)
STA63/CBB540: Statistical methods in computational biology Logistic Regression (/24/3) Lecturer: Barbara Engelhardt Scribe: Dinesh Manandhar Introduction Logistic regression is model for regression used
More informationFrom the help desk: hurdle models
The Stata Journal (2003) 3, Number 2, pp. 178 184 From the help desk: hurdle models Allen McDowell Stata Corporation Abstract. This article demonstrates that, although there is no command in Stata for
More informationMULTIPLE REGRESSION WITH CATEGORICAL DATA
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting
More informationINDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)
INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition) Abstract Indirect inference is a simulation-based method for estimating the parameters of economic models. Its
More informationMultinomial Logistic Regression
Multinomial Logistic Regression Dr. Jon Starkweather and Dr. Amanda Kay Moske Multinomial logistic regression is used to predict categorical placement in or the probability of category membership on a
More informationON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS
ON THE ROBUSTNESS OF FIXED EFFECTS AND RELATED ESTIMATORS IN CORRELATED RANDOM COEFFICIENT PANEL DATA MODELS Jeffrey M. Wooldridge THE INSTITUTE FOR FISCAL STUDIES DEPARTMENT OF ECONOMICS, UCL cemmap working
More informationRegression III: Advanced Methods
Lecture 4: Transformations Regression III: Advanced Methods William G. Jacoby Michigan State University Goals of the lecture The Ladder of Roots and Powers Changing the shape of distributions Transforming
More informationCalculating the Probability of Returning a Loan with Binary Probability Models
Calculating the Probability of Returning a Loan with Binary Probability Models Associate Professor PhD Julian VASILEV (e-mail: vasilev@ue-varna.bg) Varna University of Economics, Bulgaria ABSTRACT The
More informationRegression for nonnegative skewed dependent variables
Regression for nonnegative skewed dependent variables July 15, 2010 Problem Solutions Link to OLS Alternatives Introduction Nonnegative skewed outcomes y, e.g. labor earnings medical expenditures trade
More informationDealing with Missing Data
Dealing with Missing Data Roch Giorgi email: roch.giorgi@univ-amu.fr UMR 912 SESSTIM, Aix Marseille Université / INSERM / IRD, Marseille, France BioSTIC, APHM, Hôpital Timone, Marseille, France January
More information2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models General Linear Models - part I Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs. Lyngby
More informationElements of statistics (MATH0487-1)
Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -
More informationLecture 3: Linear methods for classification
Lecture 3: Linear methods for classification Rafael A. Irizarry and Hector Corrada Bravo February, 2010 Today we describe four specific algorithms useful for classification problems: linear regression,
More informationMarkov Chain Monte Carlo Simulation Made Simple
Markov Chain Monte Carlo Simulation Made Simple Alastair Smith Department of Politics New York University April2,2003 1 Markov Chain Monte Carlo (MCMC) simualtion is a powerful technique to perform numerical
More informationLeast Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
More informationASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS
DATABASE MARKETING Fall 2015, max 24 credits Dead line 15.10. ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS PART A Gains chart with excel Prepare a gains chart from the data in \\work\courses\e\27\e20100\ass4b.xls.
More informationWhy Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
More informationNominal and ordinal logistic regression
Nominal and ordinal logistic regression April 26 Nominal and ordinal logistic regression Our goal for today is to briefly go over ways to extend the logistic regression model to the case where the outcome
More informationCAPM, Arbitrage, and Linear Factor Models
CAPM, Arbitrage, and Linear Factor Models CAPM, Arbitrage, Linear Factor Models 1/ 41 Introduction We now assume all investors actually choose mean-variance e cient portfolios. By equating these investors
More informationMortgage Loan Approvals and Government Intervention Policy
Mortgage Loan Approvals and Government Intervention Policy Dr. William Chow 18 March, 214 Executive Summary This paper introduces an empirical framework to explore the impact of the government s various
More informationHandling attrition and non-response in longitudinal data
Longitudinal and Life Course Studies 2009 Volume 1 Issue 1 Pp 63-72 Handling attrition and non-response in longitudinal data Harvey Goldstein University of Bristol Correspondence. Professor H. Goldstein
More informationIntroduction to Fixed Effects Methods
Introduction to Fixed Effects Methods 1 1.1 The Promise of Fixed Effects for Nonexperimental Research... 1 1.2 The Paired-Comparisons t-test as a Fixed Effects Method... 2 1.3 Costs and Benefits of Fixed
More information11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
More informationGeneralized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
More informationLecture 6: Poisson regression
Lecture 6: Poisson regression Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction EDA for Poisson regression Estimation and testing in Poisson regression
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationModels for Longitudinal and Clustered Data
Models for Longitudinal and Clustered Data Germán Rodríguez December 9, 2008, revised December 6, 2012 1 Introduction The most important assumption we have made in this course is that the observations
More informationChapter 10: Basic Linear Unobserved Effects Panel Data. Models:
Chapter 10: Basic Linear Unobserved Effects Panel Data Models: Microeconomic Econometrics I Spring 2010 10.1 Motivation: The Omitted Variables Problem We are interested in the partial effects of the observable
More informationThe Cobb-Douglas Production Function
171 10 The Cobb-Douglas Production Function This chapter describes in detail the most famous of all production functions used to represent production processes both in and out of agriculture. First used
More informationMultivariate Logistic Regression
1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation
More informationInstitute of Actuaries of India Subject CT3 Probability and Mathematical Statistics
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in
More informationPS 271B: Quantitative Methods II. Lecture Notes
PS 271B: Quantitative Methods II Lecture Notes Langche Zeng zeng@ucsd.edu The Empirical Research Process; Fundamental Methodological Issues 2 Theory; Data; Models/model selection; Estimation; Inference.
More informationEfficient and Practical Econometric Methods for the SLID, NLSCY, NPHS
Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS Philip Merrigan ESG-UQAM, CIRPÉE Using Big Data to Study Development and Social Change, Concordia University, November 2103 Intro Longitudinal
More informationCS229 Lecture notes. Andrew Ng
CS229 Lecture notes Andrew Ng Part X Factor analysis Whenwehavedatax (i) R n thatcomesfromamixtureofseveral Gaussians, the EM algorithm can be applied to fit a mixture model. In this setting, we usually
More informationSection Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
More informationMULTIVARIATE PROBABILITY DISTRIBUTIONS
MULTIVARIATE PROBABILITY DISTRIBUTIONS. PRELIMINARIES.. Example. Consider an experiment that consists of tossing a die and a coin at the same time. We can consider a number of random variables defined
More informationFDI as a source of finance in imperfect capital markets Firm-Level Evidence from Argentina
FDI as a source of finance in imperfect capital markets Firm-Level Evidence from Argentina Paula Bustos CREI and Universitat Pompeu Fabra September 2007 Abstract In this paper I analyze the financing and
More informationThe Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.
The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables Kathleen M. Lang* Boston College and Peter Gottschalk Boston College Abstract We derive the efficiency loss
More informationThe Assessment of Fit in the Class of Logistic Regression Models: A Pathway out of the Jungle of Pseudo-R²s
The Assessment of Fit in the Class of Logistic Regression Models: A Pathway out of the Jungle of Pseudo-R²s Dr. Wolfgang Langer Martin-Luther-University of Halle-Wittenberg Institute of Sociology 2000
More informationChapter 4: Vector Autoregressive Models
Chapter 4: Vector Autoregressive Models 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie IV.1 Vector Autoregressive Models (VAR)...
More informationIAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
More informationPractical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University
Practical I conometrics data collection, analysis, and application Christiana E. Hilmer Michael J. Hilmer San Diego State University Mi Table of Contents PART ONE THE BASICS 1 Chapter 1 An Introduction
More informationShort title: Measurement error in binary regression. T. Fearn 1, D.C. Hill 2 and S.C. Darby 2. of Oxford, Oxford, U.K.
Measurement error in the explanatory variable of a binary regression: regression calibration and integrated conditional likelihood in studies of residential radon and lung cancer Short title: Measurement
More informationAileen Murphy, Department of Economics, UCC, Ireland. WORKING PAPER SERIES 07-10
AN ECONOMETRIC ANALYSIS OF SMOKING BEHAVIOUR IN IRELAND Aileen Murphy, Department of Economics, UCC, Ireland. DEPARTMENT OF ECONOMICS WORKING PAPER SERIES 07-10 1 AN ECONOMETRIC ANALYSIS OF SMOKING BEHAVIOUR
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More informationLOGISTIC REGRESSION ANALYSIS
LOGISTIC REGRESSION ANALYSIS C. Mitchell Dayton Department of Measurement, Statistics & Evaluation Room 1230D Benjamin Building University of Maryland September 1992 1. Introduction and Model Logistic
More information