Regression with a Binary Dependent Variable



Similar documents
Logit and Probit. Brad Jones 1. April 21, University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science

LOGIT AND PROBIT ANALYSIS

Multinomial and Ordinal Logistic Regression

2. Linear regression with multiple regressors

Standard errors of marginal effects in the heteroskedastic probit model

Classification Problems

Logit Models for Binary Data

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes

ECON 523 Applied Econometrics I /Masters Level American University, Spring Description of the course

VI. Introduction to Logistic Regression

SAS Software to Fit the Generalized Linear Model

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Poisson Models for Count Data

From the help desk: Bootstrapped standard errors

Generalized Linear Models

Nominal and ordinal logistic regression

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Handling missing data in Stata a whirlwind tour

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University

Discussion Section 4 ECON 139/ Summer Term II

Introduction to Quantitative Methods

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009

Nonlinear Regression Functions. SW Ch 8 1/54/

CREDIT SCORING MODEL APPLICATIONS:

Ordinal Regression. Chapter

Comparing Features of Convenient Estimators for Binary Choice Models With Endogenous Regressors

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Efficient and Practical Econometric Methods for the SLID, NLSCY, NPHS

ANNUITY LAPSE RATE MODELING: TOBIT OR NOT TOBIT? 1. INTRODUCTION

LOGISTIC REGRESSION. Nitin R Patel. where the dependent variable, y, is binary (for convenience we often code these values as

Unit 12 Logistic Regression Supplementary Chapter 14 in IPS On CD (Chap 16, 5th ed.)

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

Applying Survival Analysis Techniques to Loan Terminations for HUD s Reverse Mortgage Insurance Program - HECM

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

Regression for nonnegative skewed dependent variables

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Parametric Survival Models

Regression III: Advanced Methods

CSI 333 Lecture 1 Number Systems

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

11. Analysis of Case-control Studies Logistic Regression

Statistics in Retail Finance. Chapter 2: Statistical models of default

SUMAN DUVVURU STAT 567 PROJECT REPORT

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

XPost: Excel Workbooks for the Post-estimation Interpretation of Regression Models for Categorical Dependent Variables

Logistic Regression (1/24/13)

SAMPLE SELECTION BIAS IN CREDIT SCORING MODELS

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Interpretation of Somers D under four simple models

Calculating the Probability of Returning a Loan with Binary Probability Models

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, Last revised February 21, 2015

STAT 360 Probability and Statistics. Fall 2012

Normal distribution. ) 2 /2σ. 2π σ

Local classification and local likelihoods

Lesson 9 Hypothesis Testing

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Weak instruments: An overview and new techniques

PROBABILITY AND STATISTICS. Ma To teach a knowledge of combinatorial reasoning.

Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

Lecture 2: Discrete Distributions, Normal Distributions. Chapter 1

Introduction to Regression and Data Analysis

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

Multiple Choice Models II

13. Poisson Regression Analysis

Pr(X = x) = f(x) = λe λx

CALCULATIONS & STATISTICS

Regression III: Advanced Methods

Keep It Simple: Easy Ways To Estimate Choice Models For Single Consumers

COMP 250 Fall 2012 lecture 2 binary representations Sept. 11, 2012

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

MATHEMATICAL METHODS OF STATISTICS

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

Study Guide for the Final Exam

Binary Adders: Half Adders and Full Adders

Lecture 6: Discrete & Continuous Probability and Random Variables

Normal Distribution as an Approximation to the Binomial Distribution

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Covariance and Correlation

USING LOGISTIC REGRESSION TO PREDICT CUSTOMER RETENTION. Andrew H. Karp Sierra Information Services, Inc. San Francisco, California USA

Prime numbers and prime polynomials. Paul Pollack Dartmouth College

The Probit Link Function in Generalized Linear Models for Data Mining Applications

Logistic regression modeling the probability of success

PS 271B: Quantitative Methods II. Lecture Notes

Random variables, probability distributions, binomial random variable

Econometrics Simple Linear Regression

Elements of statistics (MATH0487-1)

Probit Analysis By: Kim Vincent

Deploying Regional Jets to Add New Spokes to a Hub. Ian Savage* and Burgess Scott Northwestern University

Chapter 3 RANDOM VARIATE GENERATION

STATISTICA Formula Guide: Logistic Regression. Table of Contents

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy

Transcription:

Regression with a Binary Dependent Variable Chapter 9 Michael Ash CPPA Lecture 22

Course Notes Endgame Take-home final Distributed Friday 19 May Due Tuesday 23 May (Paper or emailed PDF ok; no Word, Excel, etc.) Problem Set 7 Optional, worth up to 2 percentage points of extra credit Due Friday 19 May Regression with a Binary Dependent Variable

Binary Dependent Variables Outcome can be coded 1 or 0 (yes or no, approved or denied, success or failure) Examples? Interpret the regression as modeling the probability that the dependent variable equals one (Y = 1). Recall that for a binary variable, E(Y ) = Pr(Y = 1)

HMDA example Outcome: loan denial is coded 1, loan approval 0 Key explanatory variable: black Other explanatory variables: P/I, credit history, LTV, etc.

Linear Probability Model (LPM) Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i Simply run the OLS regression with binary Y. β 1 expresses the change in probability that Y = 1 associated with a unit change in X 1. Ŷ i expresses the probability that Y i = 1 Pr(Y = 1 X 1, X 2,..., X k ) = β 0 +β 1 X 1 +β 2 X 2 + +β k X k = Ŷ

Shortcomings of the LPM Nonconforming Predicted Probabilities Probabilities must logically be between 0 and 1, but the LPM can predict probabilities outside this range. Heteroskedastic by construction (always use robust standard errors)

Probit and Logit Regression Addresses nonconforming predicted probabilities in the LPM Basic strategy: bound predicted values between 0 and 1 by transforming a linear index, β 0 + β 1 X 1 + β 2 X 2 + + β k X k, which can range over (, ) into something that ranges over [0, 1] When the index is big and positive, Pr(Y = 1) 1. When the index is big and negative, Pr(Y = 1) 0. How to transform? Use a Cumulative Distribution Function.

Probit Regression The CDF is the cumulative standard normal distribution, Φ. The index β 0 + β 1 X 1 + β 2 X 2 + + β k X k is treated as a z-score. Pr(Y = 1 X 1, X 2,..., X k ) = Φ(β 0 + β 1 X 1 + β 2 X 2 + + β k X k )

Interpreting the results Pr(Y = 1 X 1, X 2,..., X k ) = Φ(β 0 + β 1 X 1 + β 2 X 2 + + β k X k ) β j positive (negative) means that an increase in X j increases (decreases) the probability of Y = 1. β j reports how the index changes with a change in X, but the index is only an input to the CDF. The size of β j is hard to interpret because the change in probability for a change in X j is non-linear, depends on all X 1, X 2,..., X k. Easiest approach to interpretation is computing the predicted probability Ŷ for alternative values of X Same interpretation of standard errors, hypothesis tests, and confidence intervals as with OLS

HMDA example See Figure 9.2 Pr(deny = 1 P/I, black) = Φ( 2.26 + 2.74 P/I + 0.71 black) (0.16) (0.44) (0.083) White applicant with P/I = 0.3: Pr(deny = 1 P/I, black) = Φ( 2.26 + 2.74 0.3 + 0.71 0) = Φ( 1.44) = 7.5% Black applicant with P/I = 0.3: Pr(deny = 1 P/I, black) = Φ( 2.26 + 2.74 0.3 + 0.71 1) = Φ( 0.71) = 23.3%

Logit or Logistic Regression Logit, or logistic regression, uses a slightly different functional form of the CDF (the logistic function) instead of the standard normal CDF. The coefficients of the index can look different, but the probability results are usually very similar to the results from probit and from the LPM. Aside from the problem with non-conforming probabilities in the LPM, the three models generate similar predicted probabilities.

Estimation of Logit and Probit Models OLS (and LPM, which is an application of OLS) has a closed-form formula for ˆβ Logit and Probit require numerical methods to find ˆβ s that best fit the data.

Nonlinear Least Squares One approach is to choose coefficients b 0, b 1,..., b k that minimize the sum of squares of how far the actual outcome, Y i, is from the prediction, Φ(b 0 + b 1 X 1i + + b k X ki ). n [Y i Φ(b 0 + b 1 X 1i + + b k X ki )] 2 i=i

Maximum Likelihood Estimation An alternative approach is to choose coefficients b 0, b 1,..., b k that make the current sample, Y 1,..., Y n as likely as possible to have occurred. For example, if you observe data {4, 6, 8}, the predicted mean that would make this sample most likely to occur is ˆµ MLE = 6. Stata probit and logistic regression (logit) commands are under Statistics Binary Outcomes

Inference and Measures of Fit Standard errors, hypothesis tests, and confidence intervals are exactly as in OLS, but they refer to the coefficients and must be translated into probabilities by applying the appropriate CDF. fraction correctly predicted using one probability cut-off, e.g., 0.50, and check the fraction correctly predicted, but... sensitivity/specificity Choose a cut-off. Sensitivity is the fraction of observed positive-outcomes that are correctly classified. Specificity is the fraction of observed negative outcomes that are correctly specified. Pseudo-R 2 is analogous to the R 2 Expresses the predictive quality of the model with explanatory variables relative to the predictive quality of the sample proportion p of cases where Y i = 1 Adjusts for adding extra regressors

Sensitivity and Specificity Sensitivity/Specificity 0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 Probability cutoff Sensitivity Specificity

Reviewing the HMDA results (Table 9.2) LPM, logit, probit (minor differences) Four probit specifications Highly robust result: 6.0 to 8.4 percentage-point gap in white-black denial rates, controlling for a wide range of other explanatory variables. Internal Validity External Validity

Other LDV Models Limited Dependent Variable (LDV) Count Data (discrete non-negative integers), Y 0, 1, 2,..., k with k small. Poisson or negative binomial regression. Ordered Responses, e.g., completed educational credentials. Ordered logit or probit. Discrete Choice Data, e.g., mode of travel. Characteristics of choice, chooser, and interaction. Multinomial logit or probit, Can sometimes convert to several binary problems. Censored and Truncated Regression Models. Tobit or sample selection models.