IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD



Similar documents
ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

Lecture 15. Endogeneity & Instrumental Variable Estimation

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Econometrics Simple Linear Regression

Can Entrepreneurship Programs Transform the Economic Lives of the Poor?

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Introduction to Fixed Effects Methods

Lecture 3: Differences-in-Differences

Solución del Examen Tipo: 1

Does Microfinance Really Help the Poor? New Evidence from Flagship Programs in Bangladesh

UNIVERSITY OF WAIKATO. Hamilton New Zealand

Chapter 10: Basic Linear Unobserved Effects Panel Data. Models:

16 : Demand Forecasting

A LONGITUDINAL AND SURVIVAL MODEL WITH HEALTH CARE USAGE FOR INSURED ELDERLY. Workshop

IPDET Module 6: Descriptive, Normative, and Impact Evaluation Designs

14.74 Lecture 11 Inside the household: How are decisions taken within the household?

On Marginal Effects in Semiparametric Censored Regression Models


2. Linear regression with multiple regressors

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

14.74 Lecture 9: Child Labor

14.74 Lecture 7: The effect of school buildings on schooling: A naturalexperiment

The Effect of Housing on Portfolio Choice. July 2009

Chapter 7: Dummy variable regression

Difference in differences and Regression Discontinuity Design

Using Heterogeneous Choice Models. To Compare Logit and Probit Coefficients Across Groups

Poverty Indices: Checking for Robustness

How To Calculate Export Earnings

A Basic Introduction to Missing Data

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Panel Data Analysis in Stata

SYSTEMS OF REGRESSION EQUATIONS

Monitoring & Evaluation for Results. Reconstructing Baseline Data for Monitoring & Evaluation -Data Collection Methods-

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Univariate Regression

August 2012 EXAMINATIONS Solution Part I

Effect of Microfinance Operations on Poor Rural Households and the Status of Women

Forecast. Forecast is the linear function with estimated coefficients. Compute with predict command

FIXED EFFECTS AND RELATED ESTIMATORS FOR CORRELATED RANDOM COEFFICIENT AND TREATMENT EFFECT PANEL DATA MODELS

The Life-Cycle Motive and Money Demand: Further Evidence. Abstract

Financial Risk Management Exam Sample Questions/Answers

Public Health Insurance Expansions for Parents and Enhancement Effects for Child Coverage

Optimal Social Insurance Design: UI Benefit Levels

Using instrumental variables techniques in economics and finance

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

Wooldridge, Introductory Econometrics, 4th ed. Chapter 7: Multiple regression analysis with qualitative information: Binary (or dummy) variables

VI. Real Business Cycles Models

ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Standard errors of marginal effects in the heteroskedastic probit model

Topic 1 - Introduction to Labour Economics. Professor H.J. Schuetze Economics 370. What is Labour Economics?

Introduction to Dynamic Models. Slide set #1 (Ch in IDM).

Econometrics I: Econometric Methods

Can Entrepreneurship Programs Transform the Economic Lives of the Poor?

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

Mgmt 469. Fixed Effects Models. Suppose you want to learn the effect of price on the demand for back massages. You

ESTIMATING AN ECONOMIC MODEL OF CRIME USING PANEL DATA FROM NORTH CAROLINA BADI H. BALTAGI*

Panel Data: Linear Models

Explanatory note on the 2014 Human Development Report composite indices. United Kingdom

Adult Apprenticeships

4. Multiple Regression in Practice

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Correlated Random Effects Panel Data Models

17. SIMPLE LINEAR REGRESSION II

OUTLINE OF PRINCIPLES OF IMPACT EVALUATION

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*

Stata Walkthrough 4: Regression, Prediction, and Forecasting

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

CHAPTER VI ON PRIORITY SECTOR LENDING

3.1 Least squares in matrix form

Why Sample? Why not study everyone? Debate about Census vs. sampling

Regression Analysis (Spring, 2000)

The Wage Return to Education: What Hides Behind the Least Squares Bias?

UNDERSTANDING THE TWO-WAY ANOVA

UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)

PS 271B: Quantitative Methods II. Lecture Notes

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

Models for Longitudinal and Clustered Data

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week (0.052)

Clustering in the Linear Model

Concept Note: Impact evaluation of vocational and entrepreneurship training in Nepal

Explanatory note on the 2014 Human Development Report composite indices. Togo. HDI values and rank changes in the 2014 Human Development Report

Introduction to Quantitative Methods

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

individualdifferences

Government Spending Multipliers in Developing Countries: Evidence from Lending by Official Creditors

3. Regression & Exponential Smoothing

Transcription:

REPUBLIC OF SOUTH AFRICA GOVERNMENT-WIDE MONITORING & IMPACT EVALUATION SEMINAR IMPACT EVALUATION: INSTRUMENTAL VARIABLE METHOD SHAHID KHANDKER World Bank June 2006 ORGANIZED BY THE WORLD BANK AFRICA IMPACT EVALUATION INITIATIVE IN COLLABORATION WITH HUMAN DEVELOPMENT NETWORK AND WORLD BANK INSTITUTE

The main objective of Impact Evaluation. To estimate the treatment effect of an intervention T on an outcome Y For example: What is the effect of an increase in the minimum wage on employment? What is the effect of a school feeding program on learning outcomes? What is the effect of a training program on employment and earnings? What is the effect of micro-finance participation on consumption and employment? 2

The Counterfactual Recall the evaluator s key question is: What would have happened to the beneficiaries if the program had not existed? How do you rewrite history? How do you get baseline data? 3

The Ideal strategy With equivalent control group. Income level 30 17 10 Effect or impact = 13 In theory, a single observation is not enough Beneficiaries Equivalent control group Program Time Extreme care must be taken when selecting the control group to ensure comparability. 4

Problem with ideal strategy is: It is extremely difficult to assemble an exactly comparable control group: Ethical problem (to condemn a group to not being beneficiaries) Difficulties associated with finding an equivalent group outside the project Costs People change behaviour over time Therefore, this approach is practically difficult 5

Examples of less than ideal strategies Comparing individuals before and after the intervention Comparing individuals that participated in the intervention to those that did not Example of a micro-credit program Often targeted not everyone is eligible Some individuals chose to participate and some don t even among the eligible households/individuals Why can t we estimate the program s impact by comparing outcomes of participants to non-participants? 6

Example with less than ideal strategies such as using a before/after comparison 30 Income level 21 17 14 10 Base Line Study Time series? Time Effect or impact? Findings on the impact are inconclusive Beneficiaries Broad descriptive survey Program 7

Comparison with nonequivalent control group Income level 30 21 17 14 10 Effect or impact = 13 Projection for beneficiaries if no policy Beneficiaries control group Program Time Data on before and after situations is needed. 8

Micro-credit program Participation is a choice variable endogenous Observed and unobserved characteristics about participants that make them different from those who did not participate Omitted variables can control for many observed characteristics, but are still missing things that are hard or impossible to measure Our treatment effect is picking up the effect of other characteristics that explain treatment, resulting in a biased estimate of the treatment effect 9

Micro-credit program Estimation: compare outcomes of individuals that chose to participate to those who did not: Where T = 1 if enrolled in program T = 0 if not enrolled in program x = exogenous regressors (i.e. controls) The problem: y = + T + x+ Corr (T, ε) 0 α β β ε 1 2 10

Micro-credit Program Examples of why Corr (T, ε) 0 Different motivation Different ability Different information Different opportunity cost of participating Different level of access There are characteristics of the treated that should be in ε, but are being picked up by T. We have violated one of the key assumptions of OLS: Independence of regressors x from disturbance term ε 11

What can we do about this problem? Try to clean up the correlation between T and ε: Isolate the variation in T that is uncorrelated with ε To do this, we need to find an instrumental variable (IV) OR, design programs with instrumental variables in mind 12

Basic idea behind IV Find a variable Z which satisfies two conditions: 1. Correlated with T: corr (Z, T) 0 2. Uncorrelated with ε: corr (Z, ε) = 0 Examples of Z in the micro-credit program example? interest rate or the price of credit 13

Two Stage Least Squares (2SLS) y = + T + x+ α β β ε 1 2 Stage 1: Regress endogenous variable on the IV (Z) and other exogenous regressors T = δ + δ x+ θ Z + τ 0 1 1 Calculate predicted value for each observation: T hat 14

Two stage Least Squares (2SLS) Stage 2: Regress outcome y on predicted variable (and other exogenous variables) ^ y = α + β ( T) + β x+ ε 1 2 Need to correct Standard Errors (they are based on T hat rather than T) In practice just use STATA - ivreg Intuition: T has been cleaned of its correlation with ε. 15

Two stage Least Squares (2SLS) Another way of presenting the same resul β 1 = Cov(y,z X)/Cov(T,z) Cov(y,z) is called The Reduced Form Cov(T,z) is called The First Stage 16

Example: Grameen Bank impact C is credit demand; Y is outcome variable (consumption, assets, employment, schooling, family planning); Subscripts: household i; village j; male m; female f; time t; Want to allow separate effects on outcome of male and female credit; X represents individual or household characteristics. 17

Measuring Impacts: Cross-Sectional data Credit demand equations: C ijm = X ijm β c + Z ijm γ c + μ c jm + ε c ijm C ijf = X ijf β c + Z ijf γ c + μ c jf + ε c ijf Outcome equation: Y ij = X ij β y + C ijm δ m + C ijf δ f + μ y j + ε y ij Note: the Z represent variables assumed to affect credit demand but to have no direct effect on the outcome. 18

Endogeneity Issues Correlation among μ c jm, μ c jf and μ y j, and among ε c ijm, ε c ijf and ε y ij Estimation that ignores these correlations have endogeneity bias. Endogeneity arises from three sources: 1) Non-random placement of credit programs; 2) Unmeasured village attributes that affect both program credit demand and household outcomes; 3) Unmeasured household attributes that affect both program credit demand and household outcomes. 19

Resolving endogeneity Village-level endogeneity resolved by village FE; Household-level endogeneity resolved by instrumental variables (IV); In credit demand equation Z ijm and Z ijf represent instrumental variables; Selecting Z ij variables is difficult; Possible solution: identification using quasiexperimental survey design. 20

Quasi-experimental survey design Households are sampled from program and non-program villages; Both eligible and non-eligible households are sampled from both types of villages; Both participants and non-participants are sampled from eligible households. Identification conditions: Exogenous land-holding; Gender-based program design. 21

Instruments for identification Exogenous land-holding criteria: - Only households owning up to 0.5 acre of land qualify for program participation. In practice there is some deviation from this cutoff. Gender-based program participation criteria: - Male members of qualifying households cannot participate in program if village does not have a male program group. - Female members of qualifying households cannot participate in program if village does not have a female program group. 22

Construction of Zij variables Male choice=1 if household has up to 0.5 acre of land and village has male credit group =0 otherwise Female choice=1 if household has up to 0.5 acre of land and village has female credit group =0 otherwise Male and female choice variables are interacted with household characteristics to form Z ij Variables. 23

What this means.. Intuitively, the outcome regression now relates variation in Y to variation in C associated with variation in Z. 24

Table 1: Weighted Mean and Standard Deviations of Dependent Variables Dependent Variables Obs. Participants Nonparticipants Obs. Total Obs. Nonprogram areas Obs. Aggregate Obs. Sum of program loans of females (Taka) Sum of program loans by males (Taka) Per capita HH weekly food expenditure (Taka) Per capita HH weekly nonfood expenditure (Taka) 5498.854 (7229.351) 3691.993 (7081.581) 59.166 (19.865) 17.848 (31.538) 779 326 2604.454 (5682.398) 631-263 1729.631 (5184.668) 2696 62.265 (23.256) 2696 23.621 (54.791) 1650 61.242 (22.239) 1650 21.716 (48.439) 1105- - 2604.454 (5682.398) 894 - - 1729.631 (5184.668) 4326 61.985 (23.897) 4346 27.676 (51.409) 872 61.366 (22.522) 872 22.706 (48.990) 1105 895 5218 5218 Per capita HH total weekly expenditure (Taka) 77.014 (41.496) 2696 85.886 (64.820) 1650 82.959 (58.309) 4346 89.661 (66.823) 872 84.072 (59.851) 5218 25

Tables 2: Alternate Estimates of the Impact of Credit on Per Capita Expenditure Explanatory Variables Naive weighted (OLS) Total Food Non-food Log Total Expenditure per Capita Log Total Expenditure per Capita Log Total Expenditure per Capita Naive Naive weighted weighted IV IV-FE (OLS) IV IV-FE (OLS) IV IV-FE Amount borrowed by females from GB.004 (1.765).0317 (2.174).0432 (4.249).005 (2.700).0114 (1.435).0114 (1.263).009 (1.760) -.0184 (-.759).0199 (.467) Amount borrowed by males from GB.001 (.325) -.0225 (-2.291).0179 (1.431) -.001 (-.471) -.0142 (-1.602).0087 (1.163).004 (.476) -.0220 (-.982).0182 (.665) 26

Alternative Models for Estimation (--) Ordinary Least Squares (OLS) estimation of outcome equation with no correction for weights; (-) OLS on outcome equation with correction for sampling weights; (+) IV (2-stage) estimation of outcome equation with weights correction; (++) IV, weights, village fixed effects. 27

Table 3. Comparing Results among Alternative Models: Log-log impacts of GB women s credit on HH per capita consumption Models Coefficient (t-stat) Un-weighted OLS 0.003 (1.400) Weighted OLS 0.004 (1.765) Weighted IV 0.0371 (2.174) Weighted IV, village FE 0.0432 (4.249) 28

Conclusions Participants are observed to consume less on average than counterpart non-participants does this mean program does not matter? Econometric estimation shows participation has indeed caused some 18 percent return to consumption for female and 11 percent for male borrowing IV-FE > IV > OLS 29

IV s in Program Evaluation Very hard to find appropriate instrumental variables ex post Can build IVs into design of program Two Cases: Problematic Randomization Randomization not feasible despite best attempts 30

IV s in Program Evaluation Problematic Randomization Two Groups: Treatment (T=1) and Control (T = 0) Noticed Some people in T=1 did not get the treatment! Some people in T=0 did get the treatment So, people who received the treatment (RT for Received Treatment) is different from people chosen: RT T Don t throw up hands in despair! T is a valid IV for RT 31

IV s in Program Evaluation Randomization not feasible Working with NGO to evaluate new program Suggest quasi-experimental program design NGO to require to follow eligibility criteria based on observables such as landholding, gender, poverty, etc Construct two-groups: Z=1 and Z=0 Let NGO choose treated households BUT ensure more treatment samples in Z=1 than Z=0 Use Z as instrumental variable 32

Recall classic assumption of OLS Assuming we can find valid IVs, we can overcome endogeneity (X is a choice variable correlated with the error): Simulteneity: X and Y cause each other Omitted Variables: X is picking up the effect of other variables Measurement Error: X is not measured precisely 33

Caution. Bad instruments: if corr (Z, ε) 0, we are in trouble! we must ensure that corr (Z, ε) = 0 this is not always easy! We use theory and common sense 34