Online Appendix Assessing the Incidence and Efficiency of a Prominent Place Based Policy



Similar documents
ONLINE APPENDIX FOR PUBLIC HEALTH INSURANCE, LABOR SUPPLY,

From the help desk: Bootstrapped standard errors

Clustering in the Linear Model

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

Online Appendices to the Corporate Propensity to Save

SAMPLE SIZE TABLES FOR LOGISTIC REGRESSION

Messaging and the Mandate: The Impact of Consumer Experience on Health Insurance Enrollment Through Exchanges

Gains from Trade: The Role of Composition

Clustered Standard Errors

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

Calculating Effect-Sizes

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

Simple Linear Regression Inference

ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE

STATISTICA Formula Guide: Logistic Regression. Table of Contents

The Elasticity of Taxable Income: A Non-Technical Summary

Factors affecting online sales

Marginal Person. Average Person. (Average Return of College Goers) Return, Cost. (Average Return in the Population) (Marginal Return)

Eigenvalues, Eigenvectors, Matrix Factoring, and Principal Components

Integrating Financial Statement Modeling and Sales Forecasting

Calculating the Probability of Returning a Loan with Binary Probability Models

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Solución del Examen Tipo: 1

Introduction to General and Generalized Linear Models

Revisiting Inter-Industry Wage Differentials and the Gender Wage Gap: An Identification Problem

Weather Insurance to Protect Specialty Crops against Costs of Irrigation in Drought Years

Using simulation to calculate the NPV of a project

On the Efficiency of Competitive Stock Markets Where Traders Have Diverse Information

The Variability of P-Values. Summary

Markups and Firm-Level Export Status: Appendix

The Long-Run Effects of Attending an Elite School: Evidence from the UK (Damon Clark and Emilia Del Bono): Online Appendix

2013 MBA Jump Start Program. Statistics Module Part 3

Week 4 Market Failure due to Moral Hazard The Principal-Agent Problem

How Much Equity Does the Government Hold?

Regression Analysis: A Complete Example

Online Appendix for Dynamic Inputs and Resource (Mis)Allocation

Accurately and Efficiently Measuring Individual Account Credit Risk On Existing Portfolios

Trading Probability and Turnover as measures of Liquidity Risk: Evidence from the U.K. Stock Market. Ian McManus. Peter Smith.

Least Squares Estimation

Linear and quadratic Taylor polynomials for functions of several variables.

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

The Assumption(s) of Normality

Statistics 104: Section 6!

Time Series Analysis

CHAPTER 12 EXAMPLES: MONTE CARLO SIMULATION STUDIES

Should New Merger Guidelines Give UPP Market Definition?

Introduction to Regression and Data Analysis

Practical Calculation of Expected and Unexpected Losses in Operational Risk by Simulation Methods

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Permutation Tests for Comparing Two Populations

SYSTEMS OF REGRESSION EQUATIONS

Valuation of Razorback Executive Stock Options: A Simulation Approach

5.1 Identifying the Target Parameter

August 2012 EXAMINATIONS Solution Part I

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

Do Supplemental Online Recorded Lectures Help Students Learn Microeconomics?*

Introduction to Statistical Computing in Microsoft Excel By Hector D. Flores; and Dr. J.A. Dobelman

Logarithmic and Exponential Equations

Modeling and Analysis of Call Center Arrival Data: A Bayesian Approach

Male and Female Marriage Returns to Schooling

Marketing Mix Modelling and Big Data P. M Cain

Part 2: Analysis of Relationship Between Two Variables

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

17. SIMPLE LINEAR REGRESSION II

The RBC methodology also comes down to two principles:

An Exploration into the Relationship of MLB Player Salary and Performance

DEMOGRAPHICS OF PAYDAY LENDING IN OKLAHOMA

Redistributive Taxation and Personal Bankruptcy in US States

( ) = ( ) = {,,, } β ( ), < 1 ( ) + ( ) = ( ) + ( )

Integrated Resource Plan

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

NCSS Statistical Software

Appendix 1: Time series analysis of peak-rate years and synchrony testing.

General Sampling Methods

Pricing Barrier Option Using Finite Difference Method and MonteCarlo Simulation

Math 370, Actuarial Problemsolving Spring 2008 A.J. Hildebrand. Practice Test, 1/28/2008 (with solutions)

Manual for SOA Exam MLC.

Multivariate Analysis of Ecological Data

Non-Linearity in the Determinants of Capital Structure: Evidence from UK Firms

Life Table Analysis using Weighted Survey Data

Session 9 Case 3: Utilizing Available Software Statistical Analysis

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

Variance Reduction. Pricing American Options. Monte Carlo Option Pricing. Delta and Common Random Numbers

EXAMINATION: EMPIRICAL ANALYSIS AND RESEARCH METHODOLOGY. Yale University. Department of Political Science. January 2013

Do R&D or Capital Expenditures Impact Wage Inequality? Evidence from the IT Industry in Taiwan ROC

Domestic Multinationals and Foreign-Owned Firms in Italy: Evidence from Quantile Regression 1

Social Security Eligibility and the Labor Supply of Elderly Immigrants. George J. Borjas Harvard University and National Bureau of Economic Research

Organizing Your Approach to a Data Analysis

Estimation of σ 2, the variance of ɛ

C(t) (1 + y) 4. t=1. For the 4 year bond considered above, assume that the price today is 900$. The yield to maturity will then be the y that solves

A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA

11. Analysis of Case-control Studies Logistic Regression

Binary Diagnostic Tests Two Independent Samples

In this section, we will consider techniques for solving problems of this type.

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Multiple regression - Matrices

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Quantitative Methods for Finance

Transcription:

Online Appendix Assessing the Incidence and Efficiency of a Prominent Place Based Policy By MATIAS BUSSO, JESSE GREGORY, AND PATRICK KLINE This document is a Supplemental Online Appendix of Assessing the Incidence and Efficiency of a Prominent Place Based Policy published in the American Economic Review Vol. XX No. YY. A. Model Extension with Two Types of Workers Let a fixed proportion π S of the agents be skilled and more productive than their unskilled counterparts who constitute the remaining fraction π U = π S of the population. Write the utility of individual i of skill group g {S, U} living in community j N and working in community k {, N } and sector s {, 2} as: u g i jks = w g jks r j κ jk + A j + ε g i jks = v g jks + εg i jks where w g jks is the wage a worker of skill group g from neighborhood j receives when { } working in sector s of neighborhood k. Define a set of indicator variables D g i jks equal { } to one if and only if max u g j k s i j k s = u g i jks for worker i and denote the measure of agents of skill group g in each residential/work location by N g jks = P ( D g i jks = { v g j k s }). Suppose that skilled and unskilled workers are perfect substitutes in production so that firm output may be written B k (q S ks + U ks ) f ( ) χ ks where the Sks and U ks refer to K total skilled and unskilled labor inputs respectively, χ ks = ks B k (q S ks +U ks is the capital to ) effective labor ratio, and q is the relative efficiency of skilled labor. Now wages will obey [ ( ) B k f χ ks χ ks f ( )] ( ) χ ks = w U jks τδ jks w S jks = qw U jks f ( ) χ ks = ρ Busso: Research Department, Inter-American Development Bank, 300 New York Ave. NW, Washington, DC 20577 (email: mbusso@iadb.org). Gregory: Department of Economics, University of Wisconsin-Madison, 80 Observatory Dr., Madison, WI 53706-393, (email: jmgregory@ssc.wisc.edu). Kline: Department of Economics, UC Berkeley and NBER, 530 Evans Hall 3880, Berkeley, CA 94720-3880 (email: pkline@econ.berkeley.edu).

2 THE AMERICAN ECONOMIC REVIEW APRIL 203 where w U jks is the wage for unskilled workers and w S jks the wage for skilled workers. Note that d ln w U jks = d ln ws jks = d ln B k d ln B k so that productivity increases may still be detected by examining impacts on the wages of commuters. However, productivity effects may also shift the skill composition of local workers and commuters which could lead us to over or understate these effects. For this reason we adjust our wage impacts in the paper for observable skill characteristics. Our final modification is that with two skill groups, clearing in the housing market requires: H j = N g jks g s With these features in place the social welfare function may be written: W = g V g + j k r j H j H j 0 G j (x) dx It is straightforward then to verify that for some community m : d W d B = m τ=0 g j k s N g dw g jks jks d B m = [ π U N Ụ m + qπ S N.m] S R (ρ) d W d A = N m. m τ=0 where N.m g = N g jms and N m. = j s g k s deadweight losses attributable to taxes as: DW L τ = g 2 ψdτ 2 g N g jk wg jk j N k N 0 N g mks. Furthermore we may write the dτ N g jk wg jk j N k N t d ln N g jk dt dt where in the second line we have assumed a constant semi-elasticity of local employment ψ = d ln N g jk. This formula is effectively the same as that in equation (0) of the dτ paper, relying on the total covered wage bill and the elasticity ψ. Were the elasticity to vary by type we would simply need to compute the deadweight loss separately within skill group and average across groups using the marginal frequencies. Finally, we

VOL. 03 NO. 2 ONLINE APPENDIX 3 may write the deadweight attributable to the block grants as: [ dw DW L G C λ a dw d ln A λ b j N j τ=0 d ln B k N k [ = C λ a A j N j. λ b j N g j s k N τ=0 ] N g jks wg jks As before, the deadweight loss computation relies on the parameters λ a and λ b. Heterogeneity provides no essential complication to the exercise since, with knowledge of these parameters, one only needs to know the total wage bill and population inside of the zone to compute DW L G. ] B. Monte Carlo Experiments We simulated hierarchical datasets of 64 zones with a random number of tracts N z within each zone. The number of tracts per zone was generated according to N z = 0+ η z where η z is a Negative Binomial distributed random variable with the first two moments matching the ones observed in the data (i.e. a mean 2 and a standard deviation of 6 tracts). Hence, each simulated sample was expected to yield approximately,344 census tracts with no zone containing less than 0 tracts in any draw. Outcomes were generated according to the model: Y tz = β z T z + α x t X tz + α p z P z + ξ z + e tz where T z is an EZ assignment dummy, X tz is a tract level regressor, P z is a zone level regressor, ξ z a random zone effect, and e tz an idiosyncratic tract level error. We assume throughout that: X tz P z N (0, I 3 ) e tz To build in some correlation between the covariates and EZ designation, and to reflect the fact that treated zones tend to be larger, we model the EZ assignment mechanism as: () T z = I ( rank ( ) ) Tz 6 Tz = X z + P z + 0.008 N z + u z u z N (0, ) where X z = N z X tz t z and the rank (.) function ranks the T z in descending order. Note that this assignment process imposes that exactly six zones will be treated. Hence, each simulation sample will face the inference challenges present in our data.

4 THE AMERICAN ECONOMIC REVIEW APRIL 203 The nature of the coefficients ( β z, αt x, α z p ) and the random effect ξ z vary across our Monte Carlo designs as described in the following table. We have two sets of results. In the first set, which we label symmetric, ξ z follows a normal distribution. In a second set of results, which we label asymmetric, ξ z follows a χ 2 distribution. Table S: Data Generating Processes Symmetric Asymmetric ξ z N (0, ) ξ z χ 2 (4). Baseline β z = 0, αt x = αz p = β z = 0, αt x = αz p = 2. Random Coefficient on X tz 3. Random Coefficient on P z Same as ) but, αt x N (, ) Same as ) but, αz p N (, ) Same as ) but, (αt x +3) χ 2 (4) Same as ) but, (αz p +3) χ 2 (4) 4. Random Coefficient on T z Same as ) but, β z N (0, ) Same as ) but, (β z +4) χ 2 (4) 5. All deviations from baseline (2) + (3) + (4) (2) + (3) + (4) Note that the null of zero average treatment effect among the treated is satisfied in each simulation design. Specification ) corresponds to the relatively benign case where our regression model is properly specified and the errors are homoscedastic. Specification 2) allows for heteroscedasticity with respect to the tract level regressor, while specification 3) allows some heteroscedasticity in the zone level regressor. Specification 4) allows heteroscedasticity with respect to the treatment, or alternatively, a heterogeneous but mean zero treatment effect. Specification 5) combines all of these complications so that heteroscedasticity exists with respect to all of the regressors. For each Monte Carlo design we compute three sets of tests of the true null that EZ designation had no average effect on treated tracts. The first (analytical) uses our analytical cluster-robust standard error to construct a test statistic t = β where and rejects when t >.96. The second (wild bootstrap-se) uses a clustered wild bootstrap procedure to construct a bootstrap standard error σ and rejects when β σ >.96. The third approach (wild bootstrap-t) estimates the wild bootstrap distribution Ft (.) of the test statistic t = β σ and rejects when t > Ft (0.95) where Ft (0.95) denotes the 95th percentile of the bootstrap distribution of t statistics. Both the bootstrap-se and bootstrap-t procedures simulate the bootstrap distribution imposing the null that β = 0 as recommended by Cameron, Gelbach, and Miller (2008). The false rejection rates for these three tests in each of the five simulation designs are given in the table below. σ

VOL. 03 NO. 2 ONLINE APPENDIX 5 Table S2: False Rejection Rates in Monte Carlo Simulations Tract-level models Symmetric Analytical Analytical Wild Wild Wild Wild s.e. s.e. BS-s.e. BS-s.e. BS-t BS-t OLS PW OLS PW OLS PW Baseline 0.26 0.074 0.039 0. 0.054 0.053 Random Coefficient on X tz 0.25 0.075 0.036 0.3 0.056 0.05 Random Coefficient on P z 0.24 0.077 0.04 0.0 0.055 0.048 Random Coefficient on T z 0.23 0.073 0.04 0.0 0.055 0.053 All 0.24 0.080 0.042 0.0 0.059 0.05 Asymmetric Baseline 0.23 0.06 0.037 0.38 0.055 0.056 Random Coefficient on X tz 0.2 0.09 0.04 0.36 0.047 0.049 Random Coefficient on P z 0.23 0. 0.039 0.39 0.054 0.052 Random Coefficient on T z 0.32 0. 0.039 0.42 0.053 0.056 All 0.25 0. 0.038 0.28 0.05 0.05 Standard error based methods tend to overreject in both designs save for in the case of OLS where the wild bootstrapped standard errors perform well. However the wild bootstrapped-t procedure yields extremely accurate inferences for both the OLS and PW estimators across all designs.