Lecture Note: Self-Selection The Roy Model. David H. Autor MIT 14.661 Spring 2003 November 14, 2003



Similar documents
Section 1.3 P 1 = 1 2. = P n = 1 P 3 = Continuing in this fashion, it should seem reasonable that, for any n = 1, 2, 3,..., =

problem arises when only a non-random sample is available differs from censored regression model in that x i is also unobserved

Introduction to Matrix Algebra

Lecture 3: Differences-in-Differences

Do Commodity Price Spikes Cause Long-Term Inflation?

Lecture Notes on Elasticity of Substitution

Chapter 13 Composition of the Market Portfolio 1. Capital markets in Flatland exhibit trade in four securities, the stocks X, Y and Z,

Labor Demand The Labor Market

A Classical Monetary Model - Money in the Utility Function

Multi-variable Calculus and Optimization

Lectures, 2 ECONOMIES OF SCALE

Economic Growth: Lecture 11, Technology Diffusion, Trade and World Growth

6. Budget Deficits and Fiscal Policy

Online Appendix to Stochastic Imitative Game Dynamics with Committed Agents

CHAPTER 10 MARKET POWER: MONOPOLY AND MONOPSONY

Introduction to Fixed Effects Methods

U = x x x 1 4. What are the equilibrium relative prices of the three goods? traders has members who are best off?

Association Between Variables

Problem Set 1 Solutions

ECON 443 Labor Market Analysis Final Exam (07/20/2005)

Econometrics Simple Linear Regression

14.02 Principles of Macroeconomics Problem Set 1 Fall 2005 ***Solution***

5.1 Identifying the Target Parameter

Self-Selection and the Economics of Immigration. George J. Borjas Harvard University June 2010

3.4 Statistical inference for 2 populations based on two samples

Labour Market Discrimination

Study Guide for the Final Exam

Increasing Returns and Economic Geography

Lecture Note 17: Private Information, Adverse Selection and Market Failure

ECON20310 LECTURE SYNOPSIS REAL BUSINESS CYCLE

ECON 459 Game Theory. Lecture Notes Auctions. Luca Anderlini Spring 2015

Sensitivity Analysis 3.1 AN EXAMPLE FOR ANALYSIS

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

4 The M/M/1 queue. 4.1 Time-dependent behaviour

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared.

Step 6: Writing Your Hypotheses Written and Compiled by Amanda J. Rockinson-Szapkiw

SOCIAL AND NONMARKET BENEFITS FROM EDUCATION IN AN ADVANCED ECONOMY

Section 1.1 Linear Equations: Slope and Equations of Lines

The fundamental question in economics is 2. Consumer Preferences

Average Redistributional Effects. IFAI/IZA Conference on Labor Market Policy Evaluation

Nursing Journal Toolkit: Critiquing a Quantitative Research Article

Introduction to Hypothesis Testing OPRE 6301

CAPM, Arbitrage, and Linear Factor Models

Introduction to Hypothesis Testing

Understanding the Impact of Weights Constraints in Portfolio Theory

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

An Introduction to Path Analysis. nach 3

Keynesian Macroeconomic Theory

Standard Deviation Estimator

I d ( r; MPK f, τ) Y < C d +I d +G

The Mathematics 11 Competency Test Percent Increase or Decrease

DEMAND FORECASTING. Demand. Law of Demand. Definition of Law of Demand

3. Mathematical Induction

Simple Regression Theory II 2010 Samuel L. Baker

E3: PROBABILITY AND STATISTICS lecture notes

MATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.

ECO 199 B GAMES OF STRATEGY Spring Term 2004 PROBLEM SET 4 B DRAFT ANSWER KEY

1 Portfolio mean and variance

On Marginal Effects in Semiparametric Censored Regression Models

Lecture Note 1 8 Education, Human Capital, and Labor Market Signaling

MACROECONOMICS II INFLATION, UNEMPLOYMENT AND THE PHILLIPS CURVE

Parametric Survival Models

Product Mix as a Framing Exercise: The Role of Cost Allocation. Anil Arya The Ohio State University. Jonathan Glover Carnegie Mellon University

Lecture 1: Asset pricing and the equity premium puzzle

The Cobb-Douglas Production Function

An introduction to Value-at-Risk Learning Curve September 2003

Walrasian Demand. u(x) where B(p, w) = {x R n + : p x w}.

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

Part 2: Analysis of Relationship Between Two Variables

HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009

Productioin OVERVIEW. WSG5 7/7/03 4:35 PM Page 63. Copyright 2003 by Academic Press. All rights of reproduction in any form reserved.

Covariance and Correlation

Economics of Strategy (ECON 4550) Maymester 2015 Applications of Regression Analysis

A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution

Real Business Cycle Theory. Marco Di Pietro Advanced () Monetary Economics and Policy 1 / 35

Chapter 1. Vector autoregressions. 1.1 VARs and the identi cation problem

Does student performance reduce imprisonment? *

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS

Descriptive Statistics

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Notes - Gruber, Public Finance Chapter 20.3 A calculation that finds the optimal income tax in a simple model: Gruber and Saez (2002).

Elasticity Theory Basics

Labor Market Assimilation of Recent Immigrants in Spain

Holding Period Return. Return, Risk, and Risk Aversion. Percentage Return or Dollar Return? An Example. Percentage Return or Dollar Return? 10% or 10?

Constrained optimization.

Theories of Discrimination

Note on growth and growth accounting

FEGYVERNEKI SÁNDOR, PROBABILITY THEORY AND MATHEmATICAL

A Basic Introduction to Missing Data

The Engle-Granger representation theorem

The Cost of Production

The Stewardship Role of Accounting

arxiv: v2 [math.ho] 4 Nov 2009

CHAPTER 2 Estimating Probabilities

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Tuesday, September 11 Normal Approximations and the Central Limit Theorem

Chapter 3. Cartesian Products and Relations. 3.1 Cartesian Products

Transcription:

Lecture Note: Self-Selection The Roy Model David H. Autor MIT 14.661 Spring 2003 November 14, 2003 1

1 Introduction A core topic in labor economics is self-selection. What this term means in theory is that rational actors make optimizing decisions about what markets to participate in job, location, education, marriage, crime, etc. What it means in practice is that observed economic relationships should generally be viewed as endogenous outcomes of numerous optimizing decisions, rather than as exogenous causal relationships. Understanding self-selection should make you skeptical of treating ecological correlations as causal. The starting point of formal treatment of this topic in economics is Roy s (1951) Thoughts on the Distribution of Earnings, which discusses the optimizing choices of workers selecting between fishingandhunting. Roy skeyobservation isthattherearethreefactorsthataffect this choice: 1. Fundamental distribution of skills and abilities 2. The correlations among these skills in the population 3. The technologies for applying these skills 4. Consumer tastes that impact demand for different types of outputs At the time of Roy s writing, the presumption was that the distribution of income that arises from economic processes is arbitrary. Hence, if we compare the mean earnings of hunters and fishermen, ȳ h and ȳ f,thenȳ h ȳ f is an estimate of the earnings gain or loss that an individual would receive from switching from fishing to hunting. Roy s article explains why this view is incorrect. The essential departure of Roy s model from previous work is that it is a multiple-index model (in this case, 2 indices): workers have skills in each occupation, but they can only use one skill or the other. Hence, workers self-select the sector that gives them the highest expected earnings. Equilibrium in each market equates supply and demand, while a selfselection condition means that the marginal worker is indifferent between the two sectors. Roy s 1951 paper is amusing to read to a contemporary economist because it so awkwardly straddles the line between older-style narrative economics and modern mathematical economics. 2

Roy is clearly writing with equations and distributions in mind (probably all written out), but he writes mostly about rabbits and fish only occasionally interjecting that therefore earnings will be log-normal. This is very hard to follow. Borjas 1987 AER Paper on Self-Selection and the Earnings of Immigrants is the first paper that I know that writes down a simple, parametric 2-sector Roy model. The enduring contribution of Borjas paper is this model (sometimes called a Borjas selection model) rather than the empirical findings. As a labor economist, you should be well versed with this model. 2 Borjas 1987: Self-Selection and the Earnings of Immigrants Who chooses to immigrate to the United States? One ready-made answer is that workers from low wage countries will immigrate. This may be true on average, but it s probably too simple. The workers immigrating to the United States are probably not a random subset of the Mexican workforce. Rather, we should expect that potential migrants make some rough comparison of their wages in the home country and their expected wages in the U.S. On average, we d expect those who immigrate to have higher expected earnings in the U.S. than Mexico and vice versa for those who stay. Consider two countries 0 and 1, denoting the source and host country. Log earnings in the source country are given by w 0 = 0 + ε 0, where ε 0 N (0,σ 2 0). It s useful to think of ε o as the de-meaned value of worker s skill inthesourcecountry. If everyone from country 0 were to migrate to the host country, their earnings would be (ignoring any general equilibrium effects!): w 1 = 1 + ε 1, where ε 1 N (0,σ 2 1). 3

Assume that the cost of migrating is C, which Borjas puts into time equivalent terms as π = C/w 0. Borjas further assumes that π is constant, meaning that C is directly proportional to w 0. Assume further that each worker knows C, 0, 1 and his individual epsilons: ε 0,ε 1. You, the econometrician, only observe a worker in one country or the other, and hence you only know ε 0 or ε 1 for any individual. What can you infer about what wages for immigrants in the United States would have been had they stayed in their source countries? What would wages in the United States be for non-migrants had they come to the United States? The Roy Model answers these questions. The correlation between source and host country earnings is where 1 is cov(,σ 1 ). ρ = 1 σ 1, To implement this model, we need to know ρ, although we do not need to know both ε 0,ε 1 for any worker. A worker will choose to migrate if ( 1 0 π)+(ε 1 ε 0 ) > 0, (1) (Borjas defines the indicator variable I, equalto1 if this selection condition is satisfied, 0 otherwise). Now, define ν = ε 1 ε 0. The probability that a randomly chosen worker from the source country chooses to migrate is equal to P = Pr[ν>( 0 1 + π)] ν = Pr > ( 0 1 + π) σ ν (0 = 1 Φ 1 + π), 4

where z = ( 0 1 + π) /, and Φ ( ) is the CDF of the standard normal. Notice that the higher larger is z, the lower is the probability of migration. This is because z is rising in the mean earnings of the home country and the cost of migration. So P/ 0 < 0, P/ 1 > 0, P/ π <0. These are mean effects. It s useful to assume from here forward that 1 0, so we can focus on self-selection rather than mean differences. Let s examine the self-selection properties. 2.1 Selection conditions What is the expectation of earnings in the source country for workers who choose to immigrate? E(w 0 Immigrate) = 0 + E ε 0 ν >z σ ν ε0 = 0 + E ν >z. (2) Notice that this equation depends on three things: 1. Mean earnings in the source country 2. Both error terms (ε 0,ε 1 ) through ν. 3. Implicitly, it also depends on the correlation between the error terms. We want to know the expectation of ε 0 given some value ν. Given the normality of ε 0,ε 1, this is simply equal to the regression coefficient E (ε 0 ν) = ν ν. σ 2 ν 5

Applying this to (2), we get E ε0 ν = ν σ 2 ν = ν = ρ 0ν. 1 σ 2 ν ν 1 ν Notice that due to the normalizations, the covariance cov(ε 0,ε 1 ) is reduced by 1/ σ 1 and the variance of ν/ is 1. Hence, we can rewrite (2) as ε0 E(w 0 Immigrate) = 0 + E ν >z σ ν ε0 = 0 + E ν >z σ ν ν = 0 + ρ 0ν E ν >z σ ν φ (z) = 0 + ρ 0ν (3) where φ (z) / () is the Inverse Mills Ratio, equal to the conditional expectation of a standard normal random variable truncated from the left at point z. The IMR is sometimes referred to as a hazard ratio. This is because a hazard function answers the question what is the probability of an event given that the event has not already occurred. Similarly, the IMR answers the question: what is the expectation of epsilon given that epsilon is greater than or equal to z? We can calculate the expected wage in the source country of workers who do migrate as: E(w 1 Immigrate) = 1 + E ε 1 ν >z (4) σ ν φ (z) = 1 + ρ 1ν σ 1 Φ ( z) 6

2.2 Three cases It s convenient to rearrange (3) and (4) to get φ (z) E(w 0 Immigrate) = 0 + ρ 0ν = 0 + σ 0σ 1 ρ σ 1 φ (z), (5) and φ (z) E(w 1 Immigrate) = 1 + ρ 1ν σ 1 = 1 + σ 0σ 1 σ1 φ (z) ρ, (6) where ρ = 1 / σ 1 as above. [Algebra check 0 + σ 1 ρ σ 0 φ (z) σ 1 = 0 + σ 0σ 1 σ01 σ 0 φ (z) σ 1 σ 1 σ01 σ 2 0 φ (z) = 0 +. Notice that cov (ν,ε 0 )=ν = E [(ε 1 ε 0 ) ε 0 ]=1 σ 2 0, which we can substitute in: = 0 + = 0 + φ (z) φ (z). σ0ν σ ν σ0ν = 0 + ρ 0ν φ (z) So, this substitution works.] Define Q 0 = E (ε 0 I =1),Q 1 = E (ε 1 I =1). We now have three cases: 7

2.2.1 Positive hierarchical sorting This is a case where immigrants are positively selected from the source country distribution and are also above the mean of the host country distribution: Q 0 > 0,Q 1 > 0. This will be true iff σ 1 > 1 and ρ> σ 1. What do these conditions mean? First, σ 1 > 0 implies that the host country has a higher return to skill than the source country. Second, ρ> σ 1, implies that the correlation between the skills valued in the host and source country is sufficiently high. If you were a skilled worker in the source country, you would not want to migrate to a host country with a very high return to skills if the skills valued in the host country were uncorrelated (or negatively correlated) with skills value in the home country. This case embodies the canonical American view of immigration: The best and the brightest leave their home countries for greater opportunity (that is, higher return to skill) in the U.S. One way of restating this type of migration is: a source country with low earnings variance taxes the earnings of high skill workers and insures the earnings of low skill workers. High skill workers may want to emigrate, accordingly. But this is not the only possibility. 2.2.2 Negative hierarchical sorting This is a case where immigrants are negative selected from the source country distribution and are also below the average of the host country distribution: Q 0 < 0,Q 1 < 0. This will be true iff σ 1 > 1 and ρ> σ 1. This is simply the converse case. Here, the source country is unattractive to low earnings workers because of high wage dispersion. Again assuming that wages are sufficiently correlated between the source and host country, low skill workers will want to migrate 8

to take advantage of the insurance provided by a narrow wage structure in the host country. So, this is the potentially unattractive case (certainly from Borjas perspective) where a compressed wage structure subsidizes low skill workers, thus attracting low skill workers from abroad. 2.2.3 Refugee sorting A third case is where Q 0 < 0,Q 1 > 0, that is, immigrants are selected from the lower tail of the home country distribution but arrive in the upper tail of the host country distribution. This can only occur if σ1 ρ<min, σ 0, σ 1 meaning that the correlation between earnings in the two countries is sufficiently low (could be negative). This might occur, for example, for a minority group whose opportunities in the host country are depressed by prejudice. Or for the case of migration from a non-market economy where the set of skills rewarded is quite different from the economy in the receiving country (e.g., European Jews in the first case, intellectuals from the former Communist block in the second). 2.2.4 A fourth case? Note that there is not afourthcasewhereq 0 > 0,Q 1 < 0. Why not? This would suggest irrational migration, where people leave the upper tail of the source country income distribution to join the lower tail of the host country distribution. This is inconsistent with income maximization. Mathematically, this case would require that σ1 ρ>max, σ 0, σ 1 which would imply that ρ>1, which cannot be true for a correlation coefficient. 9

3 Relevance? How relevant is the Borjas/Roy selection model to the problem he studies in the 1987 paper, self-selection of immigrants? The evidence is not overwhelming. There are probably more relevant applications of this model. 1. One is Borjas recent NBER paper on self-selection into government employment in the U.S. Though the returns to skills/education rose rapidly in the U.S. during the 1980s, the wage structure in government jobs was stable, meaning that in relative terms, the public sector wage structure became compressed. You would therefore expect a negative hierarchical sorting case to become potentially relevant: high skill workers leave the government for the private sector and low skill workers remain in government jobs to be sheltered from falling wages. This is what Borjas paper purports to show, and casual empiricism suggests that this is likely to be right. 2. A closely related application, but one I ve not applied, is the changing selection of workers into the teaching profession. The teaching profession probably used to offer relatively high returns to skills for educated women. For many women, it was the only profession open. This has changed considerably in the U.S. Teaching now probably provides a high floor and a low ceiling for wages for college educated females. This would suggest growing adverse selection into teaching over time. Worth exploring. The enduring contribution of Borjas paper for labor economists is its simple and useful formulation of the Roy model. (Note that one limitation of this model is that it ignores general equilibrium effects whereby large immigrant flows would actually change the wage structure parameters in the source and host countries.) The growing focus of empirical economists on applying instrumental variables to causal estimation is in large part a response to the realization that self-selection (i.e., optimizing behavior) plagues interpretation of ecological relationships. Hence, understanding the importance of self-selection has vastly improved empirical work. 10

But instrumental variables are not the only answer to testing cause and effect with observed data. Self-selection also points to the existence of equilibrium relationships that should be observed in ecological data (i.e., those who immigrate should in general do better in the host country than the source country), and these can be tested without an instrument. In fact, there are some natural sciences that proceed almost entirely without experimentation for example, astrophysics. How do they do it? Models predict nonobvious relationships in data. These implications can be verified or refuted by data, and this evidence strengthens or overturns the hypotheses. Many economists seem to have forgotten this methodology. 11