Econometrics Problem Set #2



Similar documents
Econometrics Problem Set #3

1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability).

Solución del Examen Tipo: 1

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Session 7 Bivariate Data and Analysis

Simple Regression Theory II 2010 Samuel L. Baker

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

To give it a definition, an implicit function of x and y is simply any relationship that takes the form:

Rockefeller College University at Albany

Econometrics Simple Linear Regression

Relative and Absolute Change Percentages

The Mathematics 11 Competency Test Percent Increase or Decrease

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Pre-Algebra Lecture 6

Chapter 4 Online Appendix: The Mathematics of Utility Functions

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Independent samples t-test. Dr. Tom Pierce Radford University

Wooldridge, Introductory Econometrics, 4th ed. Chapter 7: Multiple regression analysis with qualitative information: Binary (or dummy) variables

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

Linear Programming Notes VII Sensitivity Analysis

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Click on the links below to jump directly to the relevant section

MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING

Statistics 151 Practice Midterm 1 Mike Kowalski

Premaster Statistics Tutorial 4 Full solutions

Father s height (inches)

Basic Electronics Prof. Dr. Chitralekha Mahanta Department of Electronics and Communication Engineering Indian Institute of Technology, Guwahati

Vieta s Formulas and the Identity Theorem

Section 10.4 Vectors

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.

Univariate Regression

Scatter Plot, Correlation, and Regression on the TI-83/84

CALCULATIONS & STATISTICS

Part 1 Expressions, Equations, and Inequalities: Simplifying and Solving

Chapter 27: Taxation. 27.1: Introduction. 27.2: The Two Prices with a Tax. 27.2: The Pre-Tax Position

Simple linear regression

4 G: Identify, analyze, and synthesize relevant external resources to pose or solve problems. 4 D: Interpret results in the context of a situation.

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

The Point-Slope Form

14.02 Principles of Macroeconomics Problem Set 1 Fall 2005 ***Solution***

Math 4310 Handout - Quotient Vector Spaces

Pricing I: Linear Demand

Unit 26 Estimation with Confidence Intervals

Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week (0.052)

Module 3: Correlation and Covariance

Binary Adders: Half Adders and Full Adders

Preliminary Mathematics

Chapter 5. Conditional CAPM. 5.1 Conditional CAPM: Theory Risk According to the CAPM. The CAPM is not a perfect model of expected returns.

Nonlinear Regression Functions. SW Ch 8 1/54/

Mathematics Online Instructional Materials Correlation to the 2009 Algebra I Standards of Learning and Curriculum Framework

Unit 1 Number Sense. In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions.

Topic 1 - Introduction to Labour Economics. Professor H.J. Schuetze Economics 370. What is Labour Economics?

Answer Key for California State Standards: Algebra I

Creating, Solving, and Graphing Systems of Linear Equations and Linear Inequalities

You and your friends head out to a favorite restaurant

SPSS Guide: Regression Analysis

Linear Models in STATA and ANOVA

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Chapter 7: Modeling Relationships of Multiple Variables with Linear Regression

Deriving Demand Functions - Examples 1

MULTIPLICATION AND DIVISION OF REAL NUMBERS In this section we will complete the study of the four basic operations with real numbers.

Updates to Graphing with Excel

c 2008 Je rey A. Miron We have described the constraints that a consumer faces, i.e., discussed the budget constraint.

I. Introduction to Taxation

Descriptive statistics; Correlation and regression

Lecture 15. Endogeneity & Instrumental Variable Estimation

ANSWERS TO END-OF-CHAPTER QUESTIONS

Linear Programming. March 14, 2014

Forecasting in STATA: Tools and Tricks

Dealing with Data in Excel 2010

MOST FREQUENTLY ASKED INTERVIEW QUESTIONS. 1. Why don t you tell me about yourself? 2. Why should I hire you?

Stata Walkthrough 4: Regression, Prediction, and Forecasting

APPENDIX. Interest Concepts of Future and Present Value. Concept of Interest TIME VALUE OF MONEY BASIC INTEREST CONCEPTS

2. Simple Linear Regression

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

file:///d:/my%20webs/econ101-8_fa11_13/probsetanswers/ps6_1.htm

Chapter 3 Review Math 1030

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Kenken For Teachers. Tom Davis June 27, Abstract

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

Lesson 7 - The Aggregate Expenditure Model

Example: Boats and Manatees

Traditional Conjoint Analysis with Excel

6.4 Normal Distribution

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Linear Programming. Solving LP Models Using MS Excel, 18

Tom wants to find two real numbers, a and b, that have a sum of 10 and have a product of 10. He makes this table.

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

13 EXPENDITURE MULTIPLIERS: THE KEYNESIAN MODEL* Chapter. Key Concepts

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

MULTIPLE REGRESSION EXAMPLE

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Association Between Variables

The Keynesian Cross. A Fixed Price Level. The Simplest Keynesian-Cross Model: Autonomous Consumption Only

0.8 Rational Expressions and Equations

Study Questions for Chapter 9 (Answer Sheet)

5.1 Simple and Compound Interest

Transcription:

Econometrics Problem Set #2 Nathaniel Higgins nhiggins@jhu.edu Assignment The homework assignment was to read chapter 2 and hand in answers to the following problems at the end of the chapter: 2.1 2.5 and C2.1 C2.3. 2.1 Let kids denote the number of children ever born to a woman, and let educ denote years of education for the woman. A simple model relating fertility to years of education is kids = β 0 + β 1 educ + u, where u is the unobserved error. (i) What kinds of factors are contained in u? Are these likely to be correlated with level of education? Education of the woman s parents, number of siblings the woman has, her income, her marital status, her job, etc., are all likely to be contained in u. That is, each of these factors could be thought of as explanatory. Each of them (and many others that you could come up with) help determine the number of children a woman has. Many of the factors that I just listed are likely to be correlated with a woman s level of education. Education of the woman s parents: certainly correlated. Number of siblings the woman has: arguable whether or not this is correlated with the woman s level of education. Marital status: correlated. Job: correlated. (ii) Will a simple regression analysis uncover the ceteris paribus effect of education on fertility? Explain. 1

Because the omitted explanatory variables (such as those above) are embodied in u, and because these variables are also correlated with education, the regression will not reveal a causal relationship. The assumption E(u x) = 0 is violated. 2.2 In the simple linear regression model y = β 0 + β 1 x + u, suppose that E(u) 0. Leting α 0 = E(u), show that the model can always be rewritten with the sample slope, but a new intercept and error, where the new error has a zero expected value. Suppose that E(u) = α 0 0. We could write up a model with a new intercept. Instead of calling it β 0 like we did before, we ll just call it β 0 + α 0. Then we introduce a new error (or unobservable ) term, call it ɛ, where ɛ = u α 0. But you don t have to know this to start with. You get it if you just subtract α 0 from both sides of the equation (which you can always do if you do the same thing to both sides of an equation, you are not disturbing the underlying relationship). You know you want to subtract α 0 from both sides of the equation because you know you want the expectation of the unobservable to be 0. Subtracting α 0 gets you that 0. E[y α 0 ] =E[(β 0 + β 1 x + u α 0 ] =(β 0 + α 0 ) + β 1 x + E[u α 0 ] =(β 0 + α 0 ) + β 1 x + E[u] E[α 0 ] =(β 0 + α 0 ) + β 1 x + α 0 α 0 =(β 0 + α 0 ) + β 1 x. This is all exactly equivalent to writing a new model with an intercept = (β 0 + α 0 ) and a new unobservable ɛ = u α 0. 2.3 The following table contains the ACT scores and the GPA (grade point average) for eight college students. Grade point average is based on a four-point scale and has been rounded to one digit after the decimal. Student GPA ACT 1 2.8 21 2 3.4 24 3 3.0 26 4 3.5 27 5 3.6 29 6 3.0 25 7 2.7 25 8 3.7 30 2

(i) Estimate the relationship between GPA and ACT using OLS; that is, obtain the intercept and slope estimates in the equation GP A = β 0 + β 1 ACT Comment on the direction of the relationship. Does the intercept have a useful interpretation here? Explain. How much higher is the GPA predicted to be if the ACT score is increased by five points? The easiest way to get the answer to this question would be to enter the data by hand into Stata (or Exel). Then run a simple OLS regression. Since there are only 8 data points, this would be pretty quick. Or, again because there aren t many data points, you could just compute the formula for the OLS coefficients using the formula on page 29 and a calculator. I used Stata (the easiest way to enter this number of data points is by using the edit command and just filling in the spreadsheet) and found the following coefficients (rounded to two decimal places): β 0 = 0.57, β 1 = 0.1. The direction of the relationship is positive, as expected. Students with a higher ACT score have a higher GPA. Whether this has to do with intelligence, test-taking ability, or whatever, I am not sure. But the direction of the relationship is certainly expected. The literal interpretation of the intercept is the GPA if the ACT score was 0. Is that even possible? Can you score a 0 on your ACT? Or do you get 10 points for writing your name? I don t know. But either way, my answer to the question is probably no, the intercept does not have a useful interpretation. All of the ACT scores in our data are well above 0, indicating to me that the notion of a 0 ACT score is well outside our sample, and thus well outside what I am comfortable predicting with this model. If the ACT score increases by 5 points, I expect to observe a 0.5 point increase in GPA (= 0.1 5). (ii) Compute the fitted values and residuals for each observation, and verify that the residuals (approximately) sum to zero. Again, this is very easy to do by hand, and even easier to do in stata. Here s the code which will verify the output. reg gpa act predict gpahat gen resid = gpa - gpahat gen totalresid = sum(resid) browse 3

Look at the data. The variable totalresid is the running total of the residuals. By the time you get to observation 8, the residuals sum to 2.38e-07, or 0.000000238. That s close enough to 0 for me. (iii) What is the predicted value of GPA when ACT = 20? The predicted value of GPA when ACT = 20 is 0.57 + 0.1 20 = 2.57. Even though we don t have an observation with ACT = 20, we can predict what would happen if we did, using our model. (iv) How much of the variation in GPA for these eight students is explained by ACT? Explain. About 58% of the variation in GPA is explained by ACT. We see this in the R-squared statistic, which represents the proportion of the total variation in GPA that is explained by all the explanatory variables. Since we only have one explanatory variable, R-squared gives us exactly what the question is looking for. (This would not be true if there were multiple explanatory variables). 2.4 The data set BWGHT.RAW contains data on births to women in the United States. Two variables of interest are the dependent variable, infant birth weight in ounces (bwght), and an explanatory variable, average number of cigarettes the mother smoked per day during pregnancy (cigs). The following simple regression was estimated using data on n = 1,388 births: bwght = 119.77 0.514cigs (i) What is the predicted birth weight when cigs = 0? What about when cigs = 20 (one pack per day)? Comment on the difference. The predicted birth weight when cigs = 0 is 119.77 ounces (7.5 lbs). When cigs = 20, the predicted birth weight is equal to 119.77-0.514 * 20 = 109.5 ounces (6.8 lbs). It seems clear to me from this equation that (shock!) smoking is nasty bad: it makes your breath stink, your teeth yellow, makes it hard to run a mile, and it makes your kid small (OK, only the last one is justified by the model, but you know what I mean). 4

(ii) Does this simple regression necessarily capture a causal relationship between the child s birth weight and the mother s smoking habits? Explain. The regression captures a causal relationship between the child s birth weight and the mother s smoking habits unless the mother s smoking habits are positively related to other unobserved factors which also negatively influence birth weight. Off the top of my head, I would guess that income and education (which both influence nutrition and the amount of prenatal care that a woman gets) could be such variables. So if we wanted to be sure that smoking reduced birth weight, we d have to make sure that these variables were accounted for. My guess is that we would still find a negative relationship between smoking and birth weight, but that the effect would be attenuated (i.e. the coefficient would be smaller in absolute value). (iii) To predict a birth weight of 125 ounces, what would cigs have to be? Comment. Uh-oh. Gotta drop some algebra to get this one done. 125 is the y-variable, in this case. 125 = 119.77 0.514 cigs. Solve for cigs: 119.77 125 cigs = = 10.2. 0.514 Impossible! You can t smoke negative cigarettes (if anybody figures out how to do this please let me know. I d like to begin negative smoking immediately). (iv) The proportion of women in the sample who do not smoke while pregnant is about 0.85. Does this help reconcile your finding from part (iii)? I don t really like this question. It s not as illuminating as you might think. As I mentioned in class, this sort of helps reconcile the findings. If most of our sample is pegged at cigs = 0, then I expect that the intercept is going to be pretty close to the mean birth weight for the part of the sample that doesn t smoke (think of the interpretation of the intercept and you ll see why). So even though some children a certainly born with birth weights above 120, our model doesn t say much about them. All of the stuff that causes children to be born heavier than about 120 ounces is unobservable in our model. 5

2.5 In the linear consumption function cons = β 0 + β 1 inc, the (estimated) marginal propensity to consume (MPC) out of income is simply the cons inc = β 0 slope, β1, while the average propensity to consume (APC) is inc + β 1. Using observations for 100 families on annual income and consumption (both measured in dollars), the following equation is obtained: (i) cons = 124.84 + 0.853inc n =100, R 2 = 0.692 Interpret the intercept in this equation, and comment on its sign and magnitude. The marginal propensity to consume when income is zero is -124.84 when you aren t making any money, you de-consume (you sell stuff). Its sign isn t surprising; even though interpreting a negative number literally is dangerous, in this case. We don t have the data in front of us, so I can t be sure, but I d guess there are no negative entries in consumption. So this is probably another case where the intercept doesn t have much of an interpretation. If it really is possible (in the data) to observe negative consumption (selling your stuff) then the interpretation I gave is meaningful. But I doubt it. (Note: if you just saved and did nothing, without selling stuff, your MPC would be 0 when you get money, you do nothing). (ii) What is the predicted consumption when family income is $30,000? Consumption when income is $30,000 is given by i.e. $25,465.16. (iii) 124.84 + 0.853 30000 = 25465.16, With inc on the x-axis, draw a graph of the estimated MPC and APC. The first thing Wooldridge asks you to do is very silly. Graphing MPC against inc is just graphing a constant number (0.853) against income. That is a flat line at 0.853. 6

Not worth doing. If you graphed consumption against income, that would be more interesting (but only slightly). You would have a line that passed through the origin at -$124 and increased with a slope of $0.85 per dollar of income. See below. Consumption and income Consumption 0 20000 40000 60000 80000 0 20000 40000 60000 80000 100000 Income Graphing the APC is just slightly more interesting. See below. APC and income cons_inc -.5 0.5 1 0 20000 40000 60000 80000 100000 Income The average propensity to consume increases steeply when income is low, then levels off significantly at around $2,000 at about 0.8. By the time a family is making around $100,000, their average propensity to consume is still only about 0.85. Graphing the two together is sort of cool. 7

APC and MPC against income -.5 0.5 1 0 20000 40000 60000 80000 100000 Income apc mpc C2.1 The data in 401K.RAW are a subset of data analyzed by Papke (1995) to study the relationship between participation in a 401(K) pension plan and the generosity of the plan. The variable prate is the percentage of eligible workers with an active account; this is the variable we would like to explain. The measure of generosity is the plan match rate, mrate. This variable gives the average amount the firm contributes to each worker s plan for each $1 contribution by the worker. For example, if mrate = 0.50, then a $1 contribution by the worker is matched by a 50 cent contribution by the firm. (i) Find the average participation rate and the average match rate in the sample of plans. Use Stata code sum prate mrate to find that the averages are 87.4% participation and 0.73 match-rate, respectively. 8

(ii) Now, estimate the simple regression equation prate = β 0 + β 1 mrate and report the results along with the sample size and R-squared. The following Stata code produces the results below. reg prate mrate β 0 = 83.07, β 1 = 5.86, n = 1, 534, R-squared = 0.07. (iii) Interpret the intercept in your equation. Interpret the coefficient on mrate. The intercept = 83.07. If the match-rate were 0, i.e. if employers contributed nothing to employees 401k plans, 83% of employees would still participate in a 401k. For every dollar increase in the match-rate, employee participation increases by almost 6%. 0 50 100 0 1 2 3 4 5 401k plan match rate participation rate, percent Fitted values (iv) Find the predicted prate when mrate = 3.5. Is this a reasonable prediction? Explain what is happening here. The predicted prate when mrate is 3.5 is given by 83.07 + 5.86 3.5 = 103.58. That is, 103% of employees are predicted to participate. Of course this is unreasonable. See the graphic above to get a picture of what is going on. Basically, we are using a linear 9

model (which predicts out into infinity in both the positive and negative directions) to fit data that is constrained to be less than or equal to 100. That is, the linear model is capable of predicting stuff that is incapable of happening. If we want to use the linear model for this type of problem, we have to live with this tradeoff. We should be very skeptical of the model to begin with, and we should absolutely not interpret the model s out-of-possibility predictions literally. (v) How much of the variation in prate is explained by mrate? Is this a lot in your opinion? Very little. About 7% according to the R-squared statistic. Most of the participation decision appears to be determined by factors other than mrate. (Maybe new employees are enrolled in company 401k plans by default, e.g.) (C2.2) The data set in CEOSAL2.RAW contains information on chief executive officers for U.S. corporations. The variable salary is annual compensation, in thousands of dollars, and ceoten is prior number of years as company CEO. (i) Find the average salary and the average tenure in the sample. Use Stata code sum salary ceoten to find that the average salary of a CEO in our sample is $865,860, and the average tenure is 7.95 years. (ii) How many CEOs are in their first year as CEO (that is, ceoten = 0)? What is the longest tenure as a CEO? There are lots of ways to find this information. There are a total of 177 ceos in the sample. Since there are 177 observations of ceoten (no missing values in that particular variable), we can just count the number of observations that meet the criteria. A command such as sum ceoten if ceoten == 0 will do the trick. You will see in the output that the number of observations fitting the criteria is 5. Alternatively, you could create a variable to do the calculation directly. For instance, you could create an indicator variable using the code gen newguy = 0 10

replace newguy = 1 if ceoten == 0 then summarize this indicator variable with the code sum newguy. You could do two things at this point to find the answer. You could multiply the number of observations (177) by the mean (.0282486) to find the answer (5.0000022). Or you could use the direct way. This is a useful Stata tip: Every estimation command (even the very simple command sum) saves some calculations in the background without printing them to the screen. To see what results are saved and available to you (though not printed to the screen), look at the bottom of the help file for the estimation command. Type help summarize to see the help file for summarize. Scroll to the bottom and look at the material below the Saved results heading. One of the things you will see is that the sum of the variable being summarized (in this case, the newguy variable) is stored in a scalar referred to in Stata as r(sum). A scalar is simply a single number. A constant. A regular old number. In Stata, scalars are referred to as such to distinguish them from data. Most stuff in Stata is data, i.e. it is a whole column of information. If you want to refer to a single entity (like a number equal to the sum of the newguy indicator variable) then you preface it with the scalar command. Run the following commands to see what happens. sum newguy scalar count = r(sum) scalar list count You should see that the result (5) is printed to the screen. You have generated a scalar that is equal to the sum of a variable which takes on the value 1 when a ceo is tenured for zero years. This is just one way to do it. You could successfully do it a number of different ways. To find the longest tenured ceo, simply summarize the ceoten variable and look at the maximum value. The longest tenured ceo has been on the job for 37 years. (iii) Estimate the simple regression model log(salary) = β 0 + β 1 ceoten + u, and report your results in the usual form. What is the (approximate) predicted percentage increase in salary given one more year as CEO? gen logsalary = log(salary) reg logsalary ceoten We see that for every year increase in ceo tenure, salary increases by about 1% (the coefficient on ceoten is.0097236, which we multiply by 100 to obtain the percentage increase). 11

(C2.3) Use the data in SLEEP75.RAW from Biddle and Hamermesh (1990) to study whether there is a tradeoff between the time spent sleeping per week and the time spent in paid work. We could use either variable as the dependent variable. For concreteness, estimate the model sleep = β 0 + β 1 totwrk + u, where sleep is minutes spent sleeping at night per week and totwork is total minutes worked during the week. (i) Report your results in equation form along with the number of observations and R 2. What does the intercept in this equation mean? My results in equation form are: sleep = 3586.38.15 totwrk. The number of observations used to estimate the model is 706, and the R-squared is 0.1 (rounded to the 100ths place). The intercept of 3,586 is the number of minutes of night-sleep (as opposed to nap-sleep) per week that an unemployed person gets. (ii) If totwrk increases by 2 hours, by how much is sleep estimated to fall? Do you find this to be a large effect? The variable totwrk is measured in minutes, so a change of two hours would be equal to an increase in totwrk of 120. An increase of totwrk by 120 would generate a decrease in sleep of 18 minutes per week (120*(-0.15) = -18). Doesn t seem like much. Work an extra 2 hours per week and sleep 18 minutes less. Would you do it? Depends on your wage, I suppose. 12