Econometrics. Midterm (A) March 22 nd, 2010

Similar documents
2. Linear regression with multiple regressors

Solución del Examen Tipo: 1

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Review Jeopardy. Blue vs. Orange. Review Jeopardy

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Forecasting the US Dollar / Euro Exchange rate Using ARMA Models

Chapter 6: Multivariate Cointegration Analysis

Wooldridge, Introductory Econometrics, 3d ed. Chapter 12: Serial correlation and heteroskedasticity in time series regressions

ECON 523 Applied Econometrics I /Masters Level American University, Spring Description of the course

Econometrics Simple Linear Regression

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Air passenger departures forecast models A technical note

Statistics 151 Practice Midterm 1 Mike Kowalski

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software

Module 5: Multiple Regression Analysis

An Analysis of the Effect of Income on Life Insurance. Justin Bryan Austin Proctor Kathryn Stoklosa

Nonlinear Regression Functions. SW Ch 8 1/54/

Descriptive Statistics

Simple linear regression

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Introduction to Quantitative Methods

MISSING DATA TECHNIQUES WITH SAS. IDRE Statistical Consulting Group

Lecture 15. Endogeneity & Instrumental Variable Estimation

Simple Linear Regression Inference

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Chapter 7: Simple linear regression Learning Objectives

Premaster Statistics Tutorial 4 Full solutions

Financial Risk Management Exam Sample Questions/Answers

Testing for Granger causality between stock prices and economic growth

Multiple Regression: What Is It?

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

On the Degree of Openness of an Open Economy Carlos Alfredo Rodriguez, Universidad del CEMA Buenos Aires, Argentina

1. Suppose that a score on a final exam depends upon attendance and unobserved factors that affect exam performance (such as student ability).

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Introduction to Regression and Data Analysis

Multiple Linear Regression in Data Mining

What s New in Econometrics? Lecture 8 Cluster and Stratified Sampling

Linear Models in STATA and ANOVA

Chapter 5 Analysis of variance SPSS Analysis of variance

The Impact of Privatization in Insurance Industry on Insurance Efficiency in Iran

Regression Analysis (Spring, 2000)

The relationship between stock market parameters and interbank lending market: an empirical evidence

Simple Regression Theory II 2010 Samuel L. Baker

3.1 Least squares in matrix form

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

Module 3: Correlation and Covariance

Economics of Strategy (ECON 4550) Maymester 2015 Applications of Regression Analysis

Chapter 7: Dummy variable regression

STATISTICA Formula Guide: Logistic Regression. Table of Contents

The Relationship between Life Insurance and Economic Growth: Evidence from India

SYSTEMS OF REGRESSION EQUATIONS

UK GDP is the best predictor of UK GDP, literally.

Directions for using SPSS

5. Multiple regression

17. SIMPLE LINEAR REGRESSION II

Multiple Linear Regression

Moderation. Moderation

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Generalized Linear Models

Independent t- Test (Comparing Two Means)

Hypothesis testing - Steps

II. DISTRIBUTIONS distribution normal distribution. standard scores

IMPACT OF WORKING CAPITAL MANAGEMENT ON PROFITABILITY

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

FORECASTING DEPOSIT GROWTH: Forecasting BIF and SAIF Assessable and Insured Deposits

August 2012 EXAMINATIONS Solution Part I

Part 2: Analysis of Relationship Between Two Variables

Forecasting Using Eviews 2.0: An Overview

Estimation of σ 2, the variance of ɛ

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Online Appendix to Are Risk Preferences Stable Across Contexts? Evidence from Insurance Data

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Standard errors of marginal effects in the heteroskedastic probit model

Lecture Notes Module 1

Using SAS Proc Mixed for the Analysis of Clustered Longitudinal Data

11. Analysis of Case-control Studies Logistic Regression

Week TSX Index

The correlation coefficient

Wooldridge, Introductory Econometrics, 4th ed. Chapter 7: Multiple regression analysis with qualitative information: Binary (or dummy) variables

Least Squares Estimation

Confidence Intervals for One Standard Deviation Using Standard Deviation

2013 MBA Jump Start Program. Statistics Module Part 3

SPSS Guide: Regression Analysis

1 Another method of estimation: least squares

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Factor analysis. Angela Montanari

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

GLM I An Introduction to Generalized Linear Models

5. Linear Regression

Multivariate Logistic Regression

Transcription:

Econometrics Midterm (A) March 22 nd, 2010 João Valle e Azevedo Erica Marujo For each question, please identify the correct answer. For each question, there is one and only one correct answer. A correct answer is worth 1 point; an incorrect answer shall be attributed 0 points. Mark your choice on the answer sheet provided at the end of this examination paper. EITHER USE A PENCIL OR DO NOT MAKE CORRECTIONS. Identify your answer sheet with your name and student number. To answer questions 1 through 6, please consider the following Eviews output of a model which aims to study the effect of school size on student performance, in Michigan, in 1993: Dependent Variable: MATH10 Method: Least Squares Sample: Included observations: Coefficient Std. Error t-statistic Prob. C -207.6649 48.70312-4.263892 0.0000 LTOTCOMP 21.15500 4.055549 5.216311 0.0000 LSTAFF 3.980021 4.189660 0.949963 0.3427 LENROLL -1.268047 0.693204-1.829255 0.0681 R-squared 0.065378 Mean dependent var 24.10686 Adjusted R-squared 0.058437 S.D. dependent var 10.49361 S.E. of regression 10.18239 Akaike info criterion 7.488952 Sum squared resid 41887.14 Schwarz criterion 7.528278 Log likelihood -1523.746 Hannan-Quinn criter. 7.504513 F-statistic 9.420035 Durbin-Watson stat 1.666022 Prob(F-statistic) 0.000005 where represents the percentage of students receiving a passing score on the Michigan Educational Assessment Program (MEAP) standardized tenth grade math test; represents average annual teacher compensation, in dollars (a measure of teacher quality); represents number of staff per one thousand students (a measure of how much attention students receive); and stands for student enrollment (which represents school size). 1

1. What is the sample size? a) Approximately 500 b) Approximately 450 c) Approximately 410 d) Approximately 408 e) We don t have enough information to answer this question f) None of the above 2. What is the meaning of the coefficient on? a) If the average annual teacher compensation increases by 1 dollar, then it is expected that the percentage of students receiving a passing score on the MEAP standardized tenth grade math test will be 21,155% higher, on average, ceteris paribus b) If the average annual teacher compensation increases by 1%, then it is expected that the percentage of students receiving a passing score on the MEAP standardized tenth grade math test will be 21,155% higher, on average, ceteris paribus c) If the average annual teacher compensation increases by 1%, then it is expected that the percentage of students receiving a passing score on the MEAP standardized tenth grade math test will increase by 0,21155 percentage points, on average, ceteris paribus d) It is the elasticity of with respect to and it bears the expected sign e) It is the semi-elasticity of with respect to and it bears the expected sign f) Both b) and e) above are correct g) Both b) and d) above are correct h) Both c) and e) above are correct 3. Considering a significance level of 5%, which coefficients are statistically significant (consider a two-sided alternative to perform the test): a) and b) All c) and Just d) Just e) None f) and g) and 2

4. Now assume that another variable, (which represents average teacher salary, in dollars), was omitted from this model. Knowing that:, which Gauss-Markov assumptions are being violated in this model? a) All b) None c) Linearity in Parameters, Zero Conditional Mean and Homoskedasticity d) Zero Conditional Mean e) No Perfect Collinearity f) Random Sampling, Zero Conditional Mean and No Perfect Collinearity g) Zero Conditional Mean and Homoskedasticity h) Both d) and e) i) Both e) and g) 5. About this regression, it is also known the following Coefficient Variance- Covariance matrix: What is the estimate for the t-statistic of the following null hypothesis:? a) b) c) d) e) 1,25 f) We don t have enough information to answer this question g) None of the above 3

6. Now consider the following output: Dependent Variable: MATH10 Method: Least Squares Sample: 1 408 Included observations: 408 Coefficient Std. Error t-statistic Prob. C -168.3782 37.28778-4.515642 0.0000 LTOTCOMP 19.63746 3.873109 5.070207 0.0000 LSTAFF+LENROLL -1.196471 0.691330-1.730679 0.0843 R-squared 0.061747 Mean dependent var 24.10686 Adjusted R-squared 0.057114 S.D. dependent var 10.49361 S.E. of regression 10.18954 Akaike info criterion 7.487927 Sum squared resid 42049.84 Schwarz criterion 7.517421 Log likelihood -1524.537 Hannan-Quinn criter. 7.499598 F-statistic 13.32671 Durbin-Watson stat 1.676991 Prob(F-statistic) 0.000002 Is it possible to test the same null hypothesis as in the previous question,, but using a different test statistic, given the information presented in this output? a) No, the information reported in this output is useless to perform this test b) Yes, and the new test statistic is equal to -1,730679, the t-statistic of the coefficient on c) Yes, and the new test statistic is an F-statistic equal to 1,5695372, approximately d) No, because we still would need the coefficient variance-covariance matrix to compute any type of test statistic e) Both c) and d) f) None of the above 7. Assume that you have estimated the following model:, where is a dummy variable equal to one if a person is unemployed and zero otherwise. The following 90% confidence interval was estimated for :. If we want to test the null: against a one-sided alternative with a 5% significance level then: a) We will reject for sure, because the 95% confidence interval associated with the 5% significance level becomes wider than the one above and so 1,7 will be inside the confidence interval 4

b) We will reject for sure, because the 95% confidence interval associated with the 5% significance level becomes thinner than the one above and so 1,7 will be outside the confidence interval c) We will reject for sure, because the upper extreme of the 90% confidence interval coincides with the critical value associated with the one-sided alternative at a 5% significance level, and so 1,7 will be outside the confidence interval d) We will reject for sure, because the upper extreme of the 90% confidence interval coincides with the critical value associated with the one-sided alternative at a 5% significance level, and so 1,7 will be inside the confidence interval e) None of the above 8. Consider a multiple linear regression model for cross-sectional data that analyses the impact of the demand for football matches tickets on TV audiences of football matches. Among other regressors, you include dummy variables for Sporting, Porto and Benfica (that equal one if a person is, respectively, a supporter of Sporting, Porto and Benfica, and zero otherwise). The No Perfect Collinearity Assumption is not violated in this model as long as: a) The sum of the three dummy variables is not equal to 1 across all observations b) The sum of the three dummy variables is equal to 1 across all observations c) In this case, the No Perfect Collinearity assumption is always violated, no matter what, because the three dummies are perfectly linearly correlated d) In the sample there are people that are not supporters of none of these three football clubs e) Both a) and d) are correct f) Both b) and d) are correct g) None of the above 9. Consider the model: Suppose that when you decided to regress the model, the variable was not included. Then, assuming that the correlation between and is negative, what would be the expected sign in the bias in (estimated by OLS) in that regression, assuming the original (full) model is correctly specified? a) Negative, because and Correlation(, )<0 b) Positive, because and Correlation(, )<0 5

c) Positive, because and Correlation(, )<0, as long as we assume that the correlation between and and the correlation between and is zero d) Positive, because and Correlation(, )<0, as long as we assume that the correlation between and is zero e) Negative, because and Correlation(, )<0, as long as we assume that the correlation between and is zero f) None of the above 10. Now consider the following model: where is a gender dummy equal to 1 if female and zero otherwise, and equals 1 if Portuguese and zero otherwise. If you want to perform the Chow test in this model in relation to gender, then the null hypothesis will be: a) b) c) d) e) None of the above 11. In testing multiple exclusion restrictions in the multiple regression model under the Classical assumptions, we are more likely to fail to reject the null that some coefficients are zero if: a) The Residual sum of squares of the restricted model is large relative to that of the unrestricted model b) The Residual sum of squares of the restricted model is small relative to that of the unrestricted model c) The R-squared of the unrestricted model is large d) The intercept parameter is negative e) Both a) and d) above are correct f) Both c) and d) above are correct g) Both a) and c) above are correct h) Both b) and c) above are correct 6

12. Consider a multiple linear regression for cross-sectional data satisfying the Gauss-Markov assumptions. Suppose you want to test multiple exclusion restrictions in you model. Which of the following is true? a) You cannot perform such test because only the t-test is valid in large samples b) You cannot use the usual F test for multiple exclusion restrictions c) You can use the usual F test for multiple exclusion restrictions d) You can only use the LM test for multiple exclusion restrictions e) You can use the LM test for multiple exclusion restrictions f) Both b) and d) above are correct g) both c) and e) above are correct h) b) and e) above are correct 13. Consider a multiple linear regression model for cross-sectional data. If the Gauss-Markov assumptions hold, then: a) Ds The OLS estimator will be consistent and unbiased b) We can only guarantee the OLS estimator is unbiased c) The OLS estimator is consistent only if the sample size is very large d) The OLS estimator will be biased e) Both b) and c) above are correct f) Both c) and d) above are correct g) Both a) and d) above are correct 14. If under certain assumptions the OLS estimator is BLUE, Best Linear Unbiased Estimator, then we can conclude that it is: a) the minimum variance unbiased estimator b) an unbiased estimator c) a linear estimator d) an estimator e) both b), c) and d) above are NOT correct f) both b), c) and d) above are correct 15. The adjusted R-squared is a good measure to compare the goodness-of-fit of multiple linear regression models because: a) it can be used to compare models with different dependent variables b) it always increases if we include additional regressors c) all else equal, it is lower than the usual (unadjusted) R-squared 7

d) it increases once we add more regressors only if the residual sum of squares does not decrease enough e) it increases once we add more regressors only if the residual sum of squares does not increase enough f) both a) and b) above are correct g) both c) and d) above are correct h) both c) and e) above are correct 16. Consider a multiple linear regression model where the Zero Conditional Mean assumption fails, but the remaining Gauss-Markov assumptions hold. Specifically, suppose E[u X]=δ, where u is the vector of error terms, X is the matrix of regressors and δ has at least one element different than zero. What can you say about the bias (E[ X]-β) of the OLS estimator in this situation? a) the OLS estimator is still BLUE b) the bias equals (X X) -1 X δ c) the bias is still zero, always d) the bias equals X X δ e) the bias equals X δ f) None of the answers above is correct 17. Why would you want to use heteroskedasticity-robust standard errors to perform hypothesis testing on a multiple linear regression model for cross-sectional data? a) because the zero conditional mean assumption might fail b) because there is the risk that the error term of the model is homoskedastic c) because the error term of the model is for sure heteroskedastic, but the form of heteroskedasticity is unknown d) because you are not sure the homoskedasticity assumption holds e) both a) and b) above are correct f) both b) and c) above are correct g) both c) and d) above are correct h) both a) and d) above are correct 8

18. Consider a simple regression model, satisfying the Gauss-Markov assumptions, with the height of the individual as dependent variable and with the only regressor being a dummy variable for gender (equal to 1 if the individual is a male and zero otherwise). What is the proportion of males in the sample that minimizes the variance of the OLS estimator for the coefficient on the dummy variable? a) Any proportion between 25% and 75% b) 33% c) 50% d) 0% e) 100% f) either 50% or 100% g) none of the answers above is correct 19. WLS stands for what in Econometrics? a) Without Large Sample b) With Large Squares c) None of the answers above and below is correct d) None of the answers below is correct e) Weighted Least Squares f) Windows Live Search 20. In a multiple linear regression model, the random sampling assumption implies: a) the variance covariance matrix of the errors u=(u₁,u₂,...,u n ) is diagonal, i.e., all the elements out of the diagonal are equal to zero b) the variance covariance matrix of the errors u=(u₁,u₂,...,u n ) is not diagonal, i.e., at least one off-diagonal element is different than zero c) the variance covariance matrix of the OLS estimator is not diagonal, i.e., at least one off-diagonal element is different than zero d) the variance covariance matrix of the OLS estimator is diagonal e) none of the answers above is correct 9

Answer Sheet Version A Mark your answer with an X in the table below Any answers outside the table will not be considered Either use a pencil or do NOT make corrections Name: Student Number: _ Question Nr. a) b) c) d) e) f) g) h) i) 1. X 2. X 3. X 4. X 5. X 6. X 7. * X 8. * X 9. X 10. X 11. X 12. X 13. X 14. X 15. X 16. X 17. X 18. X 19. X 20. X * Also considered as a correct answer 10