Discussion Section 4 ECON 139/ Summer Term II
|
|
|
- Bruce Greer
- 9 years ago
- Views:
Transcription
1 Discussion Section 4 ECON 139/ Summer Term II 1. Let s use the CollegeDistance.csv data again. (a) An education advocacy group argues that, on average, a person s educational attainment would increase by approximately 0.15 years in distance to the nearest college decreased by 20 miles. Run a regression of years of completed education (ED) on distance to the nearest college (Dist). Is the advocacy group s claim consistent with the estimated regression? Explain. Solution: the regression model: ED = β 0 + β 1 dist + u the predicted change in ED when dist changes by dist: ED = β 1 dist the argument we want to test: 0.15 = β 1 ( 2) (Note: dist in 10 miles) the null hypothesis: H 0 : β 1 = use "D:\econ139\collegedistance.dta", clear. reg ed dist, robust F( 1, 3794) = R-squared = Root MSE = dist _cons test dist= ( 1) dist = F( 1, 3794) = 0.01 Prob > F = We cannot reject H 0. The advocacy group s claim is consistent with the estimated regression.
2 (b) Other factors also affect how much college a person completes. Does controlling for these other factors change the estimated effect of distance on college years completed? For example, run a regression of ED on Dist, F emale, Black, Hispanic, Bytest, DadColl, M omcoll, Ownhome,Cue80, Stwmf g80, T uition and IncomeHi. Solution:. reg ed dist female black hispanic bytest dadcoll momcoll ownhome cue80 stwmfg80 tuition incomehi, robust F( 12, 3783) = R-squared = Root MSE = dist female black hispanic bytest dadcoll momcoll ownhome cue stwmfg tuition incomehi _cons (c) It has been argued that, controlling for other factors, blacks and Hispanics complete more college than whites. Is this consistent with the regressions that you constructed in part (b)? Page 2
3 Solution:. test black hispanic ( 1) black = 0 ( 2) hispanic = 0 F( 2, 3783) = test black= hispanic ( 1) black - hispanic = 0 F( 1, 3783) = 0.02 Prob > F = The coefficients on blacks and Hispanics are individually significant and jointly significant. They are also positve, so blacks and Hispanics complete more college than whites, holding other factors constant. We can also test if these effects are equal. We cannot reject the null hypothesis that the two coefficients are equal. (d) Test whether β tuition = β ownhome = 0. Solution:. test tuition ownhome ( 1) tuition = 0 ( 2) ownhome = 0 F( 2, 3783) = 4.42 Prob > F = We can reject the null at 5% significance level, but cannot reject the null at 1% significance level. (e) If Dist increases from 20 miles to 30 miles, how are years of education expected to change? If Dist increases from 60 to 70 miles, how are years of education expected to change? Page 3
4 Solution: Since the model is linear in Dist, the marginal effect of Dist on ED is constant, If Dist increases from 20 miles to 30 miles, ED is expected to decrease by If Dist increases from 60 miles to 70 miles, ED is expected to decrease by (f) Run a regression of ED on Dist, Dist 2, F emale, Black, Hispanic, Bytest, DadColl, M omcoll, Ownhome,Cue80, Stwmf g80, T uition and IncomeHi. If Dist increases from 20 miles to 30 miles, how are years of education expected to change? If Dist increases from 60 to 70 miles, how are years of education expected to change? Solution:. gen dist2=dist^2. reg ed dist dist2 female black hispanic bytest dadcoll momcoll ownhome cue80 stwmfg80 tuition incomehi, robust F( 13, 3782) = R-squared = Root MSE = dist dist female black hispanic bytest dadcoll momcoll ownhome cue stwmfg tuition Page 4
5 incomehi _cons dis -.081* *3^2-(-.081* *2^2) dis -.081* *7^2-(-.081* *6^2) (g) Do you prefer the regression that is linear in Dist or the one that is quadratic in Dist? (h) Consider a Hispanic female with T uition = $950, Bytest = 58, Incomehi = 0, Ownhome = 0, DadColl = 1, MomColl = 1, Cue80 = 7.1, and Stwmfg80 = $ Plot the regression relation between Dist and ED for Dist in the range of 0 to 100 miles. Describe the similarities and differences between the estimated regression functions. Would your answer change if you plotted the regression function for a white male with the same characteristics? Solution: Generate one more observation:. edit - preserve - set obs replace female = 1 in replace black = 0 in replace hispanic = 1 in replace bytest = 58 in replace dadcoll = 1 in replace momcoll = 1 in replace ownhome = 0 in replace cue80 = 7.1 in replace stwmfg80 = in replace dist = 0 in replace dist2 = 0 in replace tuition =.950 in replace incomehi = 0 in 3797 Then, predict the value for the new observation when Dist = 0.. reg ed dist female black hispanic bytest dadcoll momcoll Page 5
6 ownhome cue80 stwmfg80 tuition incomehi, robust F( 12, 3783) = R-squared = Root MSE = dist female black hispanic bytest dadcoll momcoll ownhome cue stwmfg tuition incomehi _cons predict ed_hat_linear (option xb assumed; fitted values). reg ed dist dist2 female black hispanic bytest dadcoll momcoll ownhome cue80 stwmfg80 tuition incomehi, robust F( 13, 3782) = R-squared = Root MSE = Page 6
7 dist dist female black hispanic bytest dadcoll momcoll ownhome cue stwmfg tuition incomehi _cons predict ed_hat_quad (option xb assumed; fitted values) Have a look at the predicted value for the new observation:. count list if _n==3797 ED h at l inear = ED h at q uad = Plot the regression relation between Dist and ED:. twoway (function y_quad= *x *x^2, range(0 10)) (function y_linear= *x, range(0 10)) For a white male with the same characteristics: only the intercept changes, the slopes remain the same. (i) Add the interaction term DadColl M omcoll to the regression. What does the coefficient on the interaction term measure? Page 7
8 Solution:. gen dadmom= dadcoll* momcoll. reg ed dist dist2 female black hispanic bytest dadcoll dadmom momcoll ownhome cue80 stwmfg80 tuition incomehi, robust F( 14, 3781) = R-squared = Root MSE = dist dist female black hispanic bytest dadcoll dadmom momcoll ownhome cue stwmfg tuition incomehi _cons (j) Is there any evidence that the effect of Dist on ED depends on the family s income? Solution:. gen incdist= incomehi*dist Page 8
9 . gen incdist2= incomehi*dist2. reg ed dist dist2 female black hispanic bytest dadcoll dadmom momcoll ownhome cue80 stwmfg80 tuition incomehi incdist incdist2, robust F( 16, 3779) = R-squared = Root MSE = dist dist female black hispanic bytest dadcoll dadmom momcoll ownhome cue stwmfg tuition incomehi incdist incdist _cons test incdist incdist2 ( 1) incdist = 0 Page 9
10 ( 2) incdist2 = 0 F( 2, 3779) = 2.34 Prob > F = On the course website you will find a dataset (pntsprd.csv) containing data on the Las Vegas point spreads for 553 men s college basketball games from the season. The variable favwin is a binary variable that equals 1 if the team favored by the Las Vegas spread wins. The variable spread measures the amount by which the favored team is expected to win. (a) A linear probability model to estimate the probability that the favored team wins is P (favwin = 1 spread) = β 0 + β 1 spread Explain why, if the spread incorporates all relevant information, we expect β 0 =.5. (Hint: if we think that the predicted point spread in the game is zero, spread = 0, then what should that say about the chances that our team is going to win?) Solution: If spread is zero, there is no favorite, and the probability that the team we (arbitrarily) label the favorite should have a 50% chance of winning. (b) Estimate the model from part a) by OLS. Test H 0 : β 0 =.5 against a two-sided alternative. Solution: The linear probability model estimated by OLS yields:. reg favwin spread, robust Linear regression Number of obs = 553 F( 1,551) = R-squared = Root MSE = favwin Coef. Std. Err. t P> t Page 10
11 spread _cons Using the robust standard error leads to strong rejection of H 0 at the 2% level against a two-sided alternative: t = = (c) Is the spread statistically significant? What is the estimated probability that the favored team wins when spread = 10? Solution: As we expect, spread is very statistically significant, with t = If spread = 10 the estimated probability that the favored team wins is (10) =.771. (d) Now estimate a probit model for P (favwin = 1 spread). Interpret and test the null hypothesis that the intercept is zero. Solution: The probit results are:. probit favwin spread Iteration 0: log likelihood = Iteration 1: log likelihood = Iteration 2: log likelihood = Iteration 3: log likelihood = Iteration 4: log likelihood = Probit estimates Number of obs = 553 LR chi2(1) = Prob > chi2 = Log likelihood = Pseudo R2 = favwin Coef. Std. Err. z P> z spread _cons In the Probit model P (favwin = 1 spread) = Φ (β 0 + β 1 spread) Page 11
12 where Φ ( ) denotes the standard normal cdf, if β 0 = 0 then P (favwin = 1 spread) = Φ (β 1 spread) and, in particular, P (favwin = 1 spread = 0) = Φ (0) =.5. This is the analog of testing whether the intercept is.5 in the LPM. The t-statistic for testing H 0 : β 0 = 0 is only about.102, so we do not reject H 0. (e) Use the probit model to estimate the probability that the favored team wins when spread = 10. Compare this with the LPM estimate from part c). Solution: When spread = 10 the predicted response probability from the estimated probit model is Φ ( (10)) = Φ (.9144) =.820 This is somewhat above the estimate for the LPM. (f) Repeat only part e) using a logit model. Solution: The logit results are. logit favwin spread Iteration 0: log likelihood = Iteration 1: log likelihood = Iteration 2: log likelihood = Iteration 3: log likelihood = Iteration 4: log likelihood = Iteration 5: log likelihood = Logit estimates Number of obs = 553 LR chi2(1) = Prob > chi2 = Log likelihood = Pseudo R2 = favwin Coef. Std. Err. z P> z spread _cons Page 12
13 When spread = 10 the predicted response probability from the estimated logit model is F ( (10)) = e1.56 = e1.56 This is somewhat above both the estimate for the LPM and the probit. Page 13
ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: IV AND CONTROL FUNCTIONS, II Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Quantile Treatment Effects 2. Control Functions
Nonlinear Regression Functions. SW Ch 8 1/54/
Nonlinear Regression Functions SW Ch 8 1/54/ The TestScore STR relation looks linear (maybe) SW Ch 8 2/54/ But the TestScore Income relation looks nonlinear... SW Ch 8 3/54/ Nonlinear Regression General
Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 21, 2015
Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 21, 2015 References: Long 1997, Long and Freese 2003 & 2006 & 2014,
MULTIPLE REGRESSION EXAMPLE
MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if
Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY
Statistics 104 Final Project A Culture of Debt: A Study of Credit Card Spending in America TF: Kevin Rader Anonymous Students: LD, MH, IW, MY ABSTRACT: This project attempted to determine the relationship
IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results
IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the
ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
Multinomial and Ordinal Logistic Regression
Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,
Department of Economics Session 2012/2013. EC352 Econometric Methods. Solutions to Exercises from Week 10 + 0.0077 (0.052)
Department of Economics Session 2012/2013 University of Essex Spring Term Dr Gordon Kemp EC352 Econometric Methods Solutions to Exercises from Week 10 1 Problem 13.7 This exercise refers back to Equation
August 2012 EXAMINATIONS Solution Part I
August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,
Rockefeller College University at Albany
Rockefeller College University at Albany PAD 705 Handout: Hypothesis Testing on Multiple Parameters In many cases we may wish to know whether two or more variables are jointly significant in a regression.
Handling missing data in Stata a whirlwind tour
Handling missing data in Stata a whirlwind tour 2012 Italian Stata Users Group Meeting Jonathan Bartlett www.missingdata.org.uk 20th September 2012 1/55 Outline The problem of missing data and a principled
MODEL I: DRINK REGRESSED ON GPA & MALE, WITHOUT CENTERING
Interpreting Interaction Effects; Interaction Effects and Centering Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Models with interaction effects
International Statistical Institute, 56th Session, 2007: Phil Everson
Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: [email protected] 1. Introduction
Standard errors of marginal effects in the heteroskedastic probit model
Standard errors of marginal effects in the heteroskedastic probit model Thomas Cornelißen Discussion Paper No. 320 August 2005 ISSN: 0949 9962 Abstract In non-linear regression models, such as the heteroskedastic
Interaction effects between continuous variables (Optional)
Interaction effects between continuous variables (Optional) Richard Williams, University of Notre Dame, http://www.nd.edu/~rwilliam/ Last revised February 0, 05 This is a very brief overview of this somewhat
Failure to take the sampling scheme into account can lead to inaccurate point estimates and/or flawed estimates of the standard errors.
Analyzing Complex Survey Data: Some key issues to be aware of Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 24, 2015 Rather than repeat material that is
Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015
Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015 Stata Example (See appendices for full example).. use http://www.nd.edu/~rwilliam/stats2/statafiles/multicoll.dta,
Interaction effects and group comparisons Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015
Interaction effects and group comparisons Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Note: This handout assumes you understand factor variables,
From this it is not clear what sort of variable that insure is so list the first 10 observations.
MNL in Stata We have data on the type of health insurance available to 616 psychologically depressed subjects in the United States (Tarlov et al. 1989, JAMA; Wells et al. 1989, JAMA). The insurance is
Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software
STATA Tutorial Professor Erdinç Please follow the directions once you locate the Stata software in your computer. Room 114 (Business Lab) has computers with Stata software 1.Wald Test Wald Test is used
Linear Regression Models with Logarithmic Transformations
Linear Regression Models with Logarithmic Transformations Kenneth Benoit Methodology Institute London School of Economics [email protected] March 17, 2011 1 Logarithmic transformations of variables Considering
Chapter 9 Assessing Studies Based on Multiple Regression
Chapter 9 Assessing Studies Based on Multiple Regression Solutions to Empirical Exercises 1. Age 0.439** (0.030) Age 2 Data from 2004 (1) (2) (3) (4) (5) (6) (7) (8) Dependent Variable AHE ln(ahe) ln(ahe)
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009
HURDLE AND SELECTION MODELS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. A General Formulation 3. Truncated Normal Hurdle Model 4. Lognormal
Nonlinear relationships Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised February 20, 2015
Nonlinear relationships Richard Williams, University of Notre Dame, http://www.nd.edu/~rwilliam/ Last revised February, 5 Sources: Berry & Feldman s Multiple Regression in Practice 985; Pindyck and Rubinfeld
Lab 5 Linear Regression with Within-subject Correlation. Goals: Data: Use the pig data which is in wide format:
Lab 5 Linear Regression with Within-subject Correlation Goals: Data: Fit linear regression models that account for within-subject correlation using Stata. Compare weighted least square, GEE, and random
25 Working with categorical data and factor variables
25 Working with categorical data and factor variables Contents 25.1 Continuous, categorical, and indicator variables 25.1.1 Converting continuous variables to indicator variables 25.1.2 Converting continuous
Title. Syntax. stata.com. fp Fractional polynomial regression. Estimation
Title stata.com fp Fractional polynomial regression Syntax Menu Description Options for fp Options for fp generate Remarks and examples Stored results Methods and formulas Acknowledgment References Also
DETERMINANTS OF CAPITAL ADEQUACY RATIO IN SELECTED BOSNIAN BANKS
DETERMINANTS OF CAPITAL ADEQUACY RATIO IN SELECTED BOSNIAN BANKS Nađa DRECA International University of Sarajevo [email protected] Abstract The analysis of a data set of observation for 10
III. INTRODUCTION TO LOGISTIC REGRESSION. a) Example: APACHE II Score and Mortality in Sepsis
III. INTRODUCTION TO LOGISTIC REGRESSION 1. Simple Logistic Regression a) Example: APACHE II Score and Mortality in Sepsis The following figure shows 30 day mortality in a sample of septic patients as
Econometrics I: Econometric Methods
Econometrics I: Econometric Methods Jürgen Meinecke Research School of Economics, Australian National University 24 May, 2016 Housekeeping Assignment 2 is now history The ps tute this week will go through
11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
Quick Stata Guide by Liz Foster
by Liz Foster Table of Contents Part 1: 1 describe 1 generate 1 regress 3 scatter 4 sort 5 summarize 5 table 6 tabulate 8 test 10 ttest 11 Part 2: Prefixes and Notes 14 by var: 14 capture 14 use of the
Chapter 18. Effect modification and interactions. 18.1 Modeling effect modification
Chapter 18 Effect modification and interactions 18.1 Modeling effect modification weight 40 50 60 70 80 90 100 male female 40 50 60 70 80 90 100 male female 30 40 50 70 dose 30 40 50 70 dose Figure 18.1:
Generalized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
SPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
Basic Statistical and Modeling Procedures Using SAS
Basic Statistical and Modeling Procedures Using SAS One-Sample Tests The statistical procedures illustrated in this handout use two datasets. The first, Pulse, has information collected in a classroom
From the help desk: Swamy s random-coefficients model
The Stata Journal (2003) 3, Number 3, pp. 302 308 From the help desk: Swamy s random-coefficients model Brian P. Poi Stata Corporation Abstract. This article discusses the Swamy (1970) random-coefficients
We extended the additive model in two variables to the interaction model by adding a third term to the equation.
Quadratic Models We extended the additive model in two variables to the interaction model by adding a third term to the equation. Similarly, we can extend the linear model in one variable to the quadratic
Ordinal Regression. Chapter
Ordinal Regression Chapter 4 Many variables of interest are ordinal. That is, you can rank the values, but the real distance between categories is unknown. Diseases are graded on scales from least severe
Correlation and Regression
Correlation and Regression Scatterplots Correlation Explanatory and response variables Simple linear regression General Principles of Data Analysis First plot the data, then add numerical summaries Look
Regression with a Binary Dependent Variable
Regression with a Binary Dependent Variable Chapter 9 Michael Ash CPPA Lecture 22 Course Notes Endgame Take-home final Distributed Friday 19 May Due Tuesday 23 May (Paper or emailed PDF ok; no Word, Excel,
Pearson's Correlation Tests
Chapter 800 Pearson's Correlation Tests Introduction The correlation coefficient, ρ (rho), is a popular statistic for describing the strength of the relationship between two variables. The correlation
Factors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
Sample Size Calculation for Longitudinal Studies
Sample Size Calculation for Longitudinal Studies Phil Schumm Department of Health Studies University of Chicago August 23, 2004 (Supported by National Institute on Aging grant P01 AG18911-01A1) Introduction
Lecture 15. Endogeneity & Instrumental Variable Estimation
Lecture 15. Endogeneity & Instrumental Variable Estimation Saw that measurement error (on right hand side) means that OLS will be biased (biased toward zero) Potential solution to endogeneity instrumental
Module 14: Missing Data Stata Practical
Module 14: Missing Data Stata Practical Jonathan Bartlett & James Carpenter London School of Hygiene & Tropical Medicine www.missingdata.org.uk Supported by ESRC grant RES 189-25-0103 and MRC grant G0900724
Comparing Nested Models
Comparing Nested Models ST 430/514 Two models are nested if one model contains all the terms of the other, and at least one additional term. The larger model is the complete (or full) model, and the smaller
From the help desk: hurdle models
The Stata Journal (2003) 3, Number 2, pp. 178 184 From the help desk: hurdle models Allen McDowell Stata Corporation Abstract. This article demonstrates that, although there is no command in Stata for
ONLINE SPORTS GAMBLING: A LOOK INTO THE EFFICIENCY OF BOOKMAKERS ODDS AS FORECASTS IN THE CASE OF ENGLISH PREMIER LEAGUE
ONLINE SPORTS GAMBLING: A LOOK INTO THE EFFICIENCY OF BOOKMAKERS ODDS AS FORECASTS IN THE CASE OF ENGLISH PREMIER LEAGUE By: Jasmine Siwei Xu Thesis Advisor: Professor Roger Craine Undergraduate Economics
Using R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
Nested Logit. Brad Jones 1. April 30, 2008. University of California, Davis. 1 Department of Political Science. POL 213: Research Methods
Nested Logit Brad 1 1 Department of Political Science University of California, Davis April 30, 2008 Nested Logit Interesting model that does not have IIA property. Possible candidate model for structured
Hypothesis testing - Steps
Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
Estimation of σ 2, the variance of ɛ
Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated
2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb 2 2015
Stat 412/512 CASE INFLUENCE STATISTICS Feb 2 2015 Charlotte Wickham stat512.cwick.co.nz Regression in your field See website. You may complete this assignment in pairs. Find a journal article in your field
Logit and Probit. Brad Jones 1. April 21, 2009. University of California, Davis. Bradford S. Jones, UC-Davis, Dept. of Political Science
Logit and Probit Brad 1 1 Department of Political Science University of California, Davis April 21, 2009 Logit, redux Logit resolves the functional form problem (in terms of the response function in the
Forecasting in STATA: Tools and Tricks
Forecasting in STATA: Tools and Tricks Introduction This manual is intended to be a reference guide for time series forecasting in STATA. It will be updated periodically during the semester, and will be
I n d i a n a U n i v e r s i t y U n i v e r s i t y I n f o r m a t i o n T e c h n o l o g y S e r v i c e s
I n d i a n a U n i v e r s i t y U n i v e r s i t y I n f o r m a t i o n T e c h n o l o g y S e r v i c e s Linear Regression Models for Panel Data Using SAS, Stata, LIMDEP, and SPSS * Hun Myoung Park,
Correlated Random Effects Panel Data Models
INTRODUCTION AND LINEAR MODELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. The Linear
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
MARKET EFFECIENCY: IS THE NFL BETTING MARKET EFFICIENT? By: Alexander Kuper. Thesis Advisor: Professor Roger Craine
MARKET EFFECIENCY: IS THE NFL BETTING MARKET EFFICIENT? By: Alexander Kuper Thesis Advisor: Professor Roger Craine Economics Undergraduate Honors Thesis University of California Berkeley May 2012 Acknowledgements:
An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA
ABSTRACT An Introduction to Statistical Tests for the SAS Programmer Sara Beck, Fred Hutchinson Cancer Research Center, Seattle, WA Often SAS Programmers find themselves in situations where performing
Interaction between quantitative predictors
Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors
2. What is the general linear model to be used to model linear trend? (Write out the model) = + + + or
Simple and Multiple Regression Analysis Example: Explore the relationships among Month, Adv.$ and Sales $: 1. Prepare a scatter plot of these data. The scatter plots for Adv.$ versus Sales, and Month versus
Wooldridge, Introductory Econometrics, 4th ed. Chapter 7: Multiple regression analysis with qualitative information: Binary (or dummy) variables
Wooldridge, Introductory Econometrics, 4th ed. Chapter 7: Multiple regression analysis with qualitative information: Binary (or dummy) variables We often consider relationships between observed outcomes
1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
Competing-risks regression
Competing-risks regression Roberto G. Gutierrez Director of Statistics StataCorp LP Stata Conference Boston 2010 R. Gutierrez (StataCorp) Competing-risks regression July 15-16, 2010 1 / 26 Outline 1. Overview
Does NFL Spread Betting Obey the E cient Market Hypothesis?
Does NFL Spread Betting Obey the E cient Market Hypothesis? Johannes Harkins May 2013 Abstract In this paper I examine the possibility that NFL spread betting does not obey the strong form of the E cient
I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Linear Algebra Slide 1 of
Introduction to Time Series Regression and Forecasting
Introduction to Time Series Regression and Forecasting (SW Chapter 14) Time series data are data collected on the same observational unit at multiple time periods Aggregate consumption and GDP for a country
Stata Walkthrough 4: Regression, Prediction, and Forecasting
Stata Walkthrough 4: Regression, Prediction, and Forecasting Over drinks the other evening, my neighbor told me about his 25-year-old nephew, who is dating a 35-year-old woman. God, I can t see them getting
outreg help pages Write formatted regression output to a text file After any estimation command: (Text-related options)
outreg help pages OUTREG HELP PAGES... 1 DESCRIPTION... 2 OPTIONS... 3 1. Text-related options... 3 2. Coefficient options... 4 3. Options for t statistics, standard errors, etc... 5 4. Statistics options...
Forecasting the US Dollar / Euro Exchange rate Using ARMA Models
Forecasting the US Dollar / Euro Exchange rate Using ARMA Models LIUWEI (9906360) - 1 - ABSTRACT...3 1. INTRODUCTION...4 2. DATA ANALYSIS...5 2.1 Stationary estimation...5 2.2 Dickey-Fuller Test...6 3.
Chapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents
Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén
A Simple Feasible Alternative Procedure to Estimate Models with High-Dimensional Fixed Effects
DISCUSSION PAPER SERIES IZA DP No. 3935 A Simple Feasible Alternative Procedure to Estimate Models with High-Dimensional Fixed Effects Paulo Guimarães Pedro Portugal January 2009 Forschungsinstitut zur
The following postestimation commands for time series are available for regress:
Title stata.com regress postestimation time series Postestimation tools for regress with time series Description Syntax for estat archlm Options for estat archlm Syntax for estat bgodfrey Options for estat
especially with continuous
Handling interactions in Stata, especially with continuous predictors Patrick Royston & Willi Sauerbrei German Stata Users meeting, Berlin, 1 June 2012 Interactions general concepts General idea of a (two-way)
Logistic Regression. Jia Li. Department of Statistics The Pennsylvania State University. Logistic Regression
Logistic Regression Department of Statistics The Pennsylvania State University Email: [email protected] Logistic Regression Preserve linear classification boundaries. By the Bayes rule: Ĝ(x) = arg max
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
Section 6: Model Selection, Logistic Regression and more...
Section 6: Model Selection, Logistic Regression and more... Carlos M. Carvalho The University of Texas McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Model Building
1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ
STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material
xtmixed & denominator degrees of freedom: myth or magic
xtmixed & denominator degrees of freedom: myth or magic 2011 Chicago Stata Conference Phil Ender UCLA Statistical Consulting Group July 2011 Phil Ender xtmixed & denominator degrees of freedom: myth or
Food Expenditures: The Effect of a Vegetarian Diet and Organic Foods
Food Expenditures: The Effect of a Vegetarian Diet and Organic Foods Ann-Renée Guillemette, Dept. of FARE, M.Sc Student John Cranfield, Professor in Dept. of FARE Food Expenditures: The Effect of a Vegetarian
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries
Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma [email protected] The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association
Part 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
Point Biserial Correlation Tests
Chapter 807 Point Biserial Correlation Tests Introduction The point biserial correlation coefficient (ρ in this chapter) is the product-moment correlation calculated between a continuous random variable
Online Appendix The Earnings Returns to Graduating with Honors - Evidence from Law Graduates
Online Appendix The Earnings Returns to Graduating with Honors - Evidence from Law Graduates Ronny Freier Mathias Schumann Thomas Siedler February 13, 2015 DIW Berlin and Free University Berlin. Universität
Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation
The average hotel manager recognizes the criticality of forecasting. However, most
Introduction The average hotel manager recognizes the criticality of forecasting. However, most managers are either frustrated by complex models researchers constructed or appalled by the amount of time
MODELING AUTO INSURANCE PREMIUMS
MODELING AUTO INSURANCE PREMIUMS Brittany Parahus, Siena College INTRODUCTION The findings in this paper will provide the reader with a basic knowledge and understanding of how Auto Insurance Companies
