Economics of Strategy (ECON 4550) Maymester 2015 Applications of Regression Analysis


 Corey Hopkins
 5 years ago
 Views:
Transcription
1 Economics of Strategy (ECON 4550) Maymester 015 Applications of Regression Analysis Reading: ACME Clinic (ECON 4550 Coursepak, Page 47) and Big Suzy s Snack Cakes (ECON 4550 Coursepak, Page 51) Definitions and Concepts: Sample Maximum the largest realized value of a variable Sample Minimum the smallest realized value of a variable Dummy Variable a variable that indicates whether an observation is characterized by a particular attribute (typically equal to 1 if the attribute is true and equal to 0 otherwise) Omitted Variable Bias a problem of distorted regression results arising from specifying a model which leaves out one or more important independent variables (i.e., a specification of the true model which is wrong because all of the relevant X variables were not included) For such a bias to arise in linear regression, the omitted variable must (i) be a true determinant of the independent variable and (ii) be strongly correlated with one or more of the other included independent variables If such a relevant independent variable is omitted, then the estimated coefficient on the strongly correlated (included) independent variable is partly measuring the impact of the highly correlated omitted variable Note: an Excel file containing the data used in each of the examples discussed in lecture is posted on the course webpage (
2 1. Estimating an Average Cost Function Consider an automobile manufacturer trying to estimate ATC (q), based on past realizations of Average Total Costs for different levels of Output Assume ATC( q) b0 b1q bq We have data on Average Costs and Quantity of Output for each of the past 6 weeks as follows: Average Costs Quantity Average Costs Quantity 39, , , , , , , , , , , , , , , , , , , , , , , , , ,10 40 Start by computing some descriptive statistics for the variables in our data set: sample mean, sample standard deviation, sample maximum, and sample minimum In practice, this partly serves as a check to potentially identify any errors in the dataset Descriptive Statistics: Average Costs Quantity Mean 33, Std Dev 15, Maximum 71, Minimum 1, In order for our data to match the assumed functional form for Average Costs, we need to do a nonlinear transformation of Quantity (i.e., compute Quantity Squared for each observation) Regression results from Excel
3 Example 1 Estimating an Average Cost Function SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 6 ANOVA df SS MS F Significance F Regression E 06 Residual Total Coefficients Standard Error t Stat P value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept E X Variable X Variable Estimated equation of bˆ 0 bˆ ˆ 1q bq 41, q (0.1781) q Note, all pvalues are small enough so that each estimated coefficient is statistically significant at a 0.1% error level R. 6858
4 (?) What is the Efficient Scale of Production for this firm? (A) Recall, the Efficient Scale of Production is the quantity of output that minimizes Average Total Costs of Production. We have estimated Average Total Costs of Production to be: ATC( q) 41, q (0.1781) q From here, we have: AT C ( q) (0.356) q and AT C( q) AT C( q) 0 for small quantities and AT C( q) 0 for large quantities Average Total Costs are minimized where: AT C( q) (0.356) q 0 ( 0.356) q q Thus, the Efficient Scale of Production is roughly 338 units of output
5 . Estimating Demand Consider a coffee house with retail outlets in 3 markets For each market they have data on annual quantity sold, price per unit, average income, and price set by a rival. Store Number Quantity Sold Price Average Income Rival Price 1 476, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,10.5 5, , , , , , , , , , , , , , , , , , , , , , , , , , , , ,800.0
6 Descriptive Statistics: Quantity Price Income Rival Price Mean 494, , Std Dev 86, , Maximum 67, ,750.5 Minimum 34, , Suppose they conjecture that: B 1 B ( ) _ B quantity A price income rival price 3 a b c Note: ln( x y z ) a ln( x) bln( y) c ln( z) Thus, the demand relation above can be expressed as: quantity ln( A) B1 lnprice B lnincome B lnrival _ price quantity B B lnprice B lnincome B lnrival _ price ln 3 ln We can do a transformation of variables and run a linear regression! Regression results from Excel (see following page) From here, we can essentially undo the previous transformation of variables Note, since B0 ln( A) and B ˆ , it follows that A ˆ exp{ } 1, So, our estimated equation is: B 1 B ( ) _ B quantity A price income rival price 3 price income rival _ quantity price Recognize that fixing income and rival price, this demand function is of the constant elasticity form => price elasticity of demand is p => Elastic Demand Further, Income Elasticity of Demand is I => Normal Good And Cross Price Elasticity of Demand (with respect to rival price) is X, p Y => good in question is a Substitute for the good being sold by the rival firm
7 Example Estimating Demand SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 3 ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept X Variable X Variable X Variable Estimated equation of Bˆ Bˆ lnprice Bˆ lnincome Bˆ ln( rival _ ) is: 0 1 price ( ) 1.681ln( price).6309ln( income).7067ln( rival _ price) Note, all pvalues are small enough so that each estimated coefficient is statistically significant at a 5% error level. R. 5104
8 3. ACME Clinic Page 47 in Coursepak 1. Based upon Exhibit A, are male nurses paid less than female nurses? If so, by how much? Is that difference statistically significant?. What about the clinic s claim that Mr. Jones is appropriately paid if you account for his below average education? Is that supported by the data? If education is the only determinant of compensation, what is a fair estimate of what Mr. Jones salary should be? 3. After conducting your preliminary analysis, you interview supervisors in the clinic and find that years of experience are also highly valued by the clinic. Based on that observation, you request data on the experience of the nurses and receive data contained in Exhibit B. How is you analysis altered if you consider experience as a factor that determines compensation? Is Mr. Jones underpaid according to this analysis? Why not? 4. How do you reconcile the apparent contradiction between your answers above? Exhibit A (with Gender=1 for female and Gender=0 for male ) ID # Salary Education Gender 1 49, , , , , , , , , , , , , , , , , , , ,980 0 Descriptive Statistics: Salary Education Gender Mean 41, Std Dev 11, Maximum 64, Minimum 4,
9 1. Based upon Exhibit A, are male nurses paid less than female nurses? If so, by how much? Is that difference statistically significant? Observe that from the dataset we can compute that the Average Salary of Female nurses is $49,158.33, while the Average Salary of Male nurses is only $31, => Male nurses are paid $17, less! If we run a regression to estimate the equation salary b0 b1( female), we get the results labeled Example 3 ACME Clinic [Regression (i)] So, based upon the results of this regression, it appears as if Male nurses are paid less ($17, less!) than Female nurses Further, this difference is statistically significant at a.01% error level. What about the clinic s claim that Mr. Jones is appropriately paid if you account for his below average education? Is that supported by the data? If education is the only determinant of compensation, what is a fair estimate of what Mr. Jones salary should be? To determine the relation between education and salary (assuming education is the only determinant of salary), run a regression on the equation salary b ( 0 b1 education). Doing so, we get the results labeled Example 3 ACME Clinic [Regression (ii)] So, based upon the results of this regression, it appears that nurses with more education are paid higher salaries Mr. Jones education level (only years) is slightly below the sample mean of (.8) But, by the estimated equation 31, ,906.17( education), the expected salary of a nurse with years of education should be 31, ,906.17() 38, => Mr. Jones salary of only $9,980 is well below this amount Thus, the Clinic s claim that Mr. Jones low salary is accounted for by his below average education is not supported by the data 3. After conducting your preliminary analysis, you interview supervisors in the clinic and find that years of experience are also highly valued by the clinic. Based on that observation, you request data on the experience of the nurses and receive data contained in Exhibit B. How is you analysis altered if you consider experience as a factor that determines compensation? Is Mr. Jones underpaid according to this analysis? Why not?
10 we now have Exhibit B ID # Salary Education Female Experience 1 49, , , , , , , , , , , , , , , , , , , , Descriptive Statistics: Salary Education Gender Experience Mean 41, Std Dev 11, Max 64, Min 4, To determine the relation between salary and all three independent variables, run a regression on salary b0 b1 ( education) b ( female) b3 (experience). Doing so, we get the results labeled Example 3 ACME Clinic [Regression (iii)] So, based upon the results of this regression, there is not statistically significant difference in salaries of females versus males Accounting for Mr. Jones education level (only years) and experience (only 3 years), his expected salary is 19,89.50,054.64() (0) 1,855.88(3) 9, His actual salary of $9,980 is greater than this estimated expected salary (an estimate that takes into account his level of education and experience) => if anything, he s slightly overpaid
11 4. How do you reconcile the apparent contradiction between your answers above? To answer Question (1) we ran a regression for the equation salary b ( 0 b1 female) and found the impact of female to be statistically significant To answer Question (3) we ran a regression for salary b0 b1 ( education) b ( female) b3 ( Experience) and found the impact of education and experience to be statistically significant but the impact of female to not be statistically significant When running this latter regression, we are determining the impact of changes in each independent variable, controlling for differences in each of the other independent variables (recall, for multiple regression the interpretation of each coefficient is along the lines of all other factors fixed ) The regression we ran to answer Question (1) suffers from an Omitted Variables Bias, due to the fact that for this population there is a strong, positive correlation between Female and Experience Recall, definition of Correlation Coefficient: cov( X, Y ) XY s X sy Value of the correlation coefficient between each pair of independent variables: Education Female Experience Education 1 Female Experience Correlation Coefficient between Experience and Female is (.7999), which is fairly close to the upper bound of (1) For the regression we ran to answer Question (1), this was precisely the case Recall, the specified equation for this regression was salary b0 b1( female) We omitted Experience, which is highly correlated with Female => when doing so, the estimated coefficient for Female is actually providing a measure of both gender and the highly correlated experience Once we include both Female and Experience, the coefficient on Female only measures the impact of gender and not the impact of experience => from these results we see that experience has a statistically significant impact on salary, while gender does not Thus, the better results in this case are those from the regression which includes all three potential determinants of salary (i.e., results for the estimation of the equation salary b0 b1 ( education) b ( female) b3 ( Experience), as estimated within our answer to Question 3) => these results do NOT suffer from any Omitted Variable Bias
12 Example 3 ACME Clinic [Regression (i)] SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 0 ANOVA df SS MS F Significance F Regression E 05 Residual Total Coefficients Standard Error t Stat P value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept E X Variable E Estimated equation: bˆ ˆ 0 b1 ( female) 31,180 17,978.33( female) => if we run a regression with only one X variable that happens to be a dummy, then ˆb 0 is equal to the average value of the observations with (dummy)=(0) and ˆb 1 is equal to the difference between average value of the observations with (dummy)=(1) and average value of the observations with (dummy)=(0) Each estimated coefficient is significant at a.01% error level R
13 Example 3 ACME Clinic [Regression (ii)] SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 0 ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept E X Variable Estimated equation: bˆ ˆ 0 b1 ( education) 31, ,906.17( education) Each estimated coefficient is significant at a.01% error level R
14 Example 3 ACME Clinic [Regression (iii)] SUMMARY OUTPUT Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 0 ANOVA df SS MS F Significance F Regression E 19 Residual Total Coefficients Standard Error t Stat P value Lower 95% Upper 95% Lower 95.0% Upper 95.0% Intercept E X Variable E X Variable X Variable E bˆ ˆ ˆ ˆ 0 b1 ( education) b ( female) b3 ( Experience) 19,89.50,054.64( education) ( female) 1,855.88( Experience) R However, the coefficient for the Female dummy variable is no longer statistically significant ("pvalue of.8335)
15 Multiple Choice Questions: 1. refers to a problem of distorted regression results arising from specifying a model which leaves out one or more important independent variables. A. Selection Bias B. A Dummy Variable C. Omitted Variable Bias D. A LogTransformation.. A Dummy Variable A. is typically defined in such a way that it can take on any value between ( 1) and (1), but cannot take on values less than ( 1) or greater than (1). B. can only ever be included in a regression as the Y variable (and never as one of the X variables ). C. indicates whether an observation is characterized by a particular attribute. D. More than one (perhaps all) of the above answers is correct. 3. Henry ran a regression to estimate qx b0 b1 ln( px ) b ln( py ) b3 ln( Inc), where q x denotes quantity of good x, p x denotes price of good x, p y denotes price of good y, and Inc denotes per capita Income in the market for good x. His estimated coefficient values are b ˆ , b ˆ , b ˆ , and b ˆ These results would suggest that A. good x is an inferior good. B. good x is a substitute for good y. C. good x is a complement to good y. D. More than one (perhaps all) of the above answers is correct. 4. Suppose you have the following observations on the value of variable X1 : 9, 10, 1, 16, 13, 18, 7, 10, 9, 6, 14, and 11. For these observations, the Sample Minimum is A. 6. B. 1. C. 1. D. 144.
16 Problem Solving or Short Answer Questions: 1. John is planning on running a regression in order to determine the factors influencing salaries of public school teachers in the state of Georgia. He has obtained data on current salary, level of education, number of years of teaching experience, age, and gender for a random sample of,457 teachers in the state. Every teacher in his sample has at least a Bachelor s degree, but some have a Master s Degree or Doctorate. He has created a dummy variable (named AdvDeg ) to indicate whether or not each individual has one of these advanced degrees. He has also created a dummy variable (named Male ) to indicate the gender of each individual. Before running his regression, he computed Descriptive Statistics for each variable, as reported below: Salary AdvDeg Experience Age Male Mean 45, Std Dev 15, Max 7, Min 8, Based upon these reported values, do you have any observations to offer about his dataset? Explain.. Amy ran a regression to estimate the parameters in the equation y b0 b1 x1 b x b3 x3 b4 x4 In part, her regression results are: Regression Statistics R Square Adjusted R Square Observations 357 Coefficients P value Intercept X Variable X Variable X Variable E 10 X Variable A. Based upon her reported pvalues, which of her coefficient estimates are statistically significant at a 5% error level? Which of her coefficient estimates are statistically significant at a 1% error level? B. Do you have any concerns with her reported regression results? If so, explain.
17 Answer Questions 3 through 5 using the data posted online at: 3. You have been hired by Jim Highland Homes (a custom home builder operating in northern Georgia, northeastern Alabama, northwestern South Carolina, and southwestern North Carolina) to conduct an analysis to determine the factors influencing the price of homes. Specifically, you are given the data contained in the worksheet titled Data for Question 3. This dataset contains observations on Selling Price, Square Footage, and Lot Size (in acres), for a sample of 88 recently sold new homes in a market where Jim Highland Homes is considering starting a new development. Some of these properties were also on either a waterfront lot or a golf course lot, as indicated in the dataset. 3A. Determine the value of Sample Mean, Sample Standard Deviation, Sample Maximum, and Sample Minimum for each of the variables in this dataset. 3B. Run a regression on the equation ( price) b0 b1 ( SqFootage) b ( LotSize) b3 ( Waterfront) b4 ( GolfCourse) and state the estimated coefficient values for this regression. 3C. Based upon the estimated coefficient values, how much of a premium are people willing to pay for a Waterfront Lot? How much of a premium are people willing to pay for a Golf Course Lot? 3D. Which coefficient estimates are statistically significant at a 10% error level? Which coefficient estimates are statistically significant at a 1% error level? 4. Mo, Caleb, and Gene have been hired by the U.S. Federal Trade Commission to conduct a study on the impact of market power on the pricing patterns of firms. They have been provided with the data in the worksheet titled Data for Question 4. This dataset contains observations on Price, Marginal Cost, the value of C4, and the value of HHI for 100 firms operating in 9 different industries with the U.S. 4A. Mo claims, I know from my economics classes that firms with substantial market power charge higher prices than firms with less market power. Since C4 is a good measure of market power, we should run a regression on the equation price) b b ( 4). I am very confident that we will get good results, with ( 0 1 C b ˆ1 0. Run the regression suggested by Mo. Based upon the resulting value of R and the resulting pvalues, would Mo obtain the results that he expects? Explain. 4B. Caleb says, It is true that firms with substantial market power charge higher prices than firms with less market power. But, C4 is not a good measure of market power HHI is a superior measure. We should run a regression on the equation ( price) b0 b1 ( HHI). For this regression we are sure to get good results, with b ˆ1 0. Run the regression suggested by Caleb. Based upon the resulting value of R and the resulting pvalues, would Caleb obtain the results that he expects? Explain. 4C. Gene storms out of the room yelling, I can t work with you idiots. IEPR! IEPR!!! Don t you remember anything from your economics classes!? With this
18 data, if you are going to run a regression it should be on an equation along the lines of either p MC 100 b b ( 1 p 0 C 4) p MC p or 100 b ( 0 b1 HHI). IEPR!!! IEPR!!!!! What is this IEPR that Gene is ranting about? Run the regressions suggested by Gene. Based upon the resulting values of R and the resulting pvalues, are the results of these regressions better than those suggested in parts (4A) and (4B)? 4D. Using the results of the first regression suggested by Gene, what would be the impact on firm pricing of a change in market structure that increases the value of C4 by (5)? Explain. 5. Professor Tufnel teaches an introductory marketing class at a small university near Des Moines, Indiana. He has been accused of gender discrimination (specifically, of giving female students lower grades than male students). Using the data in the worksheet titled Data for Problem 5, you need to evaluate the validity of this accusation. This spreadsheet provides a summary of the Semester Average, Combined SAT Score, Age (a dummy variable indicating if the student is over the age of 5), Gender (a dummy variable indicating if the student is male), and Major of each of the 61 students enrolled in his class during the most recent semester. 5A. Determine the Mean of Semester Average for male students and for female students. How do these two values compare to each other? 5B. Run a regression on the equation ( SemAvg) b0 b1 ( SAT) b ( Over5) b3 ( Male). Based upon the results of this regression, is there evidence of gender discrimination? Is the difference in assigned grades between genders statistically significant at a 1% error level? Explain. 5C. After receiving a report of your results from the regression in part (5B), Professor Tufnel discussed your findings with Professors St. Hubbins and Smalls, two econometricians in his college. They think that there is a major error with the analysis above. They suggest that a regression should be run on the equation ( SemAvg) b0 b1 ( SAT) b ( Over5) b3 ( Male) b4 ( Bus), where (Bus) is a dummy variable indicating whether the student is majoring in one of the three business majors (Economics, Finance, or Marketing) offered by their college. (To assist in the construction of this dummy variable, the business majors have been colorcoded light green in Column E of the spreadsheet.) After running the regression suggested by Professors St. Hubbins and Smalls, does there appear to be any evidence of gender discrimination? Explain. 5D. Determine the value of the correlation coefficient between each pair of the variables (SAT), (Over 5), (Male), and (Bus). Based upon these values, explain the apparent discrepancy between the regression results from (5B) and the regression results from (5C).
19 Answers to Multiple Choice Questions: 1. C. C 3. D 4. A Answers to Problem Solving or Short Answer Questions: 1. Based upon the reported Descriptive Statistics, there appear to be some errors in his dataset. First, the reported minimum values for Salary and Age are each negative. These values do not make sense, since each of these variables should always be positive in value. Second, AdvDeg is a dummy variable, which should only take on a value of either (0) or (1). Thus, the reported maximum value of (15) cannot be correct. Finally, the reported maximum value for Experience is (94). Since this variable is measuring number of years of teaching experience, this reported value is most certainly a mistake. A. Based upon the reported pvalues, her estimates for b 0, b 1, and b 3 are statistically significant at a 5% error level (while the estimates for b and b 4 are not). Further, her estimates for b 1 and b 3 are statistically significant at a 1% error level (while the estimates for b 0, b, and b 4 are not). B. Her reported value for R is approximately The mathematical upper bound for R is a value of (1). Thus, there would seem to be some sort of error with her reported results. 3A. Price Sq Footage Lot Size Waterfront Lot Golf Course Lot mean 43,464.98, std dev 58, max 40,950 4, min 143,800 1, B. The estimated coefficients are: b ˆ0 3, , b ˆ , b ˆ 0, , b ˆ3 8, , and b ˆ4 6, C. These results imply that a home on a Waterfront Lot will sell for a premium of $8,17.1, while a home on a Golf Course Lot will sell for a premium of $6, D. Based upon the obtained pvalues, the estimates for b 1, b, b 3, and b 4 are statistically significant at a 10% error level (while the estimate for b 0 is not). Further, only the estimates for b 1 and b 3 are statistically significant at a 1% error level.
20 4A. For this regression, R The pvalues of (.14969) and (.55740) imply that neither ˆb 0 nor ˆb 1 are statistically significant. So, no, the results of this regression are not good. 4B. For this regression, R The pvalue of (.055) implies that ˆb 1 is not statistically significant. So again, no, the results of this regression are not good. 4C. Gene is ranting about the Inverse Elasticity Pricing Rule. Recall, this rule states that in order to be maximizing profit, a firm must be operating where p MC 1. p p That is, where the markup of price over Marginal Costs (as a percentage of price) is equal to the inverse of the absolute value of Price Elasticity of Demand. Since firms with more market power would tend to face demand for their output that is less elastic (so that the inverse of the absolute value of elasticity is greater in value), we could reasonably expect there to be a positive relation between either C4 or HHI (recall, these are measures of market structure for which a larger value corresponds to a market that is less competitive, in which case firms have more market power ) and any increasing function of p MC p p MC p. By considering 100, Gene is simply suggesting that this percentage increase be stated in such a way to make the values be between (0) p MC p and (100). For the regression on 100 b0 b1( C4), we obtain R.78061, along with pvalues of (.00574) and (4.76E34). These results are much better than those in part (4A). Finally, b ˆ , suggesting a positive relation between (C4) and the percentage markup (as expected). For the regression on p MC 100 b0 b1( HHI) p, we obtain R. 7878, along with pvalues of (5.0E10) and (9.E35). These results are much better than those in part (4B). Finally, b ˆ , suggesting a positive relation between (HHI) and the percentage markup (as expected). 4D. If there were a change in market structure causing C4 to increase in value by (5), we see that using the value of b ˆ from the results of the first regression in part (4C) firms in the industry would increase their expected percentage markup by approximately A. There are a total of 35 male students in the sample. These students have a semester,810 average of There are a total of 6 female students in the sample. These 1,999 students have a semester average of Thus, a simple comparison of 6
21 sample means between genders shows that the mean semester average of male students is higher than that of female students. 5B. Running a regression for ( SemAvg) b0 b1 ( SAT) b ( Over5) b3 ( Male), we obtain b ˆ Based upon the pvalue for this estimated coefficient (of.0019), this estimate is statistically significant at the 1% error level. Thus, these results would seem to provide evidence of gender discrimination, since male the expected semester average of a male student is points above that for a female student, even after controlling for SAT Score and Age. 5C. Running a regression for ( SemAvg) b0 b1 ( SAT) b ( Over5) b3 ( Male) b4 ( Bus), we obtain b ˆ (with a pvalue of.8345). Based upon this pvalue, gender no longer has a statistically significant impact on semester average. That it, once we control for SAT Score, Age, and Major, there no longer appear to be a difference in grades between male and female students. 5D. The numerical values of the six relevant correlation coefficients are: SAT Over 5 Male Business SAT 1 Over Male Business Note that there is a strong, positive correlation between being male and being a business major (implied by the value of above). The regression results from (5C) suggest that while semester averages in this marketing course do not differ between male and female students, there is a substantial, statistically significant difference in performance between business majors and nonbusiness majors (the estimated value of the coefficient attached to (Bus) is b ˆ , with a pvalue of E06). When the dummy variable identifying college major is left out of the regression (as was done in part (5B)), the results suffer from an omitted variable bias, since the estimated coefficient for (Male) (of b ˆ , with a pvalue of.0019) is partly capturing this difference in performance resulting from chosen major. In summary, once we control for SAT Score, Age, and Major, there is no longer any evidence of gender discrimination. Perhaps a better explanation is simply that students who choose to major in a business discipline are likely to be more interested in and perform better in a marketing class (compared to students who have chosen to major in a nonbusiness discipline).
Chapter 3 Quantitative Demand Analysis
Managerial Economics & Business Strategy Chapter 3 uantitative Demand Analysis McGrawHill/Irwin Copyright 2010 by the McGrawHill Companies, Inc. All rights reserved. Overview I. The Elasticity Concept
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationRegression stepbystep using Microsoft Excel
Step 1: Regression stepbystep using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression
More informationc. Given your answer in part (b), what do you anticipate will happen in this market in the longrun?
Perfect Competition Questions Question 1 Suppose there is a perfectly competitive industry where all the firms are identical with identical cost curves. Furthermore, suppose that a representative firm
More informationLearning Objectives. Essential Concepts
Learning Objectives After reading Chapter 7 and working the problems for Chapter 7 in the textbook and in this Workbook, you should be able to: Specify an empirical demand function both linear and nonlinear
More informationFinal Exam Practice Problem Answers
Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal
More informationSession 7 Bivariate Data and Analysis
Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table covariation least squares
More informationChapter 6 Competitive Markets
Chapter 6 Competitive Markets After reading Chapter 6, COMPETITIVE MARKETS, you should be able to: List and explain the characteristics of Perfect Competition and Monopolistic Competition Explain why a
More informationMULTIPLE REGRESSION WITH CATEGORICAL DATA
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting
More informationLecture 2. Marginal Functions, Average Functions, Elasticity, the Marginal Principle, and Constrained Optimization
Lecture 2. Marginal Functions, Average Functions, Elasticity, the Marginal Principle, and Constrained Optimization 2.1. Introduction Suppose that an economic relationship can be described by a realvalued
More informationCoefficient of Determination
Coefficient of Determination The coefficient of determination R 2 (or sometimes r 2 ) is another measure of how well the least squares equation ŷ = b 0 + b 1 x performs as a predictor of y. R 2 is computed
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationChapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
More informationElasticity. I. What is Elasticity?
Elasticity I. What is Elasticity? The purpose of this section is to develop some general rules about elasticity, which may them be applied to the four different specific types of elasticity discussed in
More informationTraditional Conjoint Analysis with Excel
hapter 8 Traditional onjoint nalysis with Excel traditional conjoint analysis may be thought of as a multiple regression problem. The respondent s ratings for the product concepts are observations on the
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means Oneway ANOVA To test the null hypothesis that several population means are equal,
More informationTable of Contents MICRO ECONOMICS
economicsentrance.weebly.com Basic Exercises Micro Economics AKG 09 Table of Contents MICRO ECONOMICS Budget Constraint... 4 Practice problems... 4 Answers... 4 Supply and Demand... 7 Practice Problems...
More informationPrinciples of Economics: Micro: Exam #2: Chapters 110 Page 1 of 9
Principles of Economics: Micro: Exam #2: Chapters 110 Page 1 of 9 print name on the line above as your signature INSTRUCTIONS: 1. This Exam #2 must be completed within the allocated time (i.e., between
More informationHypothesis testing  Steps
Hypothesis testing  Steps Steps to do a twotailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =
More informationDEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
More informationMultiple Linear Regression in Data Mining
Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple
More informationAnswer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade
Statistics Quiz Correlation and Regression  ANSWERS 1. Temperature and air pollution are known to be correlated. We collect data from two laboratories, in Boston and Montreal. Boston makes their measurements
More informationSection 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
More informationSTAT 350 Practice Final Exam Solution (Spring 2015)
PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects
More informationSimple Methods and Procedures Used in Forecasting
Simple Methods and Procedures Used in Forecasting The project prepared by : Sven Gingelmaier Michael Richter Under direction of the Maria JadamusHacura What Is Forecasting? Prediction of future events
More informationMicroeconomics Instructor Miller Practice Problems Labor Market
Microeconomics Instructor Miller Practice Problems Labor Market 1. What is a factor market? A) It is a market where financial instruments are traded. B) It is a market where stocks and bonds are traded.
More informationASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS
DATABASE MARKETING Fall 2015, max 24 credits Dead line 15.10. ASSIGNMENT 4 PREDICTIVE MODELING AND GAINS CHARTS PART A Gains chart with excel Prepare a gains chart from the data in \\work\courses\e\27\e20100\ass4b.xls.
More informationWeek TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480
1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500
More informationor, put slightly differently, the profit maximizing condition is for marginal revenue to equal marginal cost:
Chapter 9 Lecture Notes 1 Economics 35: Intermediate Microeconomics Notes and Sample Questions Chapter 9: Profit Maximization Profit Maximization The basic assumption here is that firms are profit maximizing.
More informationPART A: For each worker, determine that worker's marginal product of labor.
ECON 3310 Homework #4  Solutions 1: Suppose the following indicates how many units of output y you can produce per hour with different levels of labor input (given your current factory capacity): PART
More informationHedge Effectiveness Testing
Hedge Effectiveness Testing Using Regression Analysis Ira G. Kawaller, Ph.D. Kawaller & Company, LLC Reva B. Steinberg BDO Seidman LLP When companies use derivative instruments to hedge economic exposures,
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) 
More informationExamples on Monopoly and Third Degree Price Discrimination
1 Examples on Monopoly and Third Degree Price Discrimination This hand out contains two different parts. In the first, there are examples concerning the profit maximizing strategy for a firm with market
More informationChapter 9 Assessing Studies Based on Multiple Regression
Chapter 9 Assessing Studies Based on Multiple Regression Solutions to Empirical Exercises 1. Age 0.439** (0.030) Age 2 Data from 2004 (1) (2) (3) (4) (5) (6) (7) (8) Dependent Variable AHE ln(ahe) ln(ahe)
More information2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measuresoffit in multiple regression Assumptions
More informationDoing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
More informationA Basic Introduction to Missing Data
John Fox Sociology 740 Winter 2014 Outline Why Missing Data Arise Why Missing Data Arise Global or unit nonresponse. In a survey, certain respondents may be unreachable or may refuse to participate. Item
More informationMonopoly and Monopsony Labor Market Behavior
Monopoly and Monopsony abor Market Behavior 1 Introduction For the purposes of this handout, let s assume that firms operate in just two markets: the market for their product where they are a seller) and
More information" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
More information1 Calculus of Several Variables
1 Calculus of Several Variables Reading: [Simon], Chapter 14, p. 30031. 1.1 Partial Derivatives Let f : R n R. Then for each x i at each point x 0 = (x 0 1,..., x 0 n) the ith partial derivative is defined
More informationQ = ak L + bk L. 2. The properties of a shortrun cubic production function ( Q = AL + BL )
Learning Objectives After reading Chapter 10 and working the problems for Chapter 10 in the textbook and in this Student Workbook, you should be able to: Specify and estimate a shortrun production function
More informationDetermining Future Success of College Students
Undergraduate Economic Review Volume 5 Issue 1 Article 7 2009 Determining Future Success of College Students Paul Oehrlein Illinois Wesleyan University Recommended Citation Oehrlein, Paul (2009) "Determining
More informationAugust 2012 EXAMINATIONS Solution Part I
August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,
More informationCase Study: Alex Charter School Gordon Johnson, California State University, Northridge, USA Raj Kiani, California State University, Northridge, USA
Case Study: Alex Charter School Gordon Johnson, California State University, Northridge, USA Raj Kiani, California State University, Northridge, USA ABSTRACT This case discusses issues associated with
More informationI L L I N O I S UNIVERSITY OF ILLINOIS AT URBANACHAMPAIGN
Beckman HLM Reading Group: Questions, Answers and Examples Carolyn J. Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANACHAMPAIGN Linear Algebra Slide 1 of
More informationGeneralized Linear Models
Generalized Linear Models We have previously worked with regression models where the response variable is quantitative and normally distributed. Now we turn our attention to two types of models where the
More informationReview of Fundamental Mathematics
Review of Fundamental Mathematics As explained in the Preface and in Chapter 1 of your textbook, managerial economics applies microeconomic theory to business decision making. The decisionmaking tools
More informationECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2
University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages
More informationNonlinear Regression Functions. SW Ch 8 1/54/
Nonlinear Regression Functions SW Ch 8 1/54/ The TestScore STR relation looks linear (maybe) SW Ch 8 2/54/ But the TestScore Income relation looks nonlinear... SW Ch 8 3/54/ Nonlinear Regression General
More informationOutline. Topic 4  Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares
Topic 4  Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test  Fall 2013 R 2 and the coefficient of correlation
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationMarginal Person. Average Person. (Average Return of College Goers) Return, Cost. (Average Return in the Population) (Marginal Return)
1 2 3 Marginal Person Average Person (Average Return of College Goers) Return, Cost (Average Return in the Population) 4 (Marginal Return) 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3 Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationChicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011
Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this
More informationSimple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
More informationECON 103, 20082 ANSWERS TO HOME WORK ASSIGNMENTS
ECON 103, 20082 ANSWERS TO HOME WORK ASSIGNMENTS Due the Week of June 23 Chapter 8 WRITE [4] Use the demand schedule that follows to calculate total revenue and marginal revenue at each quantity. Plot
More informationCORRELATION ANALYSIS
CORRELATION ANALYSIS Learning Objectives Understand how correlation can be used to demonstrate a relationship between two factors. Know how to perform a correlation analysis and calculate the coefficient
More informationChapter 23 Inferences About Means
Chapter 23 Inferences About Means Chapter 23  Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300minute
More informationLinear Programming Notes VII Sensitivity Analysis
Linear Programming Notes VII Sensitivity Analysis 1 Introduction When you use a mathematical model to describe reality you must make approximations. The world is more complicated than the kinds of optimization
More informationEconometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
More informationChapter 5 Estimating Demand Functions
Chapter 5 Estimating Demand Functions 1 Why do you need statistics and regression analysis? Ability to read market research papers Analyze your own data in a simple way Assist you in pricing and marketing
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, TTESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, TTESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationNot Your Dad s Magic Eight Ball
Not Your Dad s Magic Eight Ball Prepared for the NCSL Fiscal Analysts Seminar, October 21, 2014 Jim Landers, Office of Fiscal and Management Analysis, Indiana Legislative Services Agency Actual Forecast
More informationStatistical Functions in Excel
Statistical Functions in Excel There are many statistical functions in Excel. Moreover, there are other functions that are not specified as statistical functions that are helpful in some statistical analyses.
More information4 G: Identify, analyze, and synthesize relevant external resources to pose or solve problems. 4 D: Interpret results in the context of a situation.
MAT.HS.PT.4.TUITN.A.298 Sample Item ID: MAT.HS.PT.4.TUITN.A.298 Title: College Tuition Grade: HS Primary Claim: Claim 4: Modeling and Data Analysis Students can analyze complex, realworld scenarios and
More informationDetermination of g using a spring
INTRODUCTION UNIVERSITY OF SURREY DEPARTMENT OF PHYSICS Level 1 Laboratory: Introduction Experiment Determination of g using a spring This experiment is designed to get you confident in using the quantitative
More informationTopic 1  Introduction to Labour Economics. Professor H.J. Schuetze Economics 370. What is Labour Economics?
Topic 1  Introduction to Labour Economics Professor H.J. Schuetze Economics 370 What is Labour Economics? Let s begin by looking at what economics is in general Study of interactions between decision
More informationLearning Objectives. After reading Chapter 11 and working the problems for Chapter 11 in the textbook and in this Workbook, you should be able to:
Learning Objectives After reading Chapter 11 and working the problems for Chapter 11 in the textbook and in this Workbook, you should be able to: Discuss three characteristics of perfectly competitive
More informationPremaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
More informationTest Bias. As we have seen, psychological tests can be wellconceived and wellconstructed, but
Test Bias As we have seen, psychological tests can be wellconceived and wellconstructed, but none are perfect. The reliability of test scores can be compromised by random measurement error (unsystematic
More informationCovariance and Correlation
Covariance and Correlation ( c Robert J. Serfling Not for reproduction or distribution) We have seen how to summarize a databased relative frequency distribution by measures of location and spread, such
More informationProfit Maximization. 2. product homogeneity
Perfectly Competitive Markets It is essentially a market in which there is enough competition that it doesn t make sense to identify your rivals. There are so many competitors that you cannot single out
More informationOptimization: Optimal Pricing with Elasticity
Optimization: Optimal Pricing with Elasticity Short Examples Series using Risk Simulator For more information please visit: www.realoptionsvaluation.com or contact us at: admin@realoptionsvaluation.com
More informationBusiness Valuation Review
Business Valuation Review Regression Analysis in Valuation Engagements By: George B. Hawkins, ASA, CFA Introduction Business valuation is as much as art as it is science. Sage advice, however, quantitative
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
MBA 640 Survey of Microeconomics Fall 2006, Quiz 6 Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A monopoly is best defined as a firm that
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 15 scale to 0100 scores When you look at your report, you will notice that the scores are reported on a 0100 scale, even though respondents
More informationAssociation Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
More informationThe Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests
More informationChapter 7 Monopoly, Oligopoly and Strategy
Chapter 7 Monopoly, Oligopoly and Strategy After reading Chapter 7, MONOPOLY, OLIGOPOLY AND STRATEGY, you should be able to: Define the characteristics of Monopoly and Oligopoly, and explain why the are
More informationPay for performance. Intrinsic (interested in the job as such) Extrinsic motivation. Pay Work environment, nonpay characteristics, benefits
Pay for performance Motivation Intrinsic (interested in the job as such) Extrinsic motivation Pay Work environment, nonpay characteristics, benefits Inefficient to rely on intrinsic motivation only Workers
More informationStatistics 151 Practice Midterm 1 Mike Kowalski
Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and
More informationPredictability Study of ISIP Reading and STAAR Reading: Prediction Bands. March 2014
Predictability Study of ISIP Reading and STAAR Reading: Prediction Bands March 2014 Chalie Patarapichayatham 1, Ph.D. William Fahle 2, Ph.D. Tracey R. Roden 3, M.Ed. 1 Research Assistant Professor in the
More informationPhysics Lab Report Guidelines
Physics Lab Report Guidelines Summary The following is an outline of the requirements for a physics lab report. A. Experimental Description 1. Provide a statement of the physical theory or principle observed
More informationPearson s Correlation
Pearson s Correlation Correlation the degree to which two variables are associated (covary). Covariance may be either positive or negative. Its magnitude depends on the units of measurement. Assumes the
More informationCOST THEORY. I What costs matter? A Opportunity Costs
COST THEORY Cost theory is related to production theory, they are often used together. However, the question is how much to produce, as opposed to which inputs to use. That is, assume that we use production
More informationMultiple Regression. Page 24
Multiple Regression Multiple regression is an extension of simple (bivariate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)
More informationUsing R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
More informationMicroeconomics Topic 6: Be able to explain and calculate average and marginal cost to make production decisions.
Microeconomics Topic 6: Be able to explain and calculate average and marginal cost to make production decisions. Reference: Gregory Mankiw s Principles of Microeconomics, 2 nd edition, Chapter 13. LongRun
More informationNominal, Real and PPP GDP
Nominal, Real and PPP GDP It is crucial in economics to distinguish nominal and real values. This is also the case for GDP. While nominal GDP is easier to understand, real GDP is more important and used
More informationWeek 3&4: Z tables and the Sampling Distribution of X
Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationMicroeconomics Sept. 16, 2010 NOTES ON CALCULUS AND UTILITY FUNCTIONS
DUSP 11.203 Frank Levy Microeconomics Sept. 16, 2010 NOTES ON CALCULUS AND UTILITY FUNCTIONS These notes have three purposes: 1) To explain why some simple calculus formulae are useful in understanding
More informationMissing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional
More informationTesting for Lack of Fit
Chapter 6 Testing for Lack of Fit How can we tell if a model fits the data? If the model is correct then ˆσ 2 should be an unbiased estimate of σ 2. If we have a model which is not complex enough to fit
More information