Multiple Regression Exercises
|
|
- Robyn Watkins
- 7 years ago
- Views:
Transcription
1 Multiple Regression Exercises 1. In a study to predict the sale price of a residential property (dollars), data is taken on 20 randomly selected properties. The potential predictors in the study are appraised land value (dollars), appraised value of improvements (dollars), and area of property living space (square feet), and the data is stored in the SPSS data file realestate (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Does the data appear to be observational or experimental? Since the land value, improvement value, and area are all random, the data is observational. (b) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that since there are no dummy variables in this multiple regression model, Step 7 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 1(b) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 1
2 1. - continued Multiple Regression Exercises 2
3 1. - continued Multiple Regression Exercises 3
4 1. - continued (c) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption For each of the predictors land value, improvements, and area, the data points appear to be randomly distributed about the least squares line on the corresponding scatter plot. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals plotted against standardized predicted values looks reasonably uniform around the horizontal line. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions appear to be satisfied, we feel it is appropriate to proceed with the multiple regression analysis. Multiple Regression Exercises 4
5 1. - continued (d) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. Since the correlation matrix does not contain any correlation greater than 0.8 for any pair of independent variables, and tolerance > 0.10 (i.e., VIF < 10) for each independent variable, there is no indication that multicollinearity will be a problem. (e) Write the results of the f test in the ANOVA table for the regression to predict the sale price with all three potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict sale price from land value, improvements, and area is statistically significant at the 0.05 level (f 3, 16 = , f 3, 16; 0.05 = 3.24, p < 0.001). We conclude that the overall regression is significant (i.e., at least one coefficient in the regression is different from zero). (f) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that since there are no dummy variables in this multiple regression model, Step 2 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 1(f) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 5
6 1. - continued Multiple Regression Exercises 6
7 1. - continued (g) Write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. There were two steps in the stepwise multiple regression with a 0.05 significance level for entry and a 0.10 significance level for removal. In the first step, the variable appraised value of improvements was entered, which explained 83.8% of the variance in sale price. In the second step, the variable area of property living space was entered, which explained an additional 4.3% of the variance in sale price. No variable was removed. The two independent variables in this final step explained a total of 88.1% of the variance in sale price. Multiple Regression Exercises 7
8 1. - continued (h) Write the estimated regression equation from the final step of the stepwise regression, and use this regression equation to predict the sale price of a residential property where the appraised land value is $8000, the appraised value of improvements is $20,000, and area of property living space is 1200 square feet. ^ sale_prc = (impr_val) (area) (20000) (1200) = $38, (i) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one dollar in appraised value of improvements, the sale price increases on average by about $0.96. For each increase of one square foot in area of property living space, the sale price increases on average by about $ Multiple Regression Exercises 8
9 1. - continued (j) From the Correlations table of the SPSS output, find the ordinary correlation between the dependent variable sale price and the first independent variable entered into the model. The correlation between sale price and appraised value of improvements is (k) From the Excluded Variables table of the SPSS output, find the partial correlation between the dependent variable sale price and the second independent variable entered into the model given the first independent variable entered into the model; compare this to the ordinary correlation between the dependent variable sale price and the second independent variable entered into the model, which can be found from the Correlations table of the SPSS output. The partial correlation between sale price and area of property living space given appraised value of improvements is The ordinary correlation between sale price and area of property living space is Multiple Regression Exercises 9
10 2. In a study to predict the drying time (hours) for an outdoor house paint, data is taken on 22 house painting jobs. The potential predictors in the study are temperature (degrees Fahrenheit), humidity (percent), wind velocity (miles per hour), and barometric pressure, and the data is stored in the SPSS data file paint (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Does the data appear to be observational or experimental? Since the temperature, humidity, wind velocity, and barometric pressure are all random, the data is observational. (b) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that since there are no dummy variables in this multiple regression model, Step 7 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 2(b) Verify that your SPSS output contains all of the following: four scatter plots, each displaying the least squares line for one of the quantitative predictors; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Multiple Regression Exercises 10
11 2. - continued Create a Word document named Multiple_Regression_Result_Summaries with a section titled Multiple Regression Exercises 2. In this section, create a subsection for each of parts (c), (d), and (e) which follow, and in each subsection created, write the summaries for the corresponding part. Print the page(s) and insert them immediately after this page. (c) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption the uniform variance (homoscedasticity) assumption the normality assumption (d) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. (e) Write the results of the f test in the ANOVA table for the regression to predict the drying time with all four potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. Multiple Regression Exercises 11
12 2. - continued (f) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that since there are no dummy variables in this multiple regression model, Step 2 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 2(f) Verify that your SPSS output contains all of the following: a table titled Variables Entered/Removed; a table titled Model Summary; a table titled ANOVA; a table titled Coefficients; a table titled Excluded Variables. (g) In the section titled Multiple Regression Exercises 2 of the Word document named Multiple_Regression_Result_Summaries (previously created), add a subsection for this part where you write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. Multiple Regression Exercises 12
13 2. - continued (h) Write the estimated regression equation from the final step of the stepwise regression, and use this regression equation to predict drying time with a temperature of 75 degrees Fahrenheit, a relative humidity of 55%, a wind velocity of 15 miles per hour, and a barometric pressure of ^ drying time = (temperature) 0.628(wind velocity) drying time = (75) 0.628(15) = hours (i) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one degree Fahrenheit in temperature, the drying time decreases on average by about hours. For each increase of one mile per hour in wind velocity, the drying time decreases on average by about hours. Multiple Regression Exercises 13
14 2. - continued (j) From the Correlations table of the SPSS output, find the ordinary correlation between the dependent variable paint drying time and the first independent variable entered into the model. The correlation between paint drying time and temperature is (k) From the Excluded Variables table of the SPSS output, find the partial correlation between the dependent variable paint drying time and the second independent variable entered into the model given the first independent variable entered into the model; compare this to the ordinary correlation between the dependent variable paint drying time and the second independent variable entered into the model, which can be found from the Correlations table of the SPSS output. The partial correlation between paint drying time and wind velocity given temperature is The ordinary correlation between paint drying time and wind velocity is Multiple Regression Exercises 14
15 3. Recall that in Basic Statistics Exercise #34 the lifetime of light bulbs was being studied for three brands named Brite, Softlite, and Nodark (i.e., the relationship between brand and lifetime). A 0.05 significance level was used with a one-way ANOVA to see if there is any evidence that mean lifetime is not the same for the brands Brite, Softlite, and Nodark. Light bulbs were randomly selected from each brand, and the lifetimes in hours were recorded as follows: Brite Softlite Nodark (a) Define dummy variables necessary to represent the qualitative variable brand of light bulb. (b) Create an SPSS data file which consists of a variable lifetime, for the lifetimes recorded in the data, and the dummy variables necessary to represent the qualitative variable brand of light bulb. Use SPSS to obtain the ANOVA table for the multiple regression to predict lifetime from brand of light bulb, and compare this ANOVA table to the one-way ANOVA table in Basic Statistics Exercise #34. Multiple Regression Exercises 15
16 4. Recall that in Basic Statistics Exercise #35 the mean length of fish was being studied for North Lake, Blue Lake, Harvey Lake (i.e., the relationship between Lake and length of fish) is to be studied. A 0.05 significance level was used with a one-way ANOVA to see if there is any evidence that mean length of fish is not the same for North Lake, Blue Lake, and Harvey Lake. Fish were randomly selected from each lake, and the lengths in inches were recorded as follows: North Blue Harvey (a) Define dummy variables necessary to represent the qualitative variable Lake. (b) Create an SPSS data file which consists of a variable length, for the lengths recorded in the data, and the dummy variables necessary to represent the qualitative variable Lake. Use SPSS to obtain the ANOVA table for the multiple regression to predict length from Lake, and compare this ANOVA table to the one-way ANOVA table in Basic Statistics Exercise #35. Multiple Regression Exercises 16
17 5. A company conducts a study to see how diastolic blood pressure is influenced by an employee s age, weight, and job stress level classified as high stress, some stress, and low stress. Data recorded on 24 employees treated as a random sample has been stored in the SPSS data file jobstress. A 0.05 significance level is chosen for hypothesis testing. (a) List the independent variables, and indicate whether each is quantitative or qualitative. age weight job stress level quantitative quantitative qualitative (b) Define all possible dummy variables which can be used to represent each qualitative independent variable. Any two of these indicator (dummy) variables is sufficient to represent the qualitative independent variable job stress level: X 1 = 1 for high stress job 0 for otherwise X 2 = 1 for some stress job 0 for otherwise 1 for low stress job X 3 = 0 for otherwise Multiple Regression Exercises 17
18 5.-continued (c) In the SPSS data file jobstress, recode the variable jobtype into the first dummy defined variable in part (b), by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Data Entry and Manipulation, and reading the steps in the subsection titled Creating New Variables by Recoding Existing Variables; then repeat this for the other dummy variable(s) in part (b), after which the data first few lines of the data file should look as follows: (d) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that Step 7 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 5(d) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 18
19 5.-continued Multiple Regression Exercises 19
20 5.-continued Multiple Regression Exercises 20
21 5.-continued (e) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption For each of the quantitative predictors age and weight, the data points appear to be randomly distributed about the least squares line on the corresponding scatter plot. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals plotted against standardized predicted values looks reasonably uniform around the horizontal line. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions appear to be satisfied, we feel it is appropriate to proceed with the multiple regression analysis. Multiple Regression Exercises 21
22 5. - continued (f) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. Since the correlation matrix does not contain any correlation greater than 0.8 for any pair of independent variables, and tolerance > 0.10 (i.e., VIF < 10) for each independent variable, there is no indication that multicollinearity will be a problem. (g) Write the results of the f test in the ANOVA table for the regression to predict the diastolic blood pressure with all potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict diastolic blood pressure from age, weight, and indicator variables representing job stress level is statistically significant at the 0.05 level (f 4, 19 = , f 4, 19; 0.05 = 2.90, p < 0.001). We conclude that the overall regression is significant (i.e., at least one coefficient in the regression is different from zero). (h) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that Step 2 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 5(f) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 22
23 5. - continued Multiple Regression Exercises 23
24 5. - continued Multiple Regression Exercises 24
25 5. - continued (i) Write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. There were three steps in the stepwise multiple regression with a 0.05 significance level for entry and a 0.10 significance level for removal. In the first step, the variable weight was entered, which explained 52.8% of the variance in diastolic blood pressure. In the second step, the variable age was entered, which explained an additional 18.9% of the variance in diastolic blood pressure. No variable was removed. In the third step, the indicator variable for a high stress job was entered, which explained an additional 15.5% of the variance in diastolic blood pressure. No variable was removed. The three independent variables in this final step explained a total of 87.3% of the variance in diastolic blood pressure. (j) Write the estimated regression equation from the final step of the stepwise regression. ^ dbp = (weight) (age) (X 1 ) where 1 for high stress job X 1 = 0 for otherwise Multiple Regression Exercises 25
26 5. - continued (k) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one pound in weight, diastolic blood pressure increases on average by about For each increase of one year in age, diastolic blood pressure increases on average by about On average, diastolic blood pressure is about greater for employees with a high stress job than for employees with other jobs. The only indicator variable included in the model is the one for a high stress job, which suggests a statistically significant difference between the high stress job group and the other two groups combined but no statistically significant difference between the some stress job group and the low stress job group. For the high stress job group, X 1 = 1 so that the least squares regression equation is ^ dbp = (weight) (age) (1) = (weight) (age) For the low stress or some stress job group, X 1 = 0 so that the least squares regression equation is ^ dbp = (weight) (age) (0) = (weight) (age) Multiple Regression Exercises 26
27 5. - continued (l) Use the estimated regression equation from the final step of the stepwise regression to predict diastolic blood pressure for each of the two following employees: a 35-year old employee weighing 180 pounds and having a high stress job dbp = (180) (35) (1) = a 35-year old employee weighing 180 pounds and having a low stress or some stress job dbp = (180) (35) (0) = (m) From the Correlations table of the SPSS output, find the ordinary correlation between the dependent variable diastolic blood and the first independent variable entered into the model. The correlation between diastolic blood pressure and weight is (n) From the Excluded Variables table of the SPSS output, find the partial correlation between the dependent variable diastolic blood and the second independent variable entered into the model given the first independent variable entered into the model; compare this to the ordinary correlation between the dependent variable diastolic blood and the second independent variable entered into the model, which can be found from the Correlations table of the SPSS output. The partial correlation between diastolic blood pressure and age given weight is The ordinary correlation between diastolic blood pressure and age is Multiple Regression Exercises 27
28 6. In a study concerning the prediction of the wheat yield (bushels per acre), potential predictors are total rainfall (inches), average temperature (degrees Fahrenheit), and type of soil; there are three types of soil labeled A, B, and C. Data recorded on 24 employees treated as a random sample is displayed on the right. The data are from randomly selected observations over several seasons, and have been stored in the SPSS data file wheat_yield. A 0.05 significance level is chosen for hypothesis testing. (a) List the independent variables, and indicate whether each is quantitative or qualitative. total rainfall average temperature soil type quantitative quantitative qualtitative (b) Define all possible dummy variables which can be used to represent each qualitative independent variable. Any two of these indicator (dummy) variables is sufficient to represent the qualitative independent variable job stress level: 1 for soil type A 1 for soil type B 1 for soil type C X 1 = X 2 = X 3 = 0 otherwise 0 otherwise 0 otherwise (c) In the SPSS data file wheat_yield, recode the variable soil_typ into the first dummy defined variable in part (b), by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Data Entry and Manipulation, and reading the steps in the subsection titled Creating New Variables by Recoding Existing Variables; then repeat this for the other dummy variable(s) in part (b). Multiple Regression Exercises 28
29 6. - continued (d) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that Step 7 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 6(d) Verify that your SPSS output contains all of the following: two scatter plots, each displaying the least squares line for one of the quantitative predictors; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Multiple Regression Exercises 29
30 6. - continued In the Word document named Multiple_Regression_Result_Summaries (previously created), add a section titled Multiple Regression Exercises 6. In this section, create a subsection for each of parts (c), (d), and (e) which follow, and in each subsection created, write the summaries for the corresponding part. Print the page(s) and insert them immediately after this page. (e) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption the uniform variance (homoscedasticity) assumption the normality assumption (f) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. (g) Write the results of the f test in the ANOVA table for the regression to predict wheat yield with all potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. Multiple Regression Exercises 30
31 6. - continued (h) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that Step 2 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 6(f) Verify that your SPSS output contains all of the following: a table titled Variables Entered/Removed; a table titled Model Summary; a table titled ANOVA; a table titled Coefficients; a table titled Excluded Variables. (i) In the section titled Multiple Regression Exercises 6 of the Word document named Multiple_Regression_Result_Summaries (previously created), add a subsection for this part where you write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. Multiple Regression Exercises 31
32 6. - continued (j) Write the estimated regression equation from the final step of the stepwise regression. ^ yield = (rain) (temp) (X 1 ) (k) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one inch in total rainfall, wheat yield increases on average by about bushels per acre. For each increase of one degree Fahrenheit in temperature, wheat yield increases on average by about bushels per acre. On average, wheat yield is about bushels per acre smaller with soil type A than for other soil types. Multiple Regression Exercises 32
33 6. - continued (l) Use the estimated regression equation from the final step of the stepwise regression to predict wheat yield in each of the following scenarios: Total rainfall is 60 inches, average temperature is 65 degrees Fahrenheit, and soil type A is used (60) (65) = = bushels per acre Total rainfall is 60 inches, average temperature is 65 degrees Fahrenheit, and soil type B or C is used (60) (65) = = bushels per acre Multiple Regression Exercises 33
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationThis chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
More informationPart 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
More informationSPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
More informationBusiness Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
More informationPremaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
More informationRegression Analysis (Spring, 2000)
Regression Analysis (Spring, 2000) By Wonjae Purposes: a. Explaining the relationship between Y and X variables with a model (Explain a variable Y in terms of Xs) b. Estimating and testing the intensity
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationChapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
More informationCourse Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
More informationHomework 8 Solutions
Math 17, Section 2 Spring 2011 Homework 8 Solutions Assignment Chapter 7: 7.36, 7.40 Chapter 8: 8.14, 8.16, 8.28, 8.36 (a-d), 8.38, 8.62 Chapter 9: 9.4, 9.14 Chapter 7 7.36] a) A scatterplot is given below.
More informationch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
ch12 practice test 1) The null hypothesis that x and y are is H0: = 0. 1) 2) When a two-sided significance test about a population slope has a P-value below 0.05, the 95% confidence interval for A) does
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationModeration. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
More information1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ
STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More informationANALYSIS OF TREND CHAPTER 5
ANALYSIS OF TREND CHAPTER 5 ERSH 8310 Lecture 7 September 13, 2007 Today s Class Analysis of trends Using contrasts to do something a bit more practical. Linear trends. Quadratic trends. Trends in SPSS.
More informationSection 14 Simple Linear Regression: Introduction to Least Squares Regression
Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship
More informationABSORBENCY OF PAPER TOWELS
ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?
More informationDirections for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
More informationAnalytical Test Method Validation Report Template
Analytical Test Method Validation Report Template 1. Purpose The purpose of this Validation Summary Report is to summarize the finding of the validation of test method Determination of, following Validation
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) All but one of these statements contain a mistake. Which could be true? A) There is a correlation
More informationThere are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
More informationBetter decision making under uncertain conditions using Monte Carlo Simulation
IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics
More informationSimple Linear Regression, Scatterplots, and Bivariate Correlation
1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.
More informationModerator and Mediator Analysis
Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,
More informationMultiple Regression. Page 24
Multiple Regression Multiple regression is an extension of simple (bi-variate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)
More informationDESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics
More informationDEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
More informationAnalysis of Data. Organizing Data Files in SPSS. Descriptive Statistics
Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Between-subjects manipulations: variable to
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Ch. Correlation and Regression. Correlation Interpret Scatter Plots and Correlations MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression We are often interested in studying the relationship among variables to determine whether they are associated with one another. When we think that changes in a
More informationLinear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
More informationSPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,
More informationExample: Boats and Manatees
Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant
More informationIntroduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
More informationWeek TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480
1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500
More informationRegression step-by-step using Microsoft Excel
Step 1: Regression step-by-step using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression
More informationIntroduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
More informationOverview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS
Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical
More information16 : Demand Forecasting
16 : Demand Forecasting 1 Session Outline Demand Forecasting Subjective methods can be used only when past data is not available. When past data is available, it is advisable that firms should use statistical
More informationCorrelation and Regression Analysis: SPSS
Correlation and Regression Analysis: SPSS Bivariate Analysis: Cyberloafing Predicted from Personality and Age These days many employees, during work hours, spend time on the Internet doing personal things,
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationSPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011
SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011 Statistical techniques to be covered Explore relationships among variables Correlation Regression/Multiple regression Logistic regression Factor analysis
More information2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
More informationDoing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
More informationCourse Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.
SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed
More informationDiagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
More informationElementary Statistics Sample Exam #3
Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to
More informationSTAT 350 Practice Final Exam Solution (Spring 2015)
PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects
More informationMBA 611 STATISTICS AND QUANTITATIVE METHODS
MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain
More informationCausal Forecasting Models
CTL.SC1x -Supply Chain & Logistics Fundamentals Causal Forecasting Models MIT Center for Transportation & Logistics Causal Models Used when demand is correlated with some known and measurable environmental
More information5. Multiple regression
5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful
More informationDISCRIMINANT FUNCTION ANALYSIS (DA)
DISCRIMINANT FUNCTION ANALYSIS (DA) John Poulsen and Aaron French Key words: assumptions, further reading, computations, standardized coefficents, structure matrix, tests of signficance Introduction Discriminant
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationStatistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.
Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red. 1. How to display English messages from IBM SPSS Statistics
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationAnalysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk
Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationChapter 4 and 5 solutions
Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,
More informationChapter 23. Inferences for Regression
Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More informationBusiness Valuation Review
Business Valuation Review Regression Analysis in Valuation Engagements By: George B. Hawkins, ASA, CFA Introduction Business valuation is as much as art as it is science. Sage advice, however, quantitative
More informationIntroduction. Chapter 1. 1.1 Before you start. 1.1.1 Formulation
Chapter 1 Introduction 1.1 Before you start Statistics starts with a problem, continues with the collection of data, proceeds with the data analysis and finishes with conclusions. It is a common mistake
More informationData analysis process
Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis
More informationHow To Run Statistical Tests in Excel
How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting
More informationSPSS-Applications (Data Analysis)
CORTEX fellows training course, University of Zurich, October 2006 Slide 1 SPSS-Applications (Data Analysis) Dr. Jürg Schwarz, juerg.schwarz@schwarzpartners.ch Program 19. October 2006: Morning Lessons
More informationActivity 8 Drawing Isobars Level 2 http://www.uni.edu/storm/activities/level2/index.shtml
Activity 8 Drawing Isobars Level 2 http://www.uni.edu/storm/activities/level2/index.shtml Objectives: 1. Students will be able to define and draw isobars to analyze air pressure variations. 2. Students
More informationIntroduction to Statistics and Quantitative Research Methods
Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.
More informationChapter 7 Factor Analysis SPSS
Chapter 7 Factor Analysis SPSS Factor analysis attempts to identify underlying variables, or factors, that explain the pattern of correlations within a set of observed variables. Factor analysis is often
More informationUsing R for Linear Regression
Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional
More informationStepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection
Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.
More informationMultiple Regression Using SPSS
Multiple Regression Using SPSS The following sections have been adapted from Field (2009) Chapter 7. These sections have been edited down considerably and I suggest (especially if you re confused) that
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly
More informationSilvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com
SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING
More informationJanuary 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
More informationHomework 11. Part 1. Name: Score: / null
Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is
More informationCHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression
Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the
More informationRegression III: Advanced Methods
Lecture 16: Generalized Additive Models Regression III: Advanced Methods Bill Jacoby Michigan State University http://polisci.msu.edu/jacoby/icpsr/regress3 Goals of the Lecture Introduce Additive Models
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
More informationCorrelational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
More information2013 MBA Jump Start Program. Statistics Module Part 3
2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just
More informationMultiple Linear Regression
Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is
More informationUnit 9 Describing Relationships in Scatter Plots and Line Graphs
Unit 9 Describing Relationships in Scatter Plots and Line Graphs Objectives: To construct and interpret a scatter plot or line graph for two quantitative variables To recognize linear relationships, non-linear
More informationModule 5: Statistical Analysis
Module 5: Statistical Analysis To answer more complex questions using your data, or in statistical terms, to test your hypothesis, you need to use more advanced statistical tests. This module reviews the
More informationRegression and Correlation
Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis
More informationTHE USE OF REGRESSION ANALYSIS IN MARKETING RESEARCH
THE USE OF REGRESSION ANALYSIS IN MARKETING RESEARCH DUMITRESCU Luigi Lucian Blaga University of Sibiu, Romania STANCIU Oana Lucian Blaga University of Sibiu, Romania TICHINDELEAN Mihai Lucian Blaga University
More informationChapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
More informationLean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY
TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY Before we begin: Turn on the sound on your computer. There is audio to accompany this presentation. Audio will accompany most of the online
More informationScatter Plot, Correlation, and Regression on the TI-83/84
Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page
More informationFoundation of Quantitative Data Analysis
Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1
More informationConcepts of Experimental Design
Design Institute for Six Sigma A SAS White Paper Table of Contents Introduction...1 Basic Concepts... 1 Designing an Experiment... 2 Write Down Research Problem and Questions... 2 Define Population...
More informationIllustration (and the use of HLM)
Illustration (and the use of HLM) Chapter 4 1 Measurement Incorporated HLM Workshop The Illustration Data Now we cover the example. In doing so we does the use of the software HLM. In addition, we will
More information