Multiple Regression Exercises

Size: px
Start display at page:

Download "Multiple Regression Exercises"

Transcription

1 Multiple Regression Exercises 1. In a study to predict the sale price of a residential property (dollars), data is taken on 20 randomly selected properties. The potential predictors in the study are appraised land value (dollars), appraised value of improvements (dollars), and area of property living space (square feet), and the data is stored in the SPSS data file realestate (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Does the data appear to be observational or experimental? Since the land value, improvement value, and area are all random, the data is observational. (b) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that since there are no dummy variables in this multiple regression model, Step 7 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 1(b) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 1

2 1. - continued Multiple Regression Exercises 2

3 1. - continued Multiple Regression Exercises 3

4 1. - continued (c) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption For each of the predictors land value, improvements, and area, the data points appear to be randomly distributed about the least squares line on the corresponding scatter plot. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals plotted against standardized predicted values looks reasonably uniform around the horizontal line. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions appear to be satisfied, we feel it is appropriate to proceed with the multiple regression analysis. Multiple Regression Exercises 4

5 1. - continued (d) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. Since the correlation matrix does not contain any correlation greater than 0.8 for any pair of independent variables, and tolerance > 0.10 (i.e., VIF < 10) for each independent variable, there is no indication that multicollinearity will be a problem. (e) Write the results of the f test in the ANOVA table for the regression to predict the sale price with all three potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict sale price from land value, improvements, and area is statistically significant at the 0.05 level (f 3, 16 = , f 3, 16; 0.05 = 3.24, p < 0.001). We conclude that the overall regression is significant (i.e., at least one coefficient in the regression is different from zero). (f) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that since there are no dummy variables in this multiple regression model, Step 2 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 1(f) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 5

6 1. - continued Multiple Regression Exercises 6

7 1. - continued (g) Write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. There were two steps in the stepwise multiple regression with a 0.05 significance level for entry and a 0.10 significance level for removal. In the first step, the variable appraised value of improvements was entered, which explained 83.8% of the variance in sale price. In the second step, the variable area of property living space was entered, which explained an additional 4.3% of the variance in sale price. No variable was removed. The two independent variables in this final step explained a total of 88.1% of the variance in sale price. Multiple Regression Exercises 7

8 1. - continued (h) Write the estimated regression equation from the final step of the stepwise regression, and use this regression equation to predict the sale price of a residential property where the appraised land value is $8000, the appraised value of improvements is $20,000, and area of property living space is 1200 square feet. ^ sale_prc = (impr_val) (area) (20000) (1200) = $38, (i) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one dollar in appraised value of improvements, the sale price increases on average by about $0.96. For each increase of one square foot in area of property living space, the sale price increases on average by about $ Multiple Regression Exercises 8

9 1. - continued (j) From the Correlations table of the SPSS output, find the ordinary correlation between the dependent variable sale price and the first independent variable entered into the model. The correlation between sale price and appraised value of improvements is (k) From the Excluded Variables table of the SPSS output, find the partial correlation between the dependent variable sale price and the second independent variable entered into the model given the first independent variable entered into the model; compare this to the ordinary correlation between the dependent variable sale price and the second independent variable entered into the model, which can be found from the Correlations table of the SPSS output. The partial correlation between sale price and area of property living space given appraised value of improvements is The ordinary correlation between sale price and area of property living space is Multiple Regression Exercises 9

10 2. In a study to predict the drying time (hours) for an outdoor house paint, data is taken on 22 house painting jobs. The potential predictors in the study are temperature (degrees Fahrenheit), humidity (percent), wind velocity (miles per hour), and barometric pressure, and the data is stored in the SPSS data file paint (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Does the data appear to be observational or experimental? Since the temperature, humidity, wind velocity, and barometric pressure are all random, the data is observational. (b) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that since there are no dummy variables in this multiple regression model, Step 7 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 2(b) Verify that your SPSS output contains all of the following: four scatter plots, each displaying the least squares line for one of the quantitative predictors; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Multiple Regression Exercises 10

11 2. - continued Create a Word document named Multiple_Regression_Result_Summaries with a section titled Multiple Regression Exercises 2. In this section, create a subsection for each of parts (c), (d), and (e) which follow, and in each subsection created, write the summaries for the corresponding part. Print the page(s) and insert them immediately after this page. (c) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption the uniform variance (homoscedasticity) assumption the normality assumption (d) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. (e) Write the results of the f test in the ANOVA table for the regression to predict the drying time with all four potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. Multiple Regression Exercises 11

12 2. - continued (f) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that since there are no dummy variables in this multiple regression model, Step 2 can be skipped. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 2(f) Verify that your SPSS output contains all of the following: a table titled Variables Entered/Removed; a table titled Model Summary; a table titled ANOVA; a table titled Coefficients; a table titled Excluded Variables. (g) In the section titled Multiple Regression Exercises 2 of the Word document named Multiple_Regression_Result_Summaries (previously created), add a subsection for this part where you write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. Multiple Regression Exercises 12

13 2. - continued (h) Write the estimated regression equation from the final step of the stepwise regression, and use this regression equation to predict drying time with a temperature of 75 degrees Fahrenheit, a relative humidity of 55%, a wind velocity of 15 miles per hour, and a barometric pressure of ^ drying time = (temperature) 0.628(wind velocity) drying time = (75) 0.628(15) = hours (i) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one degree Fahrenheit in temperature, the drying time decreases on average by about hours. For each increase of one mile per hour in wind velocity, the drying time decreases on average by about hours. Multiple Regression Exercises 13

14 2. - continued (j) From the Correlations table of the SPSS output, find the ordinary correlation between the dependent variable paint drying time and the first independent variable entered into the model. The correlation between paint drying time and temperature is (k) From the Excluded Variables table of the SPSS output, find the partial correlation between the dependent variable paint drying time and the second independent variable entered into the model given the first independent variable entered into the model; compare this to the ordinary correlation between the dependent variable paint drying time and the second independent variable entered into the model, which can be found from the Correlations table of the SPSS output. The partial correlation between paint drying time and wind velocity given temperature is The ordinary correlation between paint drying time and wind velocity is Multiple Regression Exercises 14

15 3. Recall that in Basic Statistics Exercise #34 the lifetime of light bulbs was being studied for three brands named Brite, Softlite, and Nodark (i.e., the relationship between brand and lifetime). A 0.05 significance level was used with a one-way ANOVA to see if there is any evidence that mean lifetime is not the same for the brands Brite, Softlite, and Nodark. Light bulbs were randomly selected from each brand, and the lifetimes in hours were recorded as follows: Brite Softlite Nodark (a) Define dummy variables necessary to represent the qualitative variable brand of light bulb. (b) Create an SPSS data file which consists of a variable lifetime, for the lifetimes recorded in the data, and the dummy variables necessary to represent the qualitative variable brand of light bulb. Use SPSS to obtain the ANOVA table for the multiple regression to predict lifetime from brand of light bulb, and compare this ANOVA table to the one-way ANOVA table in Basic Statistics Exercise #34. Multiple Regression Exercises 15

16 4. Recall that in Basic Statistics Exercise #35 the mean length of fish was being studied for North Lake, Blue Lake, Harvey Lake (i.e., the relationship between Lake and length of fish) is to be studied. A 0.05 significance level was used with a one-way ANOVA to see if there is any evidence that mean length of fish is not the same for North Lake, Blue Lake, and Harvey Lake. Fish were randomly selected from each lake, and the lengths in inches were recorded as follows: North Blue Harvey (a) Define dummy variables necessary to represent the qualitative variable Lake. (b) Create an SPSS data file which consists of a variable length, for the lengths recorded in the data, and the dummy variables necessary to represent the qualitative variable Lake. Use SPSS to obtain the ANOVA table for the multiple regression to predict length from Lake, and compare this ANOVA table to the one-way ANOVA table in Basic Statistics Exercise #35. Multiple Regression Exercises 16

17 5. A company conducts a study to see how diastolic blood pressure is influenced by an employee s age, weight, and job stress level classified as high stress, some stress, and low stress. Data recorded on 24 employees treated as a random sample has been stored in the SPSS data file jobstress. A 0.05 significance level is chosen for hypothesis testing. (a) List the independent variables, and indicate whether each is quantitative or qualitative. age weight job stress level quantitative quantitative qualitative (b) Define all possible dummy variables which can be used to represent each qualitative independent variable. Any two of these indicator (dummy) variables is sufficient to represent the qualitative independent variable job stress level: X 1 = 1 for high stress job 0 for otherwise X 2 = 1 for some stress job 0 for otherwise 1 for low stress job X 3 = 0 for otherwise Multiple Regression Exercises 17

18 5.-continued (c) In the SPSS data file jobstress, recode the variable jobtype into the first dummy defined variable in part (b), by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Data Entry and Manipulation, and reading the steps in the subsection titled Creating New Variables by Recoding Existing Variables; then repeat this for the other dummy variable(s) in part (b), after which the data first few lines of the data file should look as follows: (d) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that Step 7 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 5(d) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 18

19 5.-continued Multiple Regression Exercises 19

20 5.-continued Multiple Regression Exercises 20

21 5.-continued (e) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption For each of the quantitative predictors age and weight, the data points appear to be randomly distributed about the least squares line on the corresponding scatter plot. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals plotted against standardized predicted values looks reasonably uniform around the horizontal line. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions appear to be satisfied, we feel it is appropriate to proceed with the multiple regression analysis. Multiple Regression Exercises 21

22 5. - continued (f) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. Since the correlation matrix does not contain any correlation greater than 0.8 for any pair of independent variables, and tolerance > 0.10 (i.e., VIF < 10) for each independent variable, there is no indication that multicollinearity will be a problem. (g) Write the results of the f test in the ANOVA table for the regression to predict the diastolic blood pressure with all potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict diastolic blood pressure from age, weight, and indicator variables representing job stress level is statistically significant at the 0.05 level (f 4, 19 = , f 4, 19; 0.05 = 2.90, p < 0.001). We conclude that the overall regression is significant (i.e., at least one coefficient in the regression is different from zero). (h) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that Step 2 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 5(f) Verify that your SPSS output contains all of the following: Multiple Regression Exercises 22

23 5. - continued Multiple Regression Exercises 23

24 5. - continued Multiple Regression Exercises 24

25 5. - continued (i) Write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. There were three steps in the stepwise multiple regression with a 0.05 significance level for entry and a 0.10 significance level for removal. In the first step, the variable weight was entered, which explained 52.8% of the variance in diastolic blood pressure. In the second step, the variable age was entered, which explained an additional 18.9% of the variance in diastolic blood pressure. No variable was removed. In the third step, the indicator variable for a high stress job was entered, which explained an additional 15.5% of the variance in diastolic blood pressure. No variable was removed. The three independent variables in this final step explained a total of 87.3% of the variance in diastolic blood pressure. (j) Write the estimated regression equation from the final step of the stepwise regression. ^ dbp = (weight) (age) (X 1 ) where 1 for high stress job X 1 = 0 for otherwise Multiple Regression Exercises 25

26 5. - continued (k) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one pound in weight, diastolic blood pressure increases on average by about For each increase of one year in age, diastolic blood pressure increases on average by about On average, diastolic blood pressure is about greater for employees with a high stress job than for employees with other jobs. The only indicator variable included in the model is the one for a high stress job, which suggests a statistically significant difference between the high stress job group and the other two groups combined but no statistically significant difference between the some stress job group and the low stress job group. For the high stress job group, X 1 = 1 so that the least squares regression equation is ^ dbp = (weight) (age) (1) = (weight) (age) For the low stress or some stress job group, X 1 = 0 so that the least squares regression equation is ^ dbp = (weight) (age) (0) = (weight) (age) Multiple Regression Exercises 26

27 5. - continued (l) Use the estimated regression equation from the final step of the stepwise regression to predict diastolic blood pressure for each of the two following employees: a 35-year old employee weighing 180 pounds and having a high stress job dbp = (180) (35) (1) = a 35-year old employee weighing 180 pounds and having a low stress or some stress job dbp = (180) (35) (0) = (m) From the Correlations table of the SPSS output, find the ordinary correlation between the dependent variable diastolic blood and the first independent variable entered into the model. The correlation between diastolic blood pressure and weight is (n) From the Excluded Variables table of the SPSS output, find the partial correlation between the dependent variable diastolic blood and the second independent variable entered into the model given the first independent variable entered into the model; compare this to the ordinary correlation between the dependent variable diastolic blood and the second independent variable entered into the model, which can be found from the Correlations table of the SPSS output. The partial correlation between diastolic blood pressure and age given weight is The ordinary correlation between diastolic blood pressure and age is Multiple Regression Exercises 27

28 6. In a study concerning the prediction of the wheat yield (bushels per acre), potential predictors are total rainfall (inches), average temperature (degrees Fahrenheit), and type of soil; there are three types of soil labeled A, B, and C. Data recorded on 24 employees treated as a random sample is displayed on the right. The data are from randomly selected observations over several seasons, and have been stored in the SPSS data file wheat_yield. A 0.05 significance level is chosen for hypothesis testing. (a) List the independent variables, and indicate whether each is quantitative or qualitative. total rainfall average temperature soil type quantitative quantitative qualtitative (b) Define all possible dummy variables which can be used to represent each qualitative independent variable. Any two of these indicator (dummy) variables is sufficient to represent the qualitative independent variable job stress level: 1 for soil type A 1 for soil type B 1 for soil type C X 1 = X 2 = X 3 = 0 otherwise 0 otherwise 0 otherwise (c) In the SPSS data file wheat_yield, recode the variable soil_typ into the first dummy defined variable in part (b), by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Data Entry and Manipulation, and reading the steps in the subsection titled Creating New Variables by Recoding Existing Variables; then repeat this for the other dummy variable(s) in part (b). Multiple Regression Exercises 28

29 6. - continued (d) Use SPSS to do the calculations needed for a multiple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Multiple Linear Regression with Checks for Multicollinearity and of Linearity, Homoscedasticity, and Normality Assumptions; note that Step 7 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 6(d) Verify that your SPSS output contains all of the following: two scatter plots, each displaying the least squares line for one of the quantitative predictors; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Multiple Regression Exercises 29

30 6. - continued In the Word document named Multiple_Regression_Result_Summaries (previously created), add a section titled Multiple Regression Exercises 6. In this section, create a subsection for each of parts (c), (d), and (e) which follow, and in each subsection created, write the summaries for the corresponding part. Print the page(s) and insert them immediately after this page. (e) Use the SPSS output to make a statement concerning whether each of the following assumptions in a multiple linear regression is satisfied: the linearity assumption the uniform variance (homoscedasticity) assumption the normality assumption (f) Use the SPSS output to make a statement concerning whether significant multicollinearity is likely to be present in the multiple regression. (g) Write the results of the f test in the ANOVA table for the regression to predict wheat yield with all potential predictors in the model; write these results in a format suitable for a journal article to be submitted for publication. Multiple Regression Exercises 30

31 6. - continued (h) It is decided to use stepwise regression to select the most important predictors to include in the model. Use SPSS to do the calculations needed for a stepwise regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Stepwise Regression (or Related Procedure) to Build a Model; note that Step 2 has already been completed in part (c). Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Multiple Regression Exercise 6(f) Verify that your SPSS output contains all of the following: a table titled Variables Entered/Removed; a table titled Model Summary; a table titled ANOVA; a table titled Coefficients; a table titled Excluded Variables. (i) In the section titled Multiple Regression Exercises 6 of the Word document named Multiple_Regression_Result_Summaries (previously created), add a subsection for this part where you write the results of the stepwise regression in a format suitable for a journal article to be submitted for publication; include information about the significance level used and changes in R 2. Multiple Regression Exercises 31

32 6. - continued (j) Write the estimated regression equation from the final step of the stepwise regression. ^ yield = (rain) (temp) (X 1 ) (k) For each of the estimated regression coefficients in the estimated regression equation from the final step of the stepwise multiple regression, write a one sentence interpretation describing what the coefficient estimates. For each increase of one inch in total rainfall, wheat yield increases on average by about bushels per acre. For each increase of one degree Fahrenheit in temperature, wheat yield increases on average by about bushels per acre. On average, wheat yield is about bushels per acre smaller with soil type A than for other soil types. Multiple Regression Exercises 32

33 6. - continued (l) Use the estimated regression equation from the final step of the stepwise regression to predict wheat yield in each of the following scenarios: Total rainfall is 60 inches, average temperature is 65 degrees Fahrenheit, and soil type A is used (60) (65) = = bushels per acre Total rainfall is 60 inches, average temperature is 65 degrees Fahrenheit, and soil type B or C is used (60) (65) = = bushels per acre Multiple Regression Exercises 33

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information

Regression Analysis (Spring, 2000)

Regression Analysis (Spring, 2000) Regression Analysis (Spring, 2000) By Wonjae Purposes: a. Explaining the relationship between Y and X variables with a model (Explain a variable Y in terms of Xs) b. Estimating and testing the intensity

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

Homework 8 Solutions

Homework 8 Solutions Math 17, Section 2 Spring 2011 Homework 8 Solutions Assignment Chapter 7: 7.36, 7.40 Chapter 8: 8.14, 8.16, 8.28, 8.36 (a-d), 8.38, 8.62 Chapter 9: 9.4, 9.14 Chapter 7 7.36] a) A scatterplot is given below.

More information

ch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

ch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. ch12 practice test 1) The null hypothesis that x and y are is H0: = 0. 1) 2) When a two-sided significance test about a population slope has a P-value below 0.05, the 95% confidence interval for A) does

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Moderation. Moderation

Moderation. Moderation Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation

More information

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

ANALYSIS OF TREND CHAPTER 5

ANALYSIS OF TREND CHAPTER 5 ANALYSIS OF TREND CHAPTER 5 ERSH 8310 Lecture 7 September 13, 2007 Today s Class Analysis of trends Using contrasts to do something a bit more practical. Linear trends. Quadratic trends. Trends in SPSS.

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

ABSORBENCY OF PAPER TOWELS

ABSORBENCY OF PAPER TOWELS ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?

More information

Directions for using SPSS

Directions for using SPSS Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Analytical Test Method Validation Report Template

Analytical Test Method Validation Report Template Analytical Test Method Validation Report Template 1. Purpose The purpose of this Validation Summary Report is to summarize the finding of the validation of test method Determination of, following Validation

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) All but one of these statements contain a mistake. Which could be true? A) There is a correlation

More information

There are six different windows that can be opened when using SPSS. The following will give a description of each of them.

There are six different windows that can be opened when using SPSS. The following will give a description of each of them. SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet

More information

Better decision making under uncertain conditions using Monte Carlo Simulation

Better decision making under uncertain conditions using Monte Carlo Simulation IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics

More information

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Simple Linear Regression, Scatterplots, and Bivariate Correlation 1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.

More information

Moderator and Mediator Analysis

Moderator and Mediator Analysis Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,

More information

Multiple Regression. Page 24

Multiple Regression. Page 24 Multiple Regression Multiple regression is an extension of simple (bi-variate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)

More information

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Between-subjects manipulations: variable to

More information

Simple Predictive Analytics Curtis Seare

Simple Predictive Analytics Curtis Seare Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Ch. Correlation and Regression. Correlation Interpret Scatter Plots and Correlations MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression We are often interested in studying the relationship among variables to determine whether they are associated with one another. When we think that changes in a

More information

Linear Models in STATA and ANOVA

Linear Models in STATA and ANOVA Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480 1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500

More information

Regression step-by-step using Microsoft Excel

Regression step-by-step using Microsoft Excel Step 1: Regression step-by-step using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS About Omega Statistics Private practice consultancy based in Southern California, Medical and Clinical

More information

16 : Demand Forecasting

16 : Demand Forecasting 16 : Demand Forecasting 1 Session Outline Demand Forecasting Subjective methods can be used only when past data is not available. When past data is available, it is advisable that firms should use statistical

More information

Correlation and Regression Analysis: SPSS

Correlation and Regression Analysis: SPSS Correlation and Regression Analysis: SPSS Bivariate Analysis: Cyberloafing Predicted from Personality and Age These days many employees, during work hours, spend time on the Internet doing personal things,

More information

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011 SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011 Statistical techniques to be covered Explore relationships among variables Correlation Regression/Multiple regression Logistic regression Factor analysis

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices: Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:

More information

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS. SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Causal Forecasting Models

Causal Forecasting Models CTL.SC1x -Supply Chain & Logistics Fundamentals Causal Forecasting Models MIT Center for Transportation & Logistics Causal Models Used when demand is correlated with some known and measurable environmental

More information

5. Multiple regression

5. Multiple regression 5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful

More information

DISCRIMINANT FUNCTION ANALYSIS (DA)

DISCRIMINANT FUNCTION ANALYSIS (DA) DISCRIMINANT FUNCTION ANALYSIS (DA) John Poulsen and Aaron French Key words: assumptions, further reading, computations, standardized coefficents, structure matrix, tests of signficance Introduction Discriminant

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red.

Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red. Statistical Analysis Using SPSS for Windows Getting Started (Ver. 2014/11/6) The numbers of figures in the SPSS_screenshot.pptx are shown in red. 1. How to display English messages from IBM SPSS Statistics

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

Chapter 4 and 5 solutions

Chapter 4 and 5 solutions Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

Business Valuation Review

Business Valuation Review Business Valuation Review Regression Analysis in Valuation Engagements By: George B. Hawkins, ASA, CFA Introduction Business valuation is as much as art as it is science. Sage advice, however, quantitative

More information

Introduction. Chapter 1. 1.1 Before you start. 1.1.1 Formulation

Introduction. Chapter 1. 1.1 Before you start. 1.1.1 Formulation Chapter 1 Introduction 1.1 Before you start Statistics starts with a problem, continues with the collection of data, proceeds with the data analysis and finishes with conclusions. It is a common mistake

More information

Data analysis process

Data analysis process Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

SPSS-Applications (Data Analysis)

SPSS-Applications (Data Analysis) CORTEX fellows training course, University of Zurich, October 2006 Slide 1 SPSS-Applications (Data Analysis) Dr. Jürg Schwarz, juerg.schwarz@schwarzpartners.ch Program 19. October 2006: Morning Lessons

More information

Activity 8 Drawing Isobars Level 2 http://www.uni.edu/storm/activities/level2/index.shtml

Activity 8 Drawing Isobars Level 2 http://www.uni.edu/storm/activities/level2/index.shtml Activity 8 Drawing Isobars Level 2 http://www.uni.edu/storm/activities/level2/index.shtml Objectives: 1. Students will be able to define and draw isobars to analyze air pressure variations. 2. Students

More information

Introduction to Statistics and Quantitative Research Methods

Introduction to Statistics and Quantitative Research Methods Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.

More information

Chapter 7 Factor Analysis SPSS

Chapter 7 Factor Analysis SPSS Chapter 7 Factor Analysis SPSS Factor analysis attempts to identify underlying variables, or factors, that explain the pattern of correlations within a set of observed variables. Factor analysis is often

More information

Using R for Linear Regression

Using R for Linear Regression Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional

More information

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.

More information

Multiple Regression Using SPSS

Multiple Regression Using SPSS Multiple Regression Using SPSS The following sections have been adapted from Field (2009) Chapter 7. These sections have been edited down considerably and I suggest (especially if you re confused) that

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

More information

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING

More information

January 26, 2009 The Faculty Center for Teaching and Learning

January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i

More information

Homework 11. Part 1. Name: Score: / null

Homework 11. Part 1. Name: Score: / null Name: Score: / Homework 11 Part 1 null 1 For which of the following correlations would the data points be clustered most closely around a straight line? A. r = 0.50 B. r = -0.80 C. r = 0.10 D. There is

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

Regression III: Advanced Methods

Regression III: Advanced Methods Lecture 16: Generalized Additive Models Regression III: Advanced Methods Bill Jacoby Michigan State University http://polisci.msu.edu/jacoby/icpsr/regress3 Goals of the Lecture Introduce Additive Models

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

Unit 9 Describing Relationships in Scatter Plots and Line Graphs Unit 9 Describing Relationships in Scatter Plots and Line Graphs Objectives: To construct and interpret a scatter plot or line graph for two quantitative variables To recognize linear relationships, non-linear

More information

Module 5: Statistical Analysis

Module 5: Statistical Analysis Module 5: Statistical Analysis To answer more complex questions using your data, or in statistical terms, to test your hypothesis, you need to use more advanced statistical tests. This module reviews the

More information

Regression and Correlation

Regression and Correlation Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis

More information

THE USE OF REGRESSION ANALYSIS IN MARKETING RESEARCH

THE USE OF REGRESSION ANALYSIS IN MARKETING RESEARCH THE USE OF REGRESSION ANALYSIS IN MARKETING RESEARCH DUMITRESCU Luigi Lucian Blaga University of Sibiu, Romania STANCIU Oana Lucian Blaga University of Sibiu, Romania TICHINDELEAN Mihai Lucian Blaga University

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY Before we begin: Turn on the sound on your computer. There is audio to accompany this presentation. Audio will accompany most of the online

More information

Scatter Plot, Correlation, and Regression on the TI-83/84

Scatter Plot, Correlation, and Regression on the TI-83/84 Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page

More information

Foundation of Quantitative Data Analysis

Foundation of Quantitative Data Analysis Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1

More information

Concepts of Experimental Design

Concepts of Experimental Design Design Institute for Six Sigma A SAS White Paper Table of Contents Introduction...1 Basic Concepts... 1 Designing an Experiment... 2 Write Down Research Problem and Questions... 2 Define Population...

More information

Illustration (and the use of HLM)

Illustration (and the use of HLM) Illustration (and the use of HLM) Chapter 4 1 Measurement Incorporated HLM Workshop The Illustration Data Now we cover the example. In doing so we does the use of the software HLM. In addition, we will

More information