Chapter 13 In this chapter, you learn: Multiple Regression Model with k Independent Variables:

Size: px
Start display at page:

Download "Chapter 13 In this chapter, you learn: Multiple Regression Model with k Independent Variables:"

Transcription

1 Chapter 4 4- Business Statistics: A First Course Fifth Edition Chapter 3 Multiple Regression Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc. Chap 3- Learning Objectives In this chapter, you learn: How to develop a multiple regression model How to interpret the regression coefficients How to determine which independent variables to include in the regression model How to determine which independent variables are more important in predicting a dependent variable How to use categorical variables in a regression model Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3- The Multiple Regression Model Idea: Examine the linear relationship between dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables: Y-intercept Population slopes Random Error Y i β β X i β X i β k X ki ε i Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

2 Chapter 4 4- Multiple Regression Equation The coefficients of the multiple regression model are estimated using sample data Multiple regression equation with k independent variables: Estimated (or predicted) value of Y Yˆ b i Estimated intercept b X i Estimated slope coefficients b Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-4 X i In this chapter we will use Excel or Minitab to obtain the regression slope coefficients and other regression summary measures. b k X ki Multiple Regression Equation Two variable model Y Ŷ b bx bx X X Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-5 Example: Independent Variables A distributor of frozen dessert pies wants to evaluate factors thought to influence demand Dependent variable: Pie sales (units per week) Independent variables: Price (in $) Advertising ($ s) Data are collected for 5 weeks Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

3 Chapter Week Pie Sales Price ($) Advertising ($s) Pie Sales Example Multiple regression equation: Sales = b + b (Price) + b (Advertising) Business Statistics: A First Course, 5e Prentice-Hall, 9 Inc.. Chap 3-7 Excel Multiple Regression Output Regression Statistics Multiple R.73 R Square.548 Adjusted R Square.447 Standard Error Observations 5 Sales (Price) 74.3(Advertising) ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-8 Minitab Multiple Regression Output Sales (Price) 74.3(Advertising) The regression equation is Sales = Price Advertising Predictor Coef SE Coef T P Constant Price Advertising S = R-Sq = 5.% R-Sq(adj) = 44.% Analysis of Variance Source DF SS MS F P Regression Residual Error Total Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

4 Chapter The Multiple Regression Equation Sales (Price) 74.3(Advertising) where Sales is in number of pies per week Price is in $ Advertising is in $ s. b = : sales will decrease, on average, by pies per week for each $ increase in selling price, net of the effects of changes due to advertising b = 74.3: sales will increase, on average, by 74.3 pies per week for each $ increase in advertising, net of the effects of changes due to price Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3- Using The Equation to Make Predictions Predict sales for a week in which the selling price is $5.5 and advertising is $35: Sales (Price) 74.3(Advertising) (5.5) 74.3(3.5) 48.6 Predicted sales is 48.6 pies Note that Advertising is in $ s, so $35 means that X = 3.5 Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3- Predictions in Excel using PHStat PHStat regression multiple regression Check the confidence and prediction interval estimates box Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-6 Prentice Hall, Inc.

5 Chapter Predictions in PHStat Input values Predicted Y value Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-3 < Confidence interval for the mean value of Y, given these X values Prediction interval for an individual Y value, given these X values Predictions in Minitab Predicted Values for New Observations Confidence interval for the mean value of Y, given these X values Predicted Y ˆ value New Obs Fit SE Fit 95% CI 95% PI (39., 466.) (38.6, 538.6) Values of Predictors for New Observations New Obs Price Advertising Input values Prediction interval for an individual Y value, given these X values Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-4 Coefficient of Multiple Determination Reports the proportion of total variation in Y explained by all X variables taken together r SSR regression sum of squares SST total sum of squares Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

6 Chapter Multiple Coefficient of Determination In Excel Regression Statistics Multiple R.73 R Square.548 Adjusted R Square.447 Standard Error Observations 5 r SSR SST % of the variation in pie sales is explained by the variation in price and advertising ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-6 Multiple Coefficient of Determination In Minitab The regression equation is Sales = Price Advertising Predictor Coef SE Coef T P Constant SSR 946. r.548 Price SST Advertising S = R-Sq = 5.% R-Sq(adj) = 44.% Analysis of Variance Source DF SS MS F P Regression Residual Error Total % of the variation in pie sales is explained by the variation in price and advertising Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-7 Adjusted r r never decreases when a new X variable is added to the model This can be a disadvantage when comparing models What is the net effect of adding a new variable? We lose a degree of freedom when a new X variable is added Did the new X variable add enough explanatory power to offset the loss of one degree of freedom? Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

7 Chapter Adjusted r Shows the proportion of variation in Y explained by all X variables adjusted for the number of X variables used and sample size n r adj ( r ) n k (where n = sample size, k = number of independent variables) Penalize excessive use of unimportant independent variables Smaller than r Useful in comparing among models Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-9 Adjusted r in Excel Regression Statistics Multiple R.73 R Square.548 Adjusted R Square.447 Standard Error Observations 5 r adj % of the variation in pie sales is explained by the variation in price and advertising, taking into account the sample size and number of independent d variables ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3- Adjusted r in Minitab The regression equation is Sales = Price Advertising Predictor Coef SE Coef T P Constant Price Advertising S = R-Sq = 5.% R-Sq(adj) = 44.% Analysis of Variance Source DF SS MS F P Regression Residual Error Total r adj % of the variation in pie sales is explained by the variation in price and advertising, taking into account the sample size and number of independent variables Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-6 Prentice Hall, Inc.

8 Chapter Is the Model Significant? F Test for Overall Significance of the Model Shows if there is a linear relationship between all of the X variables considered together and Y Use F-test statistic Hypotheses: H : β = β = = β k = (no linear relationship) H : at least one β i (at least one independent variable affects Y) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3- F Test for Overall Significance Test statistic: SSR MSR F k STAT MSE SSE n k where F STAT has numerator d.f. = k and denominator d.f. = (n k - ) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-3 F Test for Overall Significance In Excel Regression Statistics Multiple R.73 R Square.548 Adjusted R Square.447 Standard Error Observations 5 MSR 473. F STAT MSE 5.8 With and degrees P-value for of freedom the F Test ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

9 Chapter F Test for Overall Significance In Minitab The regression equation is Sales = Price Advertising Predictor Coef SE Coef T P Constant Price Advertising S = R-Sq = 5.% R-Sq(adj) = 44.% MSR 473. F STAT MSE 5.8 Analysis of Variance Source DF SS MS F P Regression Residual Error Total With and degrees of freedom P-value for the F Test Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-5 F Test for Overall Significance H : β = β = H : β and β not both zero =.5 df = df = Do not reject H Critical Value: F.5 = =.5 Reject H F.5 = F Test Statistic: MSR F STAT MSE Decision: Since F STAT test statistic is in the rejection region (pvalue <.5), reject H Conclusion: There is evidence that at least one independent variable affects Y Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-6 Residuals in Multiple Regression Two variable model Y Residual = e i = (Y i Y i ) < Y i < Y i Sample observation Ŷ b bx bx x i X X x i The best fit equation is found by minimizing the sum of squared errors, e Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

10 Chapter 4 4- Multiple Regression Assumptions Errors (residuals) from the regression model: Assumptions: < e i = (Y i Y i ) Independence of errors Error values are statistically independent Normality of errors Error values are normally distributed for any given set of X values Equal Variance (also called Homoscedasticity) The probability distribution of the errors has constant variance Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-8 Residual Plots Used in Multiple Regression These residual plots are used in multiple regression: < Residuals vs. Y i Residuals vs. X i Residuals vs. X i Residuals vs. time (if time series data) Use the residual plots to check for violations of regression assumptions Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-9 Are Individual Variables Significant? Use t tests of individual variable slopes Shows if there is a linear relationship between the variable X j and Y holding constant the effects of other X variables Hypotheses: H : β j = (no linear relationship) H : β j (linear relationship does exist between X j and Y) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

11 Chapter 4 4- Are Individual Variables Significant? H : β j = (no linear relationship) H : β j (linear relationship does exist between X j and Y) Test Statistic: t STAT b j S b j (df = n k ) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-3 Regression Statistics Multiple R.73 R Square.548 Adjusted R Square.447 Standard Error Observations 5 Are Individual Variables Significant? Excel Output t Stat for Price is t STAT = -.36, with p-value.398 t Stat for Advertising is t STAT =.855, with p-value.45 ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-3 Are Individual Variables Significant? Minitab Output The regression equation is Sales = Price Advertising Predictor Coef SE Coef T P Constant Price Advertising S = R-Sq = 5.% R-Sq(adj) = 44.% Analysis of Variance Source DF SS MS F P Regression Residual Error Total t Stat for Price is t STAT = -.36, with p-value.398 t Stat for Advertising is t STAT =.855, with p-value.45 Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

12 Chapter 4 4- H : β j = H : β j d.f. = 5-- = =.5 t / =.788 /=.5 Inferences about the Slope: t Test Example From the Excel and Minitab output: For Price t STAT = -.36, with p-value.398 For Advertising t STAT =.855, with p-value.45 The test t statistic ti ti for each variable falls in the rejection region (p-values <.5) /=.5 Decision: Reject H for each variable Conclusion: There is evidence that both Price and Advertising affect pie sales at =.5 Reject H Do not reject H Reject H -t α/ t α/ Statistics: A First 9 Prentice-Hall, Business Course, 5e Inc.. Chap 3-34 Confidence Interval Estimate for the Slope Confidence interval for the population slope β j b j tα / S bj where t has (n k ) d.f. Coefficients Standard Error Intercept Price Advertising Here, t has (5 ) = d.f. Example: Form a 95% confidence interval for the effect of changes in price (X ) on pie sales: ± (.788)(.83) So the interval is ( , -.374) (This interval does not contain zero, so price has a significant effect on sales) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-35 Confidence Interval Estimate for the Slope Confidence interval for the population slope β j Coefficients Standard Error Lower 95% Upper 95% Intercept Price Advertising Example: Excel output also reports these interval endpoints: Weekly sales are estimated to be reduced by between.37 to pies for each increase of $ in the selling price, holding the effect of price constant Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

13 Chapter Using Dummy Variables A dummy variable is a categorical independent variable with two levels: yes or no, on or off, male or female coded as or Assumes the slopes associated with numerical independent variables do not change with the value for the categorical variable Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-37 Dummy-Variable Example (with Levels) Let: Ŷ b b X b X Y = pie sales X = price X = holiday (X = if a holiday occurred during the week) (X = if there was no holiday that week) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-38 Dummy-Variable Example (with Levels) Ŷ b b X b () (b b ) b X Ŷ b b X b () Y (sales) b + b b b Different intercept b X Same slope Holiday No Holiday If H : β = is rejected, then Holiday has a significant effect on pie sales X (Price) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

14 Chapter Interpreting the Dummy Variable Coefficient (with Levels) Example: Sales 3-3(Price) 5(Holiday) Sales: number of pies sold per week Price: pie price in $ Holiday: If a holiday occurred during the week If no holiday occurred b = 5: on average, sales were 5 pies greater in weeks with a holiday than in weeks without a holiday, given the same price Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-4 Interaction Between Independent Variables Hypothesizes interaction between pairs of X variables Response to one X variable may vary at different levels of another X variable Contains two-way cross product terms Ŷ b b X b X b X b b X b X b (X X ) Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-4 Effect of Interaction Given: Y β βx βx β3xx ε Without interaction term, effect of X on Y is measured dby β With interaction term, effect of X on Y is measured by β + β 3 X Effect changes as X changes Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

15 Chapter Interaction Example Suppose X is a dummy variable and the estimated regression equation is Ŷ = + X + 3X + 4X X Y 8 X = : Y = + X + 3() + 4X () = 4 + 6X 4 X = : Y = + X + 3() + 4X () = + X X.5.5 Slopes are different if the effect of X on Y depends on X value Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-43 Significance of Interaction Term Tested by utilizing the t-test on the coefficient associated with the interaction term. Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap 3-44 Chapter Summary Developed the multiple regression model Tested the significance of the multiple regression model Discussed adjusted r Discussed using residual plots to check model assumptions Tested individual regression coefficients Used dummy variables Evaluated interaction effects Business Statistics: A First Course, 5e 9 Prentice-Hall, Inc.. Chap Prentice Hall, Inc.

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ STA 3024 Practice Problems Exam 2 NOTE: These are just Practice Problems. This is NOT meant to look just like the test, and it is NOT the only thing that you should study. Make sure you know all the material

More information

August 2012 EXAMINATIONS Solution Part I

August 2012 EXAMINATIONS Solution Part I August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,

More information

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares Topic 4 - Analysis of Variance Approach to Regression Outline Partitioning sums of squares Degrees of freedom Expected mean squares General linear test - Fall 2013 R 2 and the coefficient of correlation

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

Estimation of σ 2, the variance of ɛ

Estimation of σ 2, the variance of ɛ Estimation of σ 2, the variance of ɛ The variance of the errors σ 2 indicates how much observations deviate from the fitted surface. If σ 2 is small, parameters β 0, β 1,..., β k will be reliably estimated

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The

More information

1.1. Simple Regression in Excel (Excel 2010).

1.1. Simple Regression in Excel (Excel 2010). .. Simple Regression in Excel (Excel 200). To get the Data Analysis tool, first click on File > Options > Add-Ins > Go > Select Data Analysis Toolpack & Toolpack VBA. Data Analysis is now available under

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

One-Way Analysis of Variance (ANOVA) Example Problem

One-Way Analysis of Variance (ANOVA) Example Problem One-Way Analysis of Variance (ANOVA) Example Problem Introduction Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of two or more population (or treatment) means

More information

17. SIMPLE LINEAR REGRESSION II

17. SIMPLE LINEAR REGRESSION II 17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.

More information

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480

Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500 6 8480 1) The S & P/TSX Composite Index is based on common stock prices of a group of Canadian stocks. The weekly close level of the TSX for 6 weeks are shown: Week TSX Index 1 8480 2 8470 3 8475 4 8510 5 8500

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Regression step-by-step using Microsoft Excel

Regression step-by-step using Microsoft Excel Step 1: Regression step-by-step using Microsoft Excel Notes prepared by Pamela Peterson Drake, James Madison University Type the data into the spreadsheet The example used throughout this How to is a regression

More information

GLM I An Introduction to Generalized Linear Models

GLM I An Introduction to Generalized Linear Models GLM I An Introduction to Generalized Linear Models CAS Ratemaking and Product Management Seminar March 2009 Presented by: Tanya D. Havlicek, Actuarial Assistant 0 ANTITRUST Notice The Casualty Actuarial

More information

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results IAPRI Quantitative Analysis Capacity Building Series Multiple regression analysis & interpreting results How important is R-squared? R-squared Published in Agricultural Economics 0.45 Best article of the

More information

2. What is the general linear model to be used to model linear trend? (Write out the model) = + + + or

2. What is the general linear model to be used to model linear trend? (Write out the model) = + + + or Simple and Multiple Regression Analysis Example: Explore the relationships among Month, Adv.$ and Sales $: 1. Prepare a scatter plot of these data. The scatter plots for Adv.$ versus Sales, and Month versus

More information

The importance of graphing the data: Anscombe s regression examples

The importance of graphing the data: Anscombe s regression examples The importance of graphing the data: Anscombe s regression examples Bruce Weaver Northern Health Research Conference Nipissing University, North Bay May 30-31, 2008 B. Weaver, NHRC 2008 1 The Objective

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0.

Predictor Coef StDev T P Constant 970667056 616256122 1.58 0.154 X 0.00293 0.06163 0.05 0.963. S = 0.5597 R-Sq = 0.0% R-Sq(adj) = 0. Statistical analysis using Microsoft Excel Microsoft Excel spreadsheets have become somewhat of a standard for data storage, at least for smaller data sets. This, along with the program often being packaged

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Factors affecting online sales

Factors affecting online sales Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

More information

MULTIPLE REGRESSION WITH CATEGORICAL DATA

MULTIPLE REGRESSION WITH CATEGORICAL DATA DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting

More information

Notes on Applied Linear Regression

Notes on Applied Linear Regression Notes on Applied Linear Regression Jamie DeCoster Department of Social Psychology Free University Amsterdam Van der Boechorststraat 1 1081 BT Amsterdam The Netherlands phone: +31 (0)20 444-8935 email:

More information

1 Simple Linear Regression I Least Squares Estimation

1 Simple Linear Regression I Least Squares Estimation Simple Linear Regression I Least Squares Estimation Textbook Sections: 8. 8.3 Previously, we have worked with a random variable x that comes from a population that is normally distributed with mean µ and

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Linear Models in R Regression Regression analysis is the appropriate

More information

5. Multiple regression

5. Multiple regression 5. Multiple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/5 QBUS6840 Predictive Analytics 5. Multiple regression 2/39 Outline Introduction to multiple linear regression Some useful

More information

International Statistical Institute, 56th Session, 2007: Phil Everson

International Statistical Institute, 56th Session, 2007: Phil Everson Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction

More information

MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

More information

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996)

MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL. by Michael L. Orlov Chemistry Department, Oregon State University (1996) MULTIPLE LINEAR REGRESSION ANALYSIS USING MICROSOFT EXCEL by Michael L. Orlov Chemistry Department, Oregon State University (1996) INTRODUCTION In modern science, regression analysis is a necessary part

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

CS 147: Computer Systems Performance Analysis

CS 147: Computer Systems Performance Analysis CS 147: Computer Systems Performance Analysis One-Factor Experiments CS 147: Computer Systems Performance Analysis One-Factor Experiments 1 / 42 Overview Introduction Overview Overview Introduction Finding

More information

Chapter 4 and 5 solutions

Chapter 4 and 5 solutions Chapter 4 and 5 solutions 4.4. Three different washing solutions are being compared to study their effectiveness in retarding bacteria growth in five gallon milk containers. The analysis is done in a laboratory,

More information

Multinomial and Ordinal Logistic Regression

Multinomial and Ordinal Logistic Regression Multinomial and Ordinal Logistic Regression ME104: Linear Regression Analysis Kenneth Benoit August 22, 2012 Regression with categorical dependent variables When the dependent variable is categorical,

More information

We extended the additive model in two variables to the interaction model by adding a third term to the equation.

We extended the additive model in two variables to the interaction model by adding a third term to the equation. Quadratic Models We extended the additive model in two variables to the interaction model by adding a third term to the equation. Similarly, we can extend the linear model in one variable to the quadratic

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

10. Analysis of Longitudinal Studies Repeat-measures analysis

10. Analysis of Longitudinal Studies Repeat-measures analysis Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.

More information

Jinadasa Gamage, Professor of Mathematics, Illinois State University, Normal, IL, e- mail: jina@ilstu.edu

Jinadasa Gamage, Professor of Mathematics, Illinois State University, Normal, IL, e- mail: jina@ilstu.edu Submission for ARCH, October 31, 2006 Jinadasa Gamage, Professor of Mathematics, Illinois State University, Normal, IL, e- mail: jina@ilstu.edu Jed L. Linfield, FSA, MAAA, Health Actuary, Kaiser Permanente,

More information

Section 1: Simple Linear Regression

Section 1: Simple Linear Regression Section 1: Simple Linear Regression Carlos M. Carvalho The University of Texas McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Interaction between quantitative predictors

Interaction between quantitative predictors Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors

More information

Chapter 3 Quantitative Demand Analysis

Chapter 3 Quantitative Demand Analysis Managerial Economics & Business Strategy Chapter 3 uantitative Demand Analysis McGraw-Hill/Irwin Copyright 2010 by the McGraw-Hill Companies, Inc. All rights reserved. Overview I. The Elasticity Concept

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

Simple Methods and Procedures Used in Forecasting

Simple Methods and Procedures Used in Forecasting Simple Methods and Procedures Used in Forecasting The project prepared by : Sven Gingelmaier Michael Richter Under direction of the Maria Jadamus-Hacura What Is Forecasting? Prediction of future events

More information

Business Valuation Review

Business Valuation Review Business Valuation Review Regression Analysis in Valuation Engagements By: George B. Hawkins, ASA, CFA Introduction Business valuation is as much as art as it is science. Sage advice, however, quantitative

More information

Multiple Linear Regression in Data Mining

Multiple Linear Regression in Data Mining Multiple Linear Regression in Data Mining Contents 2.1. A Review of Multiple Linear Regression 2.2. Illustration of the Regression Process 2.3. Subset Selection in Linear Regression 1 2 Chap. 2 Multiple

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

Coefficient of Determination

Coefficient of Determination Coefficient of Determination The coefficient of determination R 2 (or sometimes r 2 ) is another measure of how well the least squares equation ŷ = b 0 + b 1 x performs as a predictor of y. R 2 is computed

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Scatterplots Correlation Explanatory and response variables Simple linear regression General Principles of Data Analysis First plot the data, then add numerical summaries Look

More information

Outline: Demand Forecasting

Outline: Demand Forecasting Outline: Demand Forecasting Given the limited background from the surveys and that Chapter 7 in the book is complex, we will cover less material. The role of forecasting in the chain Characteristics of

More information

Causal Forecasting Models

Causal Forecasting Models CTL.SC1x -Supply Chain & Logistics Fundamentals Causal Forecasting Models MIT Center for Transportation & Logistics Causal Models Used when demand is correlated with some known and measurable environmental

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

5. Linear Regression

5. Linear Regression 5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4

More information

An analysis method for a quantitative outcome and two categorical explanatory variables.

An analysis method for a quantitative outcome and two categorical explanatory variables. Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that

More information

1.5 Oneway Analysis of Variance

1.5 Oneway Analysis of Variance Statistics: Rosie Cornish. 200. 1.5 Oneway Analysis of Variance 1 Introduction Oneway analysis of variance (ANOVA) is used to compare several means. This method is often used in scientific or medical experiments

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and

More information

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1) Spring 204 Class 9: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.) Big Picture: More than Two Samples In Chapter 7: We looked at quantitative variables and compared the

More information

Section 13, Part 1 ANOVA. Analysis Of Variance

Section 13, Part 1 ANOVA. Analysis Of Variance Section 13, Part 1 ANOVA Analysis Of Variance Course Overview So far in this course we ve covered: Descriptive statistics Summary statistics Tables and Graphs Probability Probability Rules Probability

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb 2 2015

Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb 2 2015 Stat 412/512 CASE INFLUENCE STATISTICS Feb 2 2015 Charlotte Wickham stat512.cwick.co.nz Regression in your field See website. You may complete this assignment in pairs. Find a journal article in your field

More information

c 2015, Jeffrey S. Simonoff 1

c 2015, Jeffrey S. Simonoff 1 Modeling Lowe s sales Forecasting sales is obviously of crucial importance to businesses. Revenue streams are random, of course, but in some industries general economic factors would be expected to have

More information

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

MODELING AUTO INSURANCE PREMIUMS

MODELING AUTO INSURANCE PREMIUMS MODELING AUTO INSURANCE PREMIUMS Brittany Parahus, Siena College INTRODUCTION The findings in this paper will provide the reader with a basic knowledge and understanding of how Auto Insurance Companies

More information

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management KSTAT MINI-MANUAL Decision Sciences 434 Kellogg Graduate School of Management Kstat is a set of macros added to Excel and it will enable you to do the statistics required for this course very easily. To

More information

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5 Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression

More information

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2

ECON 142 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE #2 University of California, Berkeley Prof. Ken Chay Department of Economics Fall Semester, 005 ECON 14 SKETCH OF SOLUTIONS FOR APPLIED EXERCISE # Question 1: a. Below are the scatter plots of hourly wages

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

4. Multiple Regression in Practice

4. Multiple Regression in Practice 30 Multiple Regression in Practice 4. Multiple Regression in Practice The preceding chapters have helped define the broad principles on which regression analysis is based. What features one should look

More information

Week 5: Multiple Linear Regression

Week 5: Multiple Linear Regression BUS41100 Applied Regression Analysis Week 5: Multiple Linear Regression Parameter estimation and inference, forecasting, diagnostics, dummy variables Robert B. Gramacy The University of Chicago Booth School

More information

Statistical Models in R

Statistical Models in R Statistical Models in R Some Examples Steven Buechler Department of Mathematics 276B Hurley Hall; 1-6233 Fall, 2007 Outline Statistical Models Structure of models in R Model Assessment (Part IA) Anova

More information

Analysis of Variance. MINITAB User s Guide 2 3-1

Analysis of Variance. MINITAB User s Guide 2 3-1 3 Analysis of Variance Analysis of Variance Overview, 3-2 One-Way Analysis of Variance, 3-5 Two-Way Analysis of Variance, 3-11 Analysis of Means, 3-13 Overview of Balanced ANOVA and GLM, 3-18 Balanced

More information

ANOVA. February 12, 2015

ANOVA. February 12, 2015 ANOVA February 12, 2015 1 ANOVA models Last time, we discussed the use of categorical variables in multivariate regression. Often, these are encoded as indicator columns in the design matrix. In [1]: %%R

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

Categorical Data Analysis

Categorical Data Analysis Richard L. Scheaffer University of Florida The reference material and many examples for this section are based on Chapter 8, Analyzing Association Between Categorical Variables, from Statistical Methods

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

THE KRUSKAL WALLLIS TEST

THE KRUSKAL WALLLIS TEST THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON

More information

MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis

MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis Journal of tourism [No. 8] MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM Assistant Ph.D. Erika KULCSÁR Babeş Bolyai University of Cluj Napoca, Romania Abstract This paper analysis

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

MTH 140 Statistics Videos

MTH 140 Statistics Videos MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information