CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Size: px
Start display at page:

Download "CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression"

Transcription

1 Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the relationship etween two or more variales. A simple regression model includes only two variales: one independent and one dependent. The dependent variale is the one eing explained, and the independent variale is the one used to explain the variation in the dependent variale. Linear Regression Definition A (simple) regression model that gives a straight-line relationship etween two variales is called a linear regression model. Figure 13.1 Relationship etween food expenditure and income. (a) Linear relationship. () Nonlinear relationship.

2 Figure 13. Plotting a linear equation. Figure 13.3 y-intercept and slope of a line. SIMPLE LINEAR REGREION ANALYSIS SIMPLE LINEAR REGREION ANALYSIS Definition In the regression model y = A + Bx + ε, A is called the y- intercept or constant term, B is the slope, and ε is the random error term. The dependent and independent variales are y and x, respectively. SIMPLE LINEAR REGREION ANALYSIS Definition In the model ŷ = a + x, a and, which are calculated using sample data, are called the estimates of A and B, respectively. Tale 13.1 Incomes (in hundreds of dollars) and Food Expenditures of Seven Households

3 Scatter Diagram Definition A plot of paired oservations is called a scatter diagram. Figure 13.4 Scatter diagram. Figure 13.5 Scatter diagram and straight lines. Figure 13.6 Regression Line and random errors. Error Sum of Squares (E) The error sum of squares, denoted E, is E = e = ( y yˆ ) The values of a and that give the minimum E are called the least square estimates of A and B, and the regression line otained with these estimates is called the least squares line. The Least Squares Line For the least squares regression line ŷ = a + x, where xy = and a = y x ( x)( y) ( x) x xy = xy and = n and stands for sum of squares. The least squares regression line ŷ = a + x is also called the regression of y on x. n

4 Example 13-1 Find the least squares regression line for the data on incomes and food expenditure on the seven households given in the Tale Use income as an independent variale and food expenditure as a dependent variale. Tale 13. Example 13-1: Solution x = 386 y = 108 x = x/ n = 386 / 7 = y = y / n = 108 / 7 = Example 13-1: Solution ( x)( y) (386)(108) xy = xy = 6403 = n 7 ( ) x (386) = x = 3,058 = n 7 Example 13-1: Solution Figure 13.7 Error of prediction. xy = = = a = y x = (.55)(55.149) = Thus, our estimated regression model is ŷ = x

5 Interpretation of a and Interpretation of a! Consider a household with zero income. Using the estimated regression line otained in Example 13-1, " ŷ = (0) = $ hundred.! Thus, we can state that a household with no income is expected to spend $ per month on food.! The regression line is valid only for the values of x etween 33 and 83. Interpretation of a and Interpretation of! The value of in the regression model gives the change in y (dependent variale) due to a change of one unit in x (independent variale).! We can state that, on average, a $100 (or $1) increase in income of a household will increase the food expenditure y $5.5 (or $.55). Figure 13.8 Positive and negative linear relationships etween x and y. Case Study 13-1 Regression of Weights on Heights for NFL Players Case Study 13-1 Regression of Weights on Heights for NFL Players Assumptions of the Regression Model Assumption 1: The random error term Є has a mean equal to zero for each x Assumption : The errors associated with different oservations are independent Assumption 3: For any given x, the distriution of errors is normal Assumption 4: The distriution of population errors for each x has the same (constant) standard deviation, which is denoted σ Є

6 Figure (a) Errors for households with an income of $4000 per month. Figure () Errors for households with an income of $ 7500 per month. Figure 13.1 Distriution of errors around the population regression line. Figure Nonlinear relations etween x and y. STANDARD DEVIATION OF ERRORS AND COEFFICIENT OF DETERMINATION Degrees of Freedom for a Simple Linear Regression Model The degrees of freedom for a simple linear regression model are df = n Figure Spread of errors for x = 40 and x = 75.

7 STANDARD DEVIATION OF ERRORS AND COEFFICIENT OF DETERMINATION The standard deviation of errors is calculated as where s e = n xy Example 13- Compute the standard deviation of errors s e for the data on monthly incomes and food expenditures of the seven households given in Tale ( y ) = y n Tale 13.3 Example 13-: Solution ( y ) (108) = y = 179 = n 7 xy ( ) se = = n 7 COEFFICIENT OF DETERMINATION Total Sum of Squares (T) The total sum of squares, denoted y T, is calculated as ( ) T = y n y Figure Total errors. Note that this is the same formula that we used to calculate.

8 Tale 13.4 Figure Errors of prediction when regression model is used. COEFFICIENT OF DETERMINATION Regression Sum of Squares (R) The regression sum of squares, denoted y R, is R = T E COEFFICIENT OF DETERMINATION Coefficient of Determination The coefficient of determination, denoted y r, represents the proportion of T that is explained y the use of the regression model. The computational formula for r is r = xy and 0 r 1 Example 13-3 For the data of Tale 13.1 on monthly incomes and food expenditures of seven households, calculate the coefficient of determination. Example 13-3: Solution! From earlier calculations made in Examples 13-1 and 13-,! =.55, = , = xy (.55)( ) r = = =

9 INFERENCES ABOUT B! Sampling Distriution of! Estimation of B! Hypothesis Testing Aout B Sampling Distriution of Mean, Standard Deviation, and Sampling Distriution of Because of the assumption of normally distriuted random errors, the sampling distriution of is normal. The mean and standard deviation of, denoted y µ and, respectively, σ are σ µ = B and σ = Estimation of B Confidence Interval for B The (1 α)100% confidence interval for B is given y ± ts Example 13-4 Construct a 95% confidence interval for B for the data on incomes and food expenditures of seven households given in Tale where s = s e and the value of t is otained from the t distriution tale for α α / area in the right tail of the t distriution and n- degrees of freedom. Example 13-4: Solution s se = = = df = n = 7 = 5 α / = (1.95) / =.05 t =.571 ± ts =.55 ±.571(.0379) =.55 ±.0974 =.155 to.350 Hypothesis Testing Aout B Test Statistic for The value of the test statistic t for is calculated as B t = s The value of B is sustituted from the null hypothesis.

10 Example 13-5 Test at the 1% significance level whether the slope of the regression line for the example on incomes and food expenditures of seven households is positive. Example 13-5: Solution! Step 1: H 0 : B = 0 (The slope is zero) H 1 : B > 0 (The slope is positive)! Step : σ is not known Hence, we will use the t distriution to make the test aout B. Example 13-5: Solution! Step 3: α =.01 Area in the right tail = α =.01 df = n = 7 = 5 The critical value of t is Figure Example 13-5: Solution # Step 4: From H 0 B.55 0 t = = = 6.66 s.0379 Example 13-5: Solution! Step 5: The value of the test statistic t = 6.66 " It is greater than the critical value of t = " It falls in the rejection region Hence, we reject the null hypothesis We conclude that x (income) determines y (food expenditure) positively.

11 LINEAR CORRELATION! Linear Correlation Coefficient! Hypothesis Testing Aout the Linear Correlation Coefficient Linear Correlation Coefficient Value of the Correlation Coefficient The value of the correlation coefficient always lies in the range of 1 to 1; that is, -1 ρ 1 and -1 r 1 Figure Linear correlation etween two variales. (a) Perfect positive linear correlation, r = 1 Figure Linear correlation etween two variales. () Perfect negative linear correlation, r = -1 Copyright 013 John Wiley x & Sons. All rights reserved. Copyright 013 John Wiley x & Sons. All rights reserved. Figure Linear correlation etween two variales. Figure Linear correlation etween variales. (c) No linear correlation,, r 0 Copyright 013 John Wiley x & Sons. All rights reserved.

12 Figure Linear correlation etween variales. Figure Linear correlation etween variales. Figure Linear correlation etween variales. Linear Correlation Coefficient Linear Correlation Coefficient The simple linear correlation coefficient, denoted y r, measures the strength of the linear relationship etween two variales for a sample and is calculated as r = xy Example 13-6 Calculate the correlation coefficient for the example on incomes and food expenditures of seven households. Example 13-6: Solution r = xy = =.95 ( )( )

13 Hypothesis Testing Aout the Linear Correlation Coefficient Test Statistic for r If oth variales are normally distriuted and the null hypothesis is H 0 : ρ = 0, then the value of the test statistic t is calculated as t = r n 1 r Example 13-7 Using the 1% level of significance and the data from Example 13-1, test whether the linear correlation coefficient etween incomes and food expenditures is positive. Assume that the populations of oth variales are normally distriuted. Here n are the degrees of freedom. Example 13-7: Solution! Step 1: H 0 : ρ = 0 (The linear correlation coefficient is zero) H 1 : ρ > 0 (The linear correlation coefficient is positive)! Step : The population distriutions for oth variales are normally distriuted. Hence, we can use the t distriution to perform this test aout the linear correlation coefficient. Example 13-7: Solution! Step 3: Area in the right tail =.01 df = n = 7 = 5 The critical value of t = Figure 13.0 Example 13-7: Solution # Step 4:!=" % #/$ " # =.&'($ ) #/$ (.&'($ ) # =6.667

14 Example 13-7: Solution! Step 5: The value of the test statistic t = " It is greater than the critical value of t=3.365 " It falls in the rejection region Hence, we reject the null hypothesis. REGREION ANALYSIS: A COMPLETE Example 13-8 A random sample of eight drivers selected from a small city insured with a company and having similar minimum required auto insurance policies was selected. The following tale lists their driving experiences (in years) and monthly auto insurance premiums (in dollars). We conclude that there is a positive relationship etween incomes and food expenditures. Example 13-8 Example 13-8 (a) Does the insurance premium depend on the driving experience or does the driving experience depend on the insurance premium? Do you expect a positive or a negative relationship etween these two variales? () Compute,, and xy. (c) Find the least squares regression line y choosing appropriate dependent and independent variales ased on your answer in part a. (d) Interpret the meaning of the values of a and calculated in part c. Example 13-8 (e) Plot the scatter diagram and the regression line. (f) Calculate r and r and explain what they mean. (g) Predict the monthly auto insurance for a driver with 10 years of driving experience. (h) Compute the standard deviation of errors. (i) Construct a 90% confidence interval for B. (j) Test at the 5% significance level whether B is negative. (k) Using α =.05, test whether ρ is different from zero. (a) Based on theory and intuition, we expect the insurance premium to depend on driving experience. " The insurance premium is a dependent variale " The driving experience is an independent variale

15 Tale 13.5 () x = x/ n = 90/8= 11.5 y = y/ n = 474/8= 59.5 xy ( x)( y) (90)(474) = xy = 4739 = n 8 ( x) (90) 1396 n ( y ) (474) 9,64 n = x = = = y = = (c) xy = = = a = y x = 59.5 ( )(11.5) = (d) The value of a = gives the value of ŷ for x = 0; that is, it gives the monthly auto insurance premium for a driver with no driving experience. The value of = indicates that, on average, for every extra year of driving experience, the monthly auto insurance premium decreases y $1.55. ŷ=)-.--./ $./')-) Figure 13.1 Scatter diagram and the regression line. (e) The regression line slopes downward from left to right. (f) xy r = = =.77 ( )( ) r xy ( )( ) = = =

16 (f) The value of r = indicates that the driving experience and the monthly auto insurance premium are negatively related. The (linear) relationship is strong ut not very strong. The value of r² = 0.59 states that 59% of the total variation in insurance premiums is explained y years of driving experience and 41% is not. (g) Using the estimated regression line, we find the predicted value of y for x = 10 is ŷ = (10) = $61.18 Thus, we expect the monthly auto insurance premium of a driver with 10 years of driving experience to e $ (h) s e = n xy ( )( ) = 8 = (i) se s = = = α / =.5 (.90/) =.05 df = n = 8 = 6 t = ± ts = ± 1.943(.570) = ± =.57 to.5 (j)! Step 1: H 0 : B = 0 (B is not negative) H 1 : B < 0 (B is negative)! Step 3: Area in the left tail = α =.05 df = n = 8 = 6 The critical value of t is ! Step : Because the standard deviation of the error is not known, we use the t distriution to make the hypothesis test

17 Figure 13. # Step 4: From H 0 B t = = =.937 s.570! Step 5: The value of the test statistic t = " It falls in the rejection region Hence, we reject the null hypothesis and conclude that B is negative. (k)! Step 1: H 0 : ρ = 0 (The linear correlation coefficient is zero) H 1 : ρ 0 (The linear correlation coefficient is different from zero) The monthly auto insurance premium decreases with an increase in years of driving experience.! Step : Assuming that variales x and y are normally distriuted, we will use the t distriution to perform this test aout the linear correlation coefficient. Figure 13.3! Step 3: Area in each tail =.05/ =.05 df = n = 8 = 6 The critical values of t are and.447

18 # Step 4: *=" % #/$ " # =.)-)& ( #/$ (.))) # = -.936! Step 5: The value of the test statistic t = " It falls in the rejection region Hence, we reject the null hypothesis We conclude that the linear correlation coefficient etween driving experience and auto insurance premium is different from zero. USING THE REGREION MODEL! Using the Regression Model for Estimating the Mean Value of y! Using the Regression Model for Predicting a Particular Value of y Figure 13.4 Population and sample regression lines. Using the Regression Model for Estimating the Mean Value of y Confidence Interval for µ y x The (1 α)100% confidence interval for µ y x for x = x 0 is ˆ y ± t s ym ˆ where the value of t is otained from the t distriution tale for α/ area in the right tail of the t distriution curve and df = n. Using the Regression Model for Estimating the Mean Value of y Confidence Interval for µ y x The value of s y is calculated as follows: s ˆm 1 ( x0 x) = s + n yˆ m e

19 Example 13-9 Refer to Example 13-1 on incomes and food expenditures. Find a 99% confidence interval for the mean food expenditure for all households with a monthly income of $5500. Example 13-9: Solution! Using the regression line estimated in Example 13-1, we find the point estimate of the mean food expenditure for x = 55 " ŷ = (55) = $ hundred! Area in each tail = α/ = (1.99)/ =.005! df = n = 7 = 5! t = 4.03 Example 13-9: Solution s = , x = , and = S e yˆ m 1 ( x0 x) = se + n 1 ( ) = (1.5939) + = Example 13-9: Solution Hence, the 99% confidence interval for µ yˆ ± ts = ± 4.03(.605) yˆ m y 55 = ±.493 = to is Using the Regression Model for Predicting a Particular Value of y Prediction Interval for y p The (1 α)100% prediction interval for the predicted value of y, denoted y y p, for x = x 0 is ˆ y ± t s y ˆ p Using the Regression Model for Predicting a Particular Value of y Prediction Interval for y p where the value of t is otained from the t distriution tale for α/ area in the right tail of the t distriution curve and df = n. The value of s s y ˆ p yˆ p e is calculated as follows: 1 ( x0 x) = s 1+ + n

20 Example Refer to Example 13-1 on incomes and food expenditures. Find a 99% prediction interval for the predicted food expenditure for a randomly selected household with a monthly income of $5500. Example 13-10: Solution! Using the regression line estimated in Example 13-1, we find the point estimate of the predicted food expenditure for x = 55 " ŷ = (55) = $ hundred! Area in each tail = α/ = (1.99)/ =.005! df = n = 7 = 5! t = 4.03 Example 13-10: Solution Example 13-10: Solution s = , x = , and = S e yˆ p 1 ( x0 x) = se 1+ + n 1 ( ) = (1.5939) 1+ + = Hence, the 99% prediction interval for y for x = 55 is yˆ ± t s = ± 4.03(1.7040) ŷ p = ± = 8.50 to.630 p TI-84 TI-84

21 Minita Excel Excel Excel Excel

Regression Analysis: A Complete Example

Regression Analysis: A Complete Example Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

More information

Factors affecting online sales

Factors affecting online sales Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

17. SIMPLE LINEAR REGRESSION II

17. SIMPLE LINEAR REGRESSION II 17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.

More information

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

More information

Non-Linear Regression 2006-2008 Samuel L. Baker

Non-Linear Regression 2006-2008 Samuel L. Baker NON-LINEAR REGRESSION 1 Non-Linear Regression 2006-2008 Samuel L. Baker The linear least squares method that you have een using fits a straight line or a flat plane to a unch of data points. Sometimes

More information

Example: Boats and Manatees

Example: Boats and Manatees Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant

More information

2013 MBA Jump Start Program. Statistics Module Part 3

2013 MBA Jump Start Program. Statistics Module Part 3 2013 MBA Jump Start Program Module 1: Statistics Thomas Gilbert Part 3 Statistics Module Part 3 Hypothesis Testing (Inference) Regressions 2 1 Making an Investment Decision A researcher in your firm just

More information

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96 1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

More information

Regression and Correlation

Regression and Correlation Regression and Correlation Topics Covered: Dependent and independent variables. Scatter diagram. Correlation coefficient. Linear Regression line. by Dr.I.Namestnikova 1 Introduction Regression analysis

More information

Correlation key concepts:

Correlation key concepts: CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

Using R for Linear Regression

Using R for Linear Regression Using R for Linear Regression In the following handout words and symbols in bold are R functions and words and symbols in italics are entries supplied by the user; underlined words and symbols are optional

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

A Primer on Forecasting Business Performance

A Primer on Forecasting Business Performance A Primer on Forecasting Business Performance There are two common approaches to forecasting: qualitative and quantitative. Qualitative forecasting methods are important when historical data is not available.

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

August 2012 EXAMINATIONS Solution Part I

August 2012 EXAMINATIONS Solution Part I August 01 EXAMINATIONS Solution Part I (1) In a random sample of 600 eligible voters, the probability that less than 38% will be in favour of this policy is closest to (B) () In a large random sample,

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4

4. Simple regression. QBUS6840 Predictive Analytics. https://www.otexts.org/fpp/4 4. Simple regression QBUS6840 Predictive Analytics https://www.otexts.org/fpp/4 Outline The simple linear model Least squares estimation Forecasting with regression Non-linear functional forms Regression

More information

QUADRATIC EQUATIONS EXPECTED BACKGROUND KNOWLEDGE

QUADRATIC EQUATIONS EXPECTED BACKGROUND KNOWLEDGE MODULE - 1 Quadratic Equations 6 QUADRATIC EQUATIONS In this lesson, you will study aout quadratic equations. You will learn to identify quadratic equations from a collection of given equations and write

More information

Probability, Mean and Median

Probability, Mean and Median Proaility, Mean and Median In the last section, we considered (proaility) density functions. We went on to discuss their relationship with cumulative distriution functions. The goal of this section is

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression We are often interested in studying the relationship among variables to determine whether they are associated with one another. When we think that changes in a

More information

The Big Picture. Correlation. Scatter Plots. Data

The Big Picture. Correlation. Scatter Plots. Data The Big Picture Correlation Bret Hanlon and Bret Larget Department of Statistics Universit of Wisconsin Madison December 6, We have just completed a length series of lectures on ANOVA where we considered

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1. Page 1 of 11. EduPristine CMA - Part I

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1. Page 1 of 11. EduPristine CMA - Part I Index Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques... 1 EduPristine CMA - Part I Page 1 of 11 Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting

More information

AP Physics 1 and 2 Lab Investigations

AP Physics 1 and 2 Lab Investigations AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression A regression with two or more explanatory variables is called a multiple regression. Rather than modeling the mean response as a straight line, as in simple regression, it is

More information

Indiana State Core Curriculum Standards updated 2009 Algebra I

Indiana State Core Curriculum Standards updated 2009 Algebra I Indiana State Core Curriculum Standards updated 2009 Algebra I Strand Description Boardworks High School Algebra presentations Operations With Real Numbers Linear Equations and A1.1 Students simplify and

More information

Part 2: Analysis of Relationship Between Two Variables

Part 2: Analysis of Relationship Between Two Variables Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable

More information

2. What is the general linear model to be used to model linear trend? (Write out the model) = + + + or

2. What is the general linear model to be used to model linear trend? (Write out the model) = + + + or Simple and Multiple Regression Analysis Example: Explore the relationships among Month, Adv.$ and Sales $: 1. Prepare a scatter plot of these data. The scatter plots for Adv.$ versus Sales, and Month versus

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5 Using Your TI-83/84 Calculator: Linear Correlation and Regression Elementary Statistics Dr. Laura Schultz This handout describes how to use your calculator for various linear correlation and regression

More information

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r), Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

More information

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools 2009-2010 Week 1 Week 2 14.0 Students organize and describe distributions of data by using a number of different

More information

WEB APPENDIX. Calculating Beta Coefficients. b Beta Rise Run Y 7.1 1 8.92 X 10.0 0.0 16.0 10.0 1.6

WEB APPENDIX. Calculating Beta Coefficients. b Beta Rise Run Y 7.1 1 8.92 X 10.0 0.0 16.0 10.0 1.6 WEB APPENDIX 8A Calculating Beta Coefficients The CAPM is an ex ante model, which means that all of the variables represent before-thefact, expected values. In particular, the beta coefficient used in

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?

Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure? Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure? Harvey Motulsky hmotulsky@graphpad.com This is the first case in what I expect will be a series of case studies. While I mention

More information

International Statistical Institute, 56th Session, 2007: Phil Everson

International Statistical Institute, 56th Session, 2007: Phil Everson Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: peverso1@swarthmore.edu 1. Introduction

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares Linear Regression Chapter 5 Regression Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). We can then predict the average response for all subjects

More information

1 Simple Linear Regression I Least Squares Estimation

1 Simple Linear Regression I Least Squares Estimation Simple Linear Regression I Least Squares Estimation Textbook Sections: 8. 8.3 Previously, we have worked with a random variable x that comes from a population that is normally distributed with mean µ and

More information

Module 5: Multiple Regression Analysis

Module 5: Multiple Regression Analysis Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College

More information

Interaction between quantitative predictors

Interaction between quantitative predictors Interaction between quantitative predictors In a first-order model like the ones we have discussed, the association between E(y) and a predictor x j does not depend on the value of the other predictors

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

5. Linear Regression

5. Linear Regression 5. Linear Regression Outline.................................................................... 2 Simple linear regression 3 Linear model............................................................. 4

More information

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables. SIMPLE LINEAR CORRELATION Simple linear correlation is a measure of the degree to which two variables vary together, or a measure of the intensity of the association between two variables. Correlation

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS. SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed

More information

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. Polynomial Regression POLYNOMIAL AND MULTIPLE REGRESSION Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model. It is a form of linear regression

More information

The importance of graphing the data: Anscombe s regression examples

The importance of graphing the data: Anscombe s regression examples The importance of graphing the data: Anscombe s regression examples Bruce Weaver Northern Health Research Conference Nipissing University, North Bay May 30-31, 2008 B. Weaver, NHRC 2008 1 The Objective

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The

More information

Elements of statistics (MATH0487-1)

Elements of statistics (MATH0487-1) Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -

More information

Hypothesis testing - Steps

Hypothesis testing - Steps Hypothesis testing - Steps Steps to do a two-tailed test of the hypothesis that β 1 0: 1. Set up the hypotheses: H 0 : β 1 = 0 H a : β 1 0. 2. Compute the test statistic: t = b 1 0 Std. error of b 1 =

More information

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA We Can Early Learning Curriculum PreK Grades 8 12 INSIDE ALGEBRA, GRADES 8 12 CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA April 2016 www.voyagersopris.com Mathematical

More information

Simple Linear Regression

Simple Linear Regression STAT 101 Dr. Kari Lock Morgan Simple Linear Regression SECTIONS 9.3 Confidence and prediction intervals (9.3) Conditions for inference (9.1) Want More Stats??? If you have enjoyed learning how to analyze

More information

Elementary Statistics Sample Exam #3

Elementary Statistics Sample Exam #3 Elementary Statistics Sample Exam #3 Instructions. No books or telephones. Only the supplied calculators are allowed. The exam is worth 100 points. 1. A chi square goodness of fit test is considered to

More information

Copyright 2013 by Laura Schultz. All rights reserved. Page 1 of 7

Copyright 2013 by Laura Schultz. All rights reserved. Page 1 of 7 Using Your TI-83/84/89 Calculator: Linear Correlation and Regression Dr. Laura Schultz Statistics I This handout describes how to use your calculator for various linear correlation and regression applications.

More information

Scatter Plot, Correlation, and Regression on the TI-83/84

Scatter Plot, Correlation, and Regression on the TI-83/84 Scatter Plot, Correlation, and Regression on the TI-83/84 Summary: When you have a set of (x,y) data points and want to find the best equation to describe them, you are performing a regression. This page

More information

Worksheet A5: Slope Intercept Form

Worksheet A5: Slope Intercept Form Name Date Worksheet A5: Slope Intercept Form Find the Slope of each line below 1 3 Y - - - - - - - - - - Graph the lines containing the point below, then find their slopes from counting on the graph!.

More information

1. How different is the t distribution from the normal?

1. How different is the t distribution from the normal? Statistics 101 106 Lecture 7 (20 October 98) c David Pollard Page 1 Read M&M 7.1 and 7.2, ignoring starred parts. Reread M&M 3.2. The effects of estimated variances on normal approximations. t-distributions.

More information

WHAT IS A BETTER PREDICTOR OF ACADEMIC SUCCESS IN AN MBA PROGRAM: WORK EXPERIENCE OR THE GMAT?

WHAT IS A BETTER PREDICTOR OF ACADEMIC SUCCESS IN AN MBA PROGRAM: WORK EXPERIENCE OR THE GMAT? WHAT IS A BETTER PREDICTOR OF ACADEMIC SUCCESS IN AN MBA PROGRAM: WORK EXPERIENCE OR THE GMAT? Michael H. Deis, School of Business, Clayton State University, Morrow, Georgia 3060, (678)466-4541, MichaelDeis@clayton.edu

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

SPSS Guide: Regression Analysis

SPSS Guide: Regression Analysis SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar

More information

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,

More information

TIME SERIES ANALYSIS & FORECASTING

TIME SERIES ANALYSIS & FORECASTING CHAPTER 19 TIME SERIES ANALYSIS & FORECASTING Basic Concepts 1. Time Series Analysis BASIC CONCEPTS AND FORMULA The term Time Series means a set of observations concurring any activity against different

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers Correlation Greg C Elvers What Is Correlation? Correlation is a descriptive statistic that tells you if two variables are related to each other E.g. Is your related to how much you study? When two variables

More information

Getting Correct Results from PROC REG

Getting Correct Results from PROC REG Getting Correct Results from PROC REG Nathaniel Derby, Statis Pro Data Analytics, Seattle, WA ABSTRACT PROC REG, SAS s implementation of linear regression, is often used to fit a line without checking

More information

Least Squares Estimation

Least Squares Estimation Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David

More information

12: Analysis of Variance. Introduction

12: Analysis of Variance. Introduction 1: Analysis of Variance Introduction EDA Hypothesis Test Introduction In Chapter 8 and again in Chapter 11 we compared means from two independent groups. In this chapter we extend the procedure to consider

More information

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Section 14 Simple Linear Regression: Introduction to Least Squares Regression Slide 1 Section 14 Simple Linear Regression: Introduction to Least Squares Regression There are several different measures of statistical association used for understanding the quantitative relationship

More information

STAT 350 Practice Final Exam Solution (Spring 2015)

STAT 350 Practice Final Exam Solution (Spring 2015) PART 1: Multiple Choice Questions: 1) A study was conducted to compare five different training programs for improving endurance. Forty subjects were randomly divided into five groups of eight subjects

More information

Final Exam Practice Problem Answers

Final Exam Practice Problem Answers Final Exam Practice Problem Answers The following data set consists of data gathered from 77 popular breakfast cereals. The variables in the data set are as follows: Brand: The brand name of the cereal

More information

Pearson s Correlation

Pearson s Correlation Pearson s Correlation Correlation the degree to which two variables are associated (co-vary). Covariance may be either positive or negative. Its magnitude depends on the units of measurement. Assumes the

More information

10.1 Systems of Linear Equations: Substitution and Elimination

10.1 Systems of Linear Equations: Substitution and Elimination 726 CHAPTER 10 Systems of Equations and Inequalities 10.1 Systems of Linear Equations: Sustitution and Elimination PREPARING FOR THIS SECTION Before getting started, review the following: Linear Equations

More information

$2 4 40 + ( $1) = 40

$2 4 40 + ( $1) = 40 THE EXPECTED VALUE FOR THE SUM OF THE DRAWS In the game of Keno there are 80 balls, numbered 1 through 80. On each play, the casino chooses 20 balls at random without replacement. Suppose you bet on the

More information

Section 3 Part 1. Relationships between two numerical variables

Section 3 Part 1. Relationships between two numerical variables Section 3 Part 1 Relationships between two numerical variables 1 Relationship between two variables The summary statistics covered in the previous lessons are appropriate for describing a single variable.

More information

Online Appendix to Are Risk Preferences Stable Across Contexts? Evidence from Insurance Data

Online Appendix to Are Risk Preferences Stable Across Contexts? Evidence from Insurance Data Online Appendix to Are Risk Preferences Stable Across Contexts? Evidence from Insurance Data By LEVON BARSEGHYAN, JEFFREY PRINCE, AND JOSHUA C. TEITELBAUM I. Empty Test Intervals Here we discuss the conditions

More information

Premaster Statistics Tutorial 4 Full solutions

Premaster Statistics Tutorial 4 Full solutions Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for

More information

Linear Models in STATA and ANOVA

Linear Models in STATA and ANOVA Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples

More information

You buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week. Week 1 2 3 4 5 6 7 8 9

You buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week. Week 1 2 3 4 5 6 7 8 9 Warm Up: You buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week Week 1 2 3 4 5 6 7 8 9 Money Owed 900 800 700 600 500 400 300 200 100

More information

Homework #1 Solutions

Homework #1 Solutions Homework #1 Solutions Problems Section 1.1: 8, 10, 12, 14, 16 Section 1.2: 2, 8, 10, 12, 16, 24, 26 Extra Problems #1 and #2 1.1.8. Find f (5) if f (x) = 10x x 2. Solution: Setting x = 5, f (5) = 10(5)

More information

MTH 140 Statistics Videos

MTH 140 Statistics Videos MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative

More information

Linear Approximations ACADEMIC RESOURCE CENTER

Linear Approximations ACADEMIC RESOURCE CENTER Linear Approximations ACADEMIC RESOURCE CENTER Table of Contents Linear Function Linear Function or Not Real World Uses for Linear Equations Why Do We Use Linear Equations? Estimation with Linear Approximations

More information

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Basic Statistics and Data Analysis for Health Researchers from Foreign Countries Volkert Siersma siersma@sund.ku.dk The Research Unit for General Practice in Copenhagen Dias 1 Content Quantifying association

More information

Statistics 151 Practice Midterm 1 Mike Kowalski

Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and

More information