DIRECTIONS/SUGGESTIONS Textbook instructions for some exercises have been modified, or new parts added.

Similar documents
(More Practice With Trend Forecasts)

WEB APPENDIX. Calculating Beta Coefficients. b Beta Rise Run Y X

Chapter 23. Inferences for Regression

Scatter Plots with Error Bars

Coefficient of Determination

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Chapter 7: Simple linear regression Learning Objectives

NIKE Case Study Solutions

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Simple Linear Regression

EXCEL Tutorial: How to use EXCEL for Graphs and Calculations.

The Dummy s Guide to Data Analysis Using SPSS

Simple linear regression

DIRECTIONS. Exercises (SE) file posted on the Stats website, not the textbook itself. See How To Succeed With Stats Homework on Notebook page 7!

How To Run Statistical Tests in Excel

Regression Analysis: A Complete Example

SPSS Guide: Regression Analysis

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Data analysis and regression in Stata

Simple Regression Theory II 2010 Samuel L. Baker

Directions for using SPSS

2 Sample t-test (unequal sample sizes and unequal variances)

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Formula for linear models. Prediction, extrapolation, significance test against zero slope.

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Copyright 2013 by Laura Schultz. All rights reserved. Page 1 of 7

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

A Review of Cross Sectional Regression for Financial Data You should already know this material from previous study

Statistics 151 Practice Midterm 1 Mike Kowalski

Homework Solutions - Lecture 2

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

table to see that the probability is (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: = 1.

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

ABSORBENCY OF PAPER TOWELS

Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb

3.2 Measures of Spread

Using SPSS, Chapter 2: Descriptive Statistics

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

hp calculators HP 50g Trend Lines The STAT menu Trend Lines Practice predicting the future using trend lines

Dealing with Data in Excel 2010

Univariate Regression

Homework 11. Part 1. Name: Score: / null

Exercise 1.12 (Pg )

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

Lin s Concordance Correlation Coefficient

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

Predictor Coef StDev T P Constant X S = R-Sq = 0.0% R-Sq(adj) = 0.

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Scatter Plot, Correlation, and Regression on the TI-83/84

CALCULATIONS & STATISTICS

The importance of graphing the data: Anscombe s regression examples

Analysis of Variance ANOVA

INTRODUCTION TO MULTIPLE CORRELATION

Stats for Strategy Fall 2012 First-Discussion Handout: Stats Using Calculators and MINITAB

Example: Boats and Manatees

Using R for Linear Regression

2013 MBA Jump Start Program. Statistics Module Part 3

Forecasting in STATA: Tools and Tricks

11. Analysis of Case-control Studies Logistic Regression

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Statistics 2014 Scoring Guidelines

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Simple Linear Regression, Scatterplots, and Bivariate Correlation

Final Exam Practice Problem Answers

2. Linear regression with multiple regressors

Using Excel for Statistical Analysis

Simple Predictive Analytics Curtis Seare

Lets suppose we rolled a six-sided die 150 times and recorded the number of times each outcome (1-6) occured. The data is

2. Simple Linear Regression

SPSS Manual for Introductory Applied Statistics: A Variable Approach

When to use Excel. When NOT to use Excel 9/24/2014

Confidence Intervals for Cp

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

A Statistical Analysis of the Prices of. Personalised Number Plates in Britain

What is the difference between simple and compound interest and does it really matter?

Name: Date: Use the following to answer questions 2-3:

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques Page 1 of 11. EduPristine CMA - Part I

Module 3: Correlation and Covariance

Stepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection

Business Valuation Review

AP Statistics: Syllabus 1

Chapter 4 and 5 solutions

A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion

AP Physics 1 and 2 Lab Investigations

SPSS Explore procedure

Basic Graphing Functions for the TI-83 and TI-84

The problem with waiting time

Fairfield Public Schools

Moderation. Moderation

Section 1: Simple Linear Regression

UNIT 1: COLLECTING DATA

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.

Regression III: Advanced Methods

USE OF ARIMA TIME SERIES AND REGRESSORS TO FORECAST THE SALE OF ELECTRICITY

Transcription:

Stats for Strategy HOMEWORK 6 (Topic 8, Part 2) (revised Jan. 2017) DIRECTIONS/SUGGESTIONS Textbook instructions for some exercises have been modified, or new parts added. Use 5% significance when the textbook doesn t specify a significance level. Data files are available from the Stats website for some exercises. See How To Succeed With Stats Homework in the STATS GUIDE (Using an accounting system provides the best quiz and exam preparation.) The letters A, B, C, etc. represent distinct business applications. (Often several textbook exercises are grouped together for each such application.) A. Applying Population Regression Lines Exercise 10.3 (p. 532) Exercise 10.4 B. Research and Development Spending Exercise 10.5 (p. 534) Clarified directions for parts (a) (e): * Part (a): Enter data into a MINITAB worksheet, then make a scatterplot (first Graph menu option.) * Part (b): For the moment, ignore the textbook s request to add this line to your scatterplot. Instead enter the data into your calculator and use the calculator s regression function to find the least-squares equation. * Part (c): Look up the formula for regression sample variance s 2 on page 43 in the Notebook. Can you figure out how to do these hand calculations? * Part (d): The textbook is requesting the population regression model (with coefficients written as Greek letters) * Part (e): Don t worry about adding the point to your plot. Instead calculate the prediction error and give a reason why the error is relatively large. (continued next page) 1

Add more parts to Exercise 10.5: (f) Make a Fitted-Line Plot in MINITAB: Stat > Regression > Fitted Line Plot What value does the plot provide for regression standard deviation s? (Include all available decimal places in your answer.) Is the plot s reported value for s the same (except for roundoff error) as you calculated by hand in part (c)? (g) Find the standard deviation s y for Spending. Does regression on Years reduce the standard deviation for predicting Spending? Explain. (h) There s another way besides comparing the standard deviations s and s y to measure how well regression is working. How much of the variation of Spending within the data is explained by the variable Years (i.e., by the pattern over time)? C. T-Bills and Inflation Open the data file Inflation. rate from inflation: Get the full regression output for predicting the T-bill ( Stat > Regression > Regression > Fit Regression Model > OK ) The textbook exercises below sometimes refer to Excel regression output. Just refer to your MINITAB output instead. Exercise 10.6 (p. 539) Exercise 10.7 Tip: Briefly review pages 60 64 in the Notebook, including your answers to Example 2. Exercise 10.8 Exercise 10.29 (p. 548) Tip for part (c): Use Four Steps. Tip for part (d): Also interpret the answer for your client. Exercise 10.42 (p. 554) Go ahead and reproduce the MINITAB output shown at the bottom of page 554: Stat > Regression > Regression > Predict... > (Enter 3.7 for INFLATION) > OK Exercise 10.52 (p. 557) Tip: Use the output from MINITAB 17 instead of the textbook s output from MINITAB 16. 2

D. Earnings for Female Bank Employees Review Topic 8 Part 2 Example 7 on Notebook page 82. Open the data file Bank Wages. Reproduce the MINITAB Fitted Line Plot on page 82. Also reproduce the full regression output and calculate a prediction: 1. First generate the model in MINITAB: Stat > Regression > Regression > Fit Regression Model > OK 2. Then predict using the fitted model: Stat > Regression > Regression > Predict... > (Enter 125 for LOS) > OK (a) The first entry in the database refers to a woman who earns $389/week based on 94 months of service. Find the predicted wages and residual for this person. (b) Suppose you d like to predict weekly wages for four other female bank employees. These women s lengths of service are one month, 100 months, 200 months, and 400 months. For each, calculate the predicted wages. Which of these predictions are supported by the range of the data? Which are considered risky, and why? (c) The intercept β 0 of the population regression line measures the average starting wage of female bank employees (those with 0 months of experience.) Find and interpret a 90% confidence interval for β 0. (d) Notice that MINITAB predicts the weekly wage for a worker with 125 months experience as ŷ = $423.185. Also, the standard error of the fit is SE µ = $15.5530. Use a formula to calculate a 95% confidence interval for the mean wages of all bank workers who have 125 months experience, correct to the nearest cent. Is your calculated answer very close to MINITAB s answer? answers exactly the same? Why aren t the two (e) Using only the MINITAB output you made at the beginning (i.e. without re-running MINITAB), calculate a 90% CI for the mean wages of all bank workers who have 125 months experience. (f) Re-do the Fitted Line Plot with a special option: Stat > Regression > Fitted Line Plot > (Choose response = Wages, predictor = LOS) > Options > (Select Display confidence interval, Display prediction interval) > OK > OK (see questions for part (f) next page) 3

Answer the following questions for part (f): 1. Can you guess which color shows the confidence interval for mean wages for all employees? Which color shows the prediction interval? 2. Use the graph to eyeball a rough 95% estimate of mean weekly wages for all employees with 100 months of service. 3. Use the graph to eyeball a rough 95% estimate for the weekly wage of an individual employee with 200 months of service. E. Stocks and Bonds Exercise 10.34 (p. 549) Ignore textbook directions for parts (c) and (d). Instead answer the following: (c) What fact about the scatterplot explains why the relationship between bond flows and stock flows is not significant? (d) Use the regression to predict net cash flow into bonds in the year 2016 with 95% certainty if the net cash flow into stocks in 2016 is $100 billion. (e) Use the regression to estimate average net cash flow into bonds with 95% certainty for all years in which net cash flow into stocks is $200 billion. F. Computer Memory Exercise 10.39 (p. 550) (First enter these data into a MINITAB worksheet) Change instructions for part (a): Make a Fitted-Line Plot and a Residuals Plot: Stat > Regression > Fitted Line Plot...... > Graphs > Select Residuals versus fits > OK > OK Questions to answer for part (a): 1. What s the sample regression equation for predicting DRAM over time? 2. What does the Residuals Plot imply about using the regression of DRAM by Year? Add part (d): Predict DRAM for the year 2002 with 90% certainty. G. Blood Alcohol Content Exercise 10.38 (p. 550) Add part (c): Find and interpret a 90% confidence interval for the slope β 1. Exercise 10.55 (p. 557) (Notice that the textbook mistakenly refers to a confidence interval instead of a prediction interval for Steve.) 4

H. Predicting Water Quality Exercise 10.19 (p. 546) Ignore the book s directions for this exercise. Follow these directions instead: (a) Make a Fitted Line Plot. Also obtain the full MINITAB regression output in the Session Window. What is the sample regression equation to predict IBI? Exercise 10.20 Ignore the book s directions for this exercise. Follow these directions instead: (a) Make a Fitted Line Plot. Also obtain the full MINITAB regression output in the Session Window. What s the sample regression equation to predict IBI? (b) If you had to choose between the predictors Area and Forest to predict IBI, which of these two predictors would you choose? Explain your choice, based on a 10% significance level. (c) Suppose you wish to predict with 90% certainty the IBI for Stream A, whose watershed covers 50 square km and is 30% forested. Provide the answer from your chosen model. (d) Now suppose that you discover that a data-entry error has been made in Table 10.4 in the textbook! The correct Area corresponding to (IBI = 32) is actually 121, not the number 21 which is currently listed. Find the error in the MINITAB worksheet and make the appropriate data correction. Then redo the Fitted Line Plot and Regression for the predictor Area. What s the revised regression equation? The corrected data point (121, 32) is called an outlier. Can you explain why it has this name, based on the scatterplot? (Outliers can have a strong impact on regression results.) (e) Re-evaluate your earlier choice: Has the data correction changed your choice of best predictor variable? Why or why not? What s your 90% prediction for the IBI of Stream A now? (end of assignment) 5