Example: Boats and Manatees



Similar documents
Univariate Regression

Section 1.5 Linear Models

Chapter 23. Inferences for Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Correlation key concepts:

Part 2: Analysis of Relationship Between Two Variables

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Slope-Intercept Equation. Example

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Simple linear regression

TIME SERIES ANALYSIS & FORECASTING

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

2. Simple Linear Regression

The importance of graphing the data: Anscombe s regression examples

ch12 practice test SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.

PEARSON R CORRELATION COEFFICIENT

Final Exam Practice Problem Answers

Homework 8 Solutions

MTH 140 Statistics Videos

Elements of statistics (MATH0487-1)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Simple Regression Theory II 2010 Samuel L. Baker

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

SPSS Guide: Regression Analysis

Fairfield Public Schools

table to see that the probability is (b) What is the probability that x is between 16 and 60? The z-scores for 16 and 60 are: = 1.

Regression Analysis: A Complete Example

Simple Linear Regression Inference

Regression and Correlation

Regression III: Advanced Methods

Section 3 Part 1. Relationships between two numerical variables

What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of y = mx + b.

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Econometrics Simple Linear Regression

Hypothesis testing - Steps

Elementary Statistics Sample Exam #3

Stat 412/512 CASE INFLUENCE STATISTICS. Charlotte Wickham. stat512.cwick.co.nz. Feb

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Linear Equations. Find the domain and the range of the following set. {(4,5), (7,8), (-1,3), (3,3), (2,-3)}

Coordinate Plane, Slope, and Lines Long-Term Memory Review Review 1

Answer Key Building Polynomial Functions

5. Linear Regression

Chapter 7: Simple linear regression Learning Objectives

Section 1: Simple Linear Regression

Correlation and Regression

Exercise 1.12 (Pg )

Factors affecting online sales

5. Multiple regression

DATA INTERPRETATION AND STATISTICS

Premaster Statistics Tutorial 4 Full solutions

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Study Guide for the Final Exam

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

Scatter Plot, Correlation, and Regression on the TI-83/84

PRIMARY CONTENT MODULE Algebra I -Linear Equations & Inequalities T-71. Applications. F = mc + b.

Basic Statistics and Data Analysis for Health Researchers from Foreign Countries

hp calculators HP 50g Trend Lines The STAT menu Trend Lines Practice predicting the future using trend lines

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Vocabulary Words and Definitions for Algebra

STT 200 LECTURE 1, SECTION 2,4 RECITATION 7 (10/16/2012)

STATISTICS AND STANDARD DEVIATION

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

1. The parameters to be estimated in the simple linear regression model Y=α+βx+ε ε~n(0,σ) are: a) α, β, σ b) α, β, ε c) a, b, s d) ε, 0, σ

ELEMENTARY STATISTICS

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Copyright 2007 by Laura Schultz. All rights reserved. Page 1 of 5

Foundations for Functions

2. What is the general linear model to be used to model linear trend? (Write out the model) = or

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Algebraic expressions are a combination of numbers and variables. Here are examples of some basic algebraic expressions.

Indiana State Core Curriculum Standards updated 2009 Algebra I

Pearson's Correlation Tests

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Introduction to Regression and Data Analysis

Data Mining Part 5. Prediction

Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Lets suppose we rolled a six-sided die 150 times and recorded the number of times each outcome (1-6) occured. The data is

August 2012 EXAMINATIONS Solution Part I

Session 7 Bivariate Data and Analysis

Point Biserial Correlation Tests

MSLC Workshop Series Math Workshop: Polynomial & Rational Functions

A Primer on Forecasting Business Performance

Multiple Linear Regression

Section A. Index. Section A. Planning, Budgeting and Forecasting Section A.2 Forecasting techniques Page 1 of 11. EduPristine CMA - Part I

Geostatistics Exploratory Analysis

Example G Cost of construction of nuclear power plants

Correlation and Simple Linear Regression

Module 5: Statistical Analysis

Additional sources Compilation of sources:

MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis

Transcription:

Figure 9-6 Example: Boats and Manatees Slide 1 Given the sample data in Table 9-1, find the value of the linear correlation coefficient r, then refer to Table A-6 to determine whether there is a significant linear correlation between the number of registered boats and the number of manatees killed by boats. Use method 2. Using the same procedure previously illustrated, we find that r = 0.922. Method 2: Referring to Table A-6, we conclude that there is a significant linear correlation between number of registered boats and number of manatee deaths from boats.

Use method 1 Slide 2 t = r 1 r 2 n 2 0.922 t = = 6.735 1 0.922 2 10 2

Using either of the two methods, we find Slide 3 Method 1: 6.735 > 2.306. Method 2: 0.922 > 0.632; That is, the test statistic falls in the critical region. Conclusion: We therefore reject the null hypothesis. There is sufficient evidence at significance level 0.05 to support the claim of a linear correlation between the number of registered boats and the number of manatee deaths from boats.

FIGURE 9-4 Testing for a Linear Correlation Slide 4

Interpreting r: Explained Variation Slide 5 The value of r 2 is the proportion of the variation in y that is explained by the linear relationship between x and y. Manatee example: With r = 0.922, we get r 2 = 0.850. We conclude that 0.850 (or about 85%) of the variation in manatee deaths can be explained by the linear relationship between the number of boat registrations and the number of manatee deaths from boats. This implies that 15% of the variation of manatee deaths cannot be explained by the number of boat registrations.

9.3 Regression Slide 6 Regression Equation Variables: X =independent variable, predictor variable or explanatory variable Y= dependent variable or response variable. A straight line (linear relationship): Y = a X + b, where a is the y-intercept and b is the slope.

For each x-value, Assumptions Slide 7 Normality: Y is a random variable having a normal (bell-shaped) distribution. Homogeneity: All of these y distributions have the same variance. Linearity: The distribution of y-values has a mean that lies on the regression line. (Results are not seriously affected if departures from normal distributions and equal variances are not too extreme.)

Regression Equation Slide 8 Given paired sample data (x,y) of size n satisfying population parameter equation below. Population Parameter Sample Statistic y-intercept of regression equation β 0 b 0 Slope of regression equation β 1 b 1 Equation of the regression line y = β 0 + β 1 x ^ y = b 0 + b 1 x where b 0 estimates β 0 and b 1 estimates β 1. Regression line Q: How to get b 0 and b 1?

Formula for b 0 and b 1 Slide 9 Formula 9-2 b 1 = n(σxy) (Σx) (Σy) n(σx 2 ) (Σx) 2 (slope) Formula 9-3 b 0 = y b 1 x (y-intercept) calculators or computers can compute these values

Slide 10 The regression line fits the sample points best.

Data x y n = 4 Σx = 10 Σy = 20 Σx 2 = 36 Σy 2 = 120 Σxy = 48 b 1 = b 1 = Calculating the Regression Equation 1 2 8 44 1 8 3 6 5 4 n(σxy) (Σx) (Σy) n(σx 2 ) (Σx) 2 4(48) (10) (20) 4(36) (10) 2 b 1 = = 0.181818 The estimated equation of the regression line is: Slide 11 In Section 9-2, we used these values to find that the linear correlation coefficient of r = 0.135. Use this sample to find the regression equation. b 0 = y b x 1 = 5- (-.181818)(2.5) = 5.45 yˆ = 5.45. 182x

Example: Boats and Manatees Slide 12

Example: Boats and Manatees Slide 13 Given the sample data in Table 9-1, find the regression equation. Using the same procedure as in the previous example, we find that b 1 = 2.27 and b 0 = 113 or computer, the estimated regression equation is: ^ y = 113 + 2.27x

Predictions Slide 14 In predicting a value of y based on some given value of x... 1. If there is not a significant linear correlation, the best predicted y-value is y. 2. If there is a significant linear correlation, the best predicted y-value is found by substituting the x-value into the regression equation.

Slide 15 Figure 9-8 Predicting the Value of a Variable

Revisit Boats and Manatees Example Slide 16 The best regression equation is y = 113 + 2.27x. ^ Assume that in 2001 there were 850,000 registered boats. Because Table 9-1 lists the numbers of registered boats in tens of thousands, this means that for 2001 we have x = 85. Q: Given that x = 85, find the best predicted value of y, the number of manatee deaths from boats. We do have a significant linear correlation (with r = 0.922).

Example: Boats and Manatees Slide 17 y ^ = 113 + 2.27x 113 + 2.27(85) = 80.0 The predicted number of manatee deaths is 80.0. The actual number of manatee deaths in 2001 was 82, so the predicted value of 80.0 is quite close.

Guidelines for Using The Regression Equation Slide 18 1. If there is no significant linear correlation, don t use the regression equation to make predictions. 2. When using the regression equation for predictions, stay within the scope of the available sample data. 3. A regression equation based on old data is not necessarily valid now. 4. Don t make predictions about a population that is different from the population from which the sample data was drawn.

Definitions Slide 19 Marginal Change: The marginal change is the amount that a variable changes when the other variable changes by exactly one unit. Outlier: An outlier is a point lying far away from the other data points. Influential Points: An influential point strongly affects the graph of the regression line.

Residuals and the Slide 20 Residual Least-Squares Property Definitions ^ for a sample of paired (x, y) data, the difference (y - y) between an observed sample y-value and the value of y, ^ which is the value of y that is predicted by using the regression equation. Least-Squares Property A straight line satisfies this property if the sum of the squares of the residuals is the smallest sum possible.

Residuals and the Slide 21 Least-Squares Property x 1 2 4 5 y 4 24 8 32 ^y = 5 + 4x Figure 9-9