PEARSON R CORRELATION COEFFICIENT



Similar documents
Example: Boats and Manatees

Linear Equations. Find the domain and the range of the following set. {(4,5), (7,8), (-1,3), (3,3), (2,-3)}

2.5 Zeros of a Polynomial Functions

Procedure for Graphing Polynomial Functions

Introduction Assignment

Summary of important mathematical operations and formulas (from first tutorial):

Using Excel for inferential statistics

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Statistical skills example sheet: Spearman s Rank

Correlation key concepts:

Calculator Notes for the TI-Nspire and TI-Nspire CAS

Calibration and Linear Regression Analysis: A Self-Guided Tutorial

MSLC Workshop Series Math Workshop: Polynomial & Rational Functions

AP Physics 1 and 2 Lab Investigations

FREE FALL. Introduction. Reference Young and Freedman, University Physics, 12 th Edition: Chapter 2, section 2.5

Zero: If P is a polynomial and if c is a number such that P (c) = 0 then c is a zero of P.

One-Way ANOVA using SPSS SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate

Example 1. Rise 4. Run Our Solution

Straightening Data in a Scatterplot Selecting a Good Re-Expression Model

Question 1a of 14 ( 2 Identifying the roots of a polynomial and their importance )

Chapter 7: Simple linear regression Learning Objectives

The correlation coefficient

Plot the following two points on a graph and draw the line that passes through those two points. Find the rise, run and slope of that line.

POLYNOMIAL AND MULTIPLE REGRESSION. Polynomial regression used to fit nonlinear (e.g. curvilinear) data into a least squares linear regression model.

Normality Testing in Excel

Univariate Regression

TI-83/84 Plus Graphing Calculator Worksheet #2

Calculator Notes for the TI-Nspire and TI-Nspire CAS

Graphing Parabolas With Microsoft Excel

Indiana State Core Curriculum Standards updated 2009 Algebra I

POLYNOMIAL FUNCTIONS

Mathematics Online Instructional Materials Correlation to the 2009 Algebra I Standards of Learning and Curriculum Framework

Module 3: Correlation and Covariance

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Wentzville School District Algebra 1: Unit 8 Stage 1 Desired Results

The Dummy s Guide to Data Analysis Using SPSS

Basic Graphing Functions for the TI-83 and TI-84

containing Kendall correlations; and the OUTH = option will create a data set containing Hoeffding statistics.

What are the place values to the left of the decimal point and their associated powers of ten?

Graphing Rational Functions

Directions for using SPSS

Bar Graphs and Dot Plots

Elements of a graph. Click on the links below to jump directly to the relevant section

Notes for EER #4 Graph transformations (vertical & horizontal shifts, vertical stretching & compression, and reflections) of basic functions.

Module 5: Statistical Analysis

Biggar High School Mathematics Department. National 5 Learning Intentions & Success Criteria: Assessing My Progress

Factoring Polynomials and Solving Quadratic Equations

CORRELATION ANALYSIS

CALCULATIONS & STATISTICS

Algebra I Vocabulary Cards

Coordinate Plane, Slope, and Lines Long-Term Memory Review Review 1

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

3.2. Solving quadratic equations. Introduction. Prerequisites. Learning Outcomes. Learning Style

SPSS Guide: Regression Analysis

Brunswick High School has reinstated a summer math curriculum for students Algebra 1, Geometry, and Algebra 2 for the school year.

2.5 Transformations of Functions

SIMPLE LINEAR CORRELATION. r can range from -1 to 1, and is independent of units of measurement. Correlation can be done on two dependent variables.

Section 1.5 Linear Models

Using Excel (Microsoft Office 2007 Version) for Graphical Analysis of Data


CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Calculate Highest Common Factors(HCFs) & Least Common Multiples(LCMs) NA1

FACTORING QUADRATICS and 8.1.2

Definition 8.1 Two inequalities are equivalent if they have the same solution set. Add or Subtract the same value on both sides of the inequality.

Hypothesis testing - Steps

This is a square root. The number under the radical is 9. (An asterisk * means multiply.)

What does the number m in y = mx + b measure? To find out, suppose (x 1, y 1 ) and (x 2, y 2 ) are two points on the graph of y = mx + b.

Exponents, Radicals, and Scientific Notation

Academic Support Center. Using the TI-83/84+ Graphing Calculator PART II

Section 5.0A Factoring Part 1

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Absorbance Spectrophotometry: Analysis of FD&C Red Food Dye #40 Calibration Curve Procedure

Regression III: Advanced Methods

Alum Rock Elementary Union School District Algebra I Study Guide for Benchmark III

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Testing Research and Statistical Hypotheses

M Polynomial Functions 1

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Solving Quadratic Equations

Mathematics Georgia Performance Standards

Section 1.1 Linear Equations: Slope and Equations of Lines

Method To Solve Linear, Polynomial, or Absolute Value Inequalities:

Lesson 9.1 Solving Quadratic Equations

Scientific Method. 2. Design Study. 1. Ask Question. Questionnaire. Descriptive Research Study. 6: Share Findings. 1: Ask Question.

1.3 Algebraic Expressions

Association Between Variables

Linear Equations. 5- Day Lesson Plan Unit: Linear Equations Grade Level: Grade 9 Time Span: 50 minute class periods By: Richard Weber

Chapter 7 - Roots, Radicals, and Complex Numbers

Reliability Overview

In this section, you will develop a method to change a quadratic equation written as a sum into its product form (also called its factored form).

Excel Tutorial. Bio 150B Excel Tutorial 1

0 Introduction to Data Analysis Using an Excel Spreadsheet

2. Creating Bar Graphs with Excel 2007

Updates to Graphing with Excel

ALGEBRA 2: 4.1 Graph Quadratic Functions in Standard Form

Polynomial and Rational Functions

II. DISTRIBUTIONS distribution normal distribution. standard scores

10.1. Solving Quadratic Equations. Investigation: Rocket Science CONDENSED

2.3. Finding polynomial functions. An Introduction:

Comparing Nested Models

Transcription:

PEARSON R CORRELATION COEFFICIENT Introduction: Sometimes in scientific data, it appears that two variables are connected in such a way that when one variable changes, the other variable changes also. This connection is called a correlation. Examples of this type of correlation include: (1) in deer populations, large males seem to have more successful matings; and (2) larger numbers of birds seem to nest in areas with dense vegetation. Scientists measure the strength of a relationship between two variables by calculating a correlation coefficient. The value of the correlation coefficient indicates to what extent the change found in one variable relates to change in another. There are several types of correlation coefficients, but the one that is most widely used is called the Pearson Product-Moment Correlation Coefficient, or simply, the Pearson r. Student Procedure Example: Your students have done some classroom research on amphibian species found in your area and have discovered that the red-backed salamander uses fallen logs and debris on the forest floor for their home. During their earlier census of their Biodiversity Plot, they have noticed that some quadrats have many fallen logs whereas other quadrats have few or none. They expect that they would find more red-backed salamanders in those quadrats with many fallen logs and design an experiment to test this hypothesis. This experiment measures: (1) the number of fallen logs in each quadrat; and (2) the number of red-backed salamanders in each quadrat. This is a table of the data your class has collected: SA 3.1

Quadrat Num ber # Fallen Logs # Salam anders 1 4 3 2 6 2 3 2 0 4 3 1 5 7 3 6 1 0 7 0 0 8 2 0 9 0 0 10 3 1 11 2 1 12 4 2 13 0 0 14 2 1 15 2 0 16 1 0 17 3 1 18 5 3 19 3 1 20 2 2 21 2 1 22 0 0 23 0 0 24 2 0 25 1 0 Step 1: Graphing a the data Graph your data by computer, or by hand, by assigning the number of fallen logs (the second column) as the X-axis and the number of salamanders (the last column) as the Y-axis. For example, in Quadrat 1, the X-value would be 4 and the Y-value would be 3. The results, when we plot all 25 points on the graph, look like this: RELATIONSHIP OF FALLEN LOGS AND SALAMANDERS 3.5 3 Number of Salamanders 2.5 2 1.5 1 0.5 SA 3.2 0 0 1 2 3 4 5 6 7 8 Number of Fallen Logs

Looking at this graph, there seems to be a positive relationship between the number of fallen logs and the number of salamanders. In other words, it appears that when the number of fallen logs increases, the number of Red-Backed Salamanders also increases. Some things to remember about the Pearson r correlation: The lowest value that the Pearson r can have is r = 0.00. This means there is ZERO correlation, and would indicate that X and Y are not related to one another. The highest value that the Pearson r can have is r = 1.00. This indicates a PERFECT correlation and would indicate that X and Y are completely related to one another in the sample. Pearson r values can be either positive or negative. A positive value indicates that increases in X correspond to increases in Y. A negative value indicates that increases in one variable are associated with decreases in the other variable. The following graphs illustrate some of the various types of correlations possible: a. This is an example of a perfect, positive correlation, in that the data shows no deviation from a straight line. b. This is an example of a perfect, negative correlation, in that the data shows no deviation from a straight line. c. This is an example of a high, positive correlation. Since the data shows some variability, a perfect prediction cannot be made. d. This is an example of a high, positive correlation. Since the data shows some variability, a perfect prediction cannot be made. e. This shows a low correlation. Although predictions could be made, and those predictions would be slightly better than chance, estimates would still be imprecise. f. This figure shows a zero correlation. Prediction would be no better than chance. SA 3.3

Step 3: Calculating the Pearson r Correlation Coefficient The graph below was produced by Microsoft Excel (charting function) which calculated a correlation coefficient from the data in our example. The graph shows a trend indicating an increase in salamanders where there are more fallen logs present. Note, however, that the value calculated by this program is the Pearson r value squared. You must take the square root of this figure to give the Pearson r value. From the graph: R 2 = 0.72; 0.85. Because 0.85 is close to 1.0 (the maximum value for the Pearson r), this demonstrates a strong, positive correlation. RELATIONSHIP OF FALLEN LOGS AND SALAMANDERS 3.5 3 R 2 = 0.7175 0.85 Number of Salamanders 2.5 2 1.5 1 0.5 0-0.5 0 1 2 3 4 5 6 7 8 Number of Fallen Logs If not using the Excel Software, or other graphing program, you can calculate the Pearson r by using the following formula: FORMULA FOR CALCULATING THE PEARSON R CORRELATION COEFFICIENT N( XY) ( X) ( Y) N( X 2 ) ( X) 2 ( ) Y [ ][ N ( )] 2 Y2 SA 3.4

This formula looks complicated, but can be simplified by breaking it into its separate components. Using your original data, create the following table: Quadrat Number (N) # Fallen Logs (X) # Salamanders (Y) X 2 Y 2 XY 1 4 3 16 9 12 2 6 2 36 4 12 3 2 0 4 0 0 4 3 1 9 1 3 5 7 3 49 9 21 6 1 0 1 0 0 7 0 0 0 0 0 8 2 0 4 0 0 9 0 0 0 0 0 10 3 1 9 1 3 11 2 1 4 1 2 12 4 2 16 4 8 13 0 0 0 0 0 14 2 1 4 1 2 15 2 0 4 0 0 16 1 0 1 0 0 17 3 1 9 1 3 18 5 3 25 9 15 19 3 1 9 1 3 20 2 2 4 4 4 21 2 1 4 1 2 22 0 0 0 0 0 23 0 0 0 0 0 24 2 0 4 0 0 25 1 0 1 0 0 ΣN = 25 ΣX = 25 ΣY = 25 ΣX 2 = 25 ΣY 2 = 25 ΣXY = 25 Using the values from the new table, complete the Pearson r formula: N( XY) ( X) ( Y) N( X 2 ) ( X) 2 ( ) Y [ ][ N ( )] 2 Y2 SA 3.5

The numerator, or top of the formula, looks like this once we plug in all the numbers: ( 25) ( 90) ( 57) ( 22) [ 25( 213) ( 57) 2 ] 25( 46) 22 [ ( ) 2 ] 2250 1254 [ 5325 3249]1150 484 [ ] 996 [ 2076]666 996 1382616 996 1175.85 0.8471 [ ] Again, this Pearson r correlation coefficient (being extremely close to the maximum value 1.0) demonstrates a strong positive correlation between the number of fallen logs and the number of salamanders. SA 3.6

Step 4: Determine if your calculations have statistical significance Statistical Analysis - 3 You must determine whether or not your calculations have statistical significance. To do this you must determine the critical value for your Pearson r correllation coefficient by using the following table: So for our example: 1. Calculate the degrees of freedom (DF) by subtracting the 2 from the number of comparisons you are making (DF = N - 2) In our case, we are sampling fallen logs and salamanders in 25 quadrats (N = 25) DF = 25-2 = 23 2. Find your DF on the table below and find the critical value allowed. In our case, the nearest DF listed is 25 with a critical value of 0.3233. Our calculated Pearson r correlation coefficient is 0.8471. 3. The calculated figure is greater than the critical value from the table; our findings have statistical significance. Therefore, we can assume that our hypothesis is true and that there is a strong positive correlation betwen the number of fallen logs and the number of salamanders and this correlation is not due to chance. SA 3.7

CRITICAL VALUES FOR THE PEARSON R CORRELLATION COEFFICIENT DF Critical (N - 2) Value (5% certainty) 1.98769 2.90000 3.8054 4.7293 5.6694 6.6215 7.5822 8.5494 9.5214 10.4973 11.4762 12.4575 13.4409 14.4259 15.4124 16.4000 17.3887 18.3783 19.3687 20.3598 25.3233 30.2960 35.2746 40.2573 45.2428 50.2306 60.2108 70.1954 80.1829 90.1726 100.1638 SA 3.8