Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots



Similar documents
Section 3 Part 1. Relationships between two numerical variables

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Statistics. Measurement. Scales of Measurement 7/18/2012

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

Descriptive Statistics

Correlation Coefficient The correlation coefficient is a summary statistic that describes the linear relationship between two numerical variables 2

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

The correlation coefficient

DATA COLLECTION AND ANALYSIS

We are often interested in the relationship between two variables. Do people with more years of full-time education earn higher salaries?

UNIVERSITY OF NAIROBI

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Chapter 7: Simple linear regression Learning Objectives

Correlation key concepts:

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Linear Models in STATA and ANOVA

II. DISTRIBUTIONS distribution normal distribution. standard scores

Using Excel for inferential statistics

Module 3: Correlation and Covariance

Analysing Questionnaires using Minitab (for SPSS queries contact -)

Types of Group Comparison Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Causal-Comparative Research 1

Additional sources Compilation of sources:

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Father s height (inches)

Elementary Statistics


Eight things you need to know about interpreting correlations:

Association Between Variables

Introduction to Quantitative Methods

Introduction to Regression and Data Analysis

WHAT IS A JOURNAL CLUB?

Univariate Regression

Basic Concepts in Research and Data Analysis

The Dummy s Guide to Data Analysis Using SPSS

CALCULATIONS & STATISTICS

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

Simple linear regression

Measurement and Metrics Fundamentals. SE 350 Software Process & Product Quality

Linear Regression. Chapter 5. Prediction via Regression Line Number of new birds and Percent returning. Least Squares

Part 2: Analysis of Relationship Between Two Variables

Statistical tests for SPSS

Shiken: JLT Testing & Evlution SIG Newsletter. 5 (3) October 2001 (pp )

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Statistics, Research, & SPSS: The Basics

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Correlational Research

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

The Basics of Regression Analysis. for TIPPS. Lehana Thabane. What does correlation measure? Correlation is a measure of strength, not causation!

Chapter 1: The Nature of Probability and Statistics

Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

MULTIPLE REGRESSION WITH CATEGORICAL DATA

CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA

Simple Predictive Analytics Curtis Seare

RESEARCH METHODS IN I/O PSYCHOLOGY

SPSS Explore procedure

TABLE OF CONTENTS. About Chi Squares What is a CHI SQUARE? Chi Squares Hypothesis Testing with Chi Squares... 2

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

Simple Linear Regression, Scatterplots, and Bivariate Correlation

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Chapter 7 Factor Analysis SPSS

UNDERSTANDING THE DEPENDENT-SAMPLES t TEST

You buy a TV for $1000 and pay it off with $100 every week. The table below shows the amount of money you sll owe every week. Week

Nursing Journal Toolkit: Critiquing a Quantitative Research Article

Introduction to Statistics Used in Nursing Research

Regression Analysis: A Complete Example

Econometrics Simple Linear Regression

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Data Mining Part 5. Prediction

Correlation. What Is Correlation? Perfect Correlation. Perfect Correlation. Greg C Elvers

Homework 11. Part 1. Name: Score: / null

Pearson s Correlation

Statistics for Sports Medicine

Introduction to Statistics and Quantitative Research Methods

When to Use a Particular Statistical Test

The Correlation Coefficient

The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Module 5: Statistical Analysis

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

SOCIOLOGY 7702 FALL, 2014 INTRODUCTION TO STATISTICS AND DATA ANALYSIS

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared.

Research Methods & Experimental Design

Moderator and Mediator Analysis

On the Practice of Dichotomization of Quantitative Variables

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

Simple Regression Theory II 2010 Samuel L. Baker

Survey Data Analysis. Qatar University. Dr. Kenneth M.Coleman - University of Michigan

Study Guide for the Final Exam

Sample Size and Power in Clinical Trials

Data Analysis, Research Study Design and the IRB

Lean Six Sigma Analyze Phase Introduction. TECH QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

Transcription:

Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship exists between two or more variables within a population (or a sample). The degree of relationships are expressed by correlation coefficients. Coefficients range from +1.00 to -1.00 Higher correlations (coefficients closer to +1.00 or -1.00) indicate stronger relationships. Positive correlations indicate that as the values associated with one variable go up, so do the values associated with the other. e.g., higher grades are associated with higher???. Negative correlations indicate that as the values associated with one variable go up, the values associated with the other go down e.g., higher grades are associated with lower???. 2 Correlational Research: Scatter Plots http://www.mste.uiuc.edu/courses/ci330ms/youtsey/scatterinfo.html 3 Descriptive Research 1

Correlational Research: Scatter Plots http://www.mste.uiuc.edu/courses/ci330ms/youtsey/scatterinfo.html 4 Portfolio Activity # 6: Mini-proposal 2 Briefly describe a correlational research project relevant to one of your identified research topics. Small group discussion 5 A researcher found that there was a +0.85 correlation between the variable of height and Mental Age among a random sample of 100 individuals. From these data the researcher determines that taller people are smarter than shorter people. What do you think? Interpret this finding. 6 Descriptive Research 2

Mental Age Chronological Age Height 7 % of Homes with Firearms Isolation Suicide Rate 8 Even a perfect correlation does not necessarily imply a causal connection between variables. For example, in a recent CDE study, the number of support staff in school districts was positively correlated with poor attendance. An educational research example: Attention span is highly correlated with reading comprehension test scores. But both are also correlated with basic reading skill. The correlation may be the result of a mutual association with these other variables. 9 Descriptive Research 3

Attention Deficits Basic Reading Skill Reading Comprehension 10 A statistically significant relationship (correlation) is a necessary, but not sufficient condition when determining causation. Must be able to document that the causal variable occurred first and that all other factors are accounted for. Experiments are typically necessary to determine causation. 11 Types of Correlational Studies Descriptive Used to simply describe relationships. Often a precursor to the experimental study. Variables suggested to be related would be the subject of further study. Also helps to identify variables that need to be controlled during an experiment. e.g., basic reading skill in a study of the effects of ADHD on reading comprehension. Hypotheses, if offered, are often nondirectional. Predictive Hypotheses are directional 12 Descriptive Research 4

1. Problem selection Variables to be correlated should be selected based on a) A logical relationship b) Theoretical grounds c) Personal experience What are some examples of problems (or questions) that are consistent with these three bases for correlational research? Correlational treasure hunts (AKA the shotgun approach ) are strongly discouraged. What does r =.50, p =.05 mean? r = strength of the relationship (actually it is r 2 or 25% of variance) p = significance of the relationship (how unlikely a given r value will occur given NO relationship in the population, 5% chance of an r of.50 if there is no real relationship between variable in the population) 13 2. Select/Obtain Participants Sample from the population so as to maximize generalizability. What are examples of preferred sampling techniques? At least 30 participants. If you are dealing with a sample and the correlation between two variables is r =.05, p =.50, what would you say about the relationship? If you are dealing with a population and the correlation between two variables is r =.05, what would you say about the relationship? 14 3. Select Measures How to quantify the variables under study. How might your quantify ADHD, Reading Achievement, phonological processing? If the measures lack reliability a larger sample will be required. 4. Specify Procedures How is data assessing the variables are obtained and correlated? 15 Descriptive Research 5

5. Conduct Data Analysis Statistically significant correlations p values (p. 582 of the text). Whether the obtained coefficient is really different from zero or- the probability that the correlation represents a true relationship or a chance occurrence. In larger samples, lower correlations are required to reach statistical significance. Why? Tests of significance are not required if the entire population has been assessed. Why? What do the levels of significance (e.g., p =.1, p =.05, p =.01, p =.001) mean? Is a significant relationship necessarily an important relationship? Compare these two results: (1) ADHD correlates with Reading (r =.75, p =.05); (2) ADHD correlates with Reading (r =.25, p =.001) 16 5. Conduct Data Analysis (continued) Determining statistically significant correlations http://www.danielsoper.com/statcalc3/calc.aspx?id=44 If you are using a Table, the number of degrees of freedom is two less than the number of pairs. 17 5. Conduct Data Analysis (continued) Correlation s significance vs. its strength. Just because a correlation is significant does not mean it is high enough to reflect an important relationship. Variance (the correlation coefficient squared) When two or more variables are correlated, each variable will have a range of scores. Each variable will have some variance; that is not everyone will get the same score. Common or shared variance indicates the extent to which variables vary in a systematic way (pp. 314-315). r 2 is the amount of variance explained (or accounted for) by the correlation coefficient. Determine the amount of variance accounted for by the following r values: 1.0,.95,.75,.50,.25 http://www.calculator.org/jcalc98.html 18 Descriptive Research 6

Relationship Studies Often used to study complex variables before beginning an experiment. To identify variables (other than the independent variable) that correlate with the dependent measure. When relationships are identified these variables are then controlled for. For example, before studying how a given IV (like ADHD symptom severity) influences reading comprehension you would want to identify other variables (such as word reading, word attack, vocabulary, background knowledge) that also affect reading comprehension and then control for them. How would this be done? 19 Relationship Studies Why is it important to be selective when identifying variables to be correlated? What problems might arise if you used a shotgun approach and obtained correlations among 100 randomly selected variables and you used a p value of.05? Chances are that 5 of the obtained coefficients will not reflect a true relationship greater than zero. 20 Prediction Studies Regression Analysis A method of analyzing the variability of a criterion variable by examining information available on one or more predictor variables. When only one predictor variable is used, the analysis is referred to as simple regression. When more than one predictor is used, the analysis is referred to as multiple regression 21 Descriptive Research 7

Simple Regression A college football coach wishes to use the scores on one variable to predict the scores on another variable. He wishes to determine the best prediction equation for the grade-point averages of potential freshmen recruits. SAT test scores for the current group of recruits as well as their grade point averages are available. From the available SAT and GPA scores for this year s class, the prediction equation for next year s class can be calculated. 22 Simple Regression A teacher wishes to determine the effects of hours of study (the predictor variable) on vocabulary test performance (the criterion variable). When vocabulary test means associated with different amounts of study differ from each other and lie on a straight line, it is said that there is a simple linear regression of vocabulary test performance on hours of study 23 Hours Score A Score B 1 3 1 1 5 2 1 6 3 1 9 0 2 4 3 2 6 4 2 7 3 2 10 3 3 4 4 3 6 5 3 8 6 3 10 4 4 5 5 4 7 6 4 9 7 4 12 5 5 6 7 5 7 8 5 10 9 5 12 6 Hours of study & vocabulary tests Excel Data Sheet pp. 609-610 of text for SPSS screen shots 24 Descriptive Research 8

Multiple Regression A GATE program administrator wishes to determine the best prediction equation for rapid learning among a group of ELL elementary students. The available predictor variables are: 1. SAT-9 scores (X 1 ) 2. Changes in scores on the English LAS (X 2 ) 3. Scores on a non-verbal reasoning test (X 3 ) 4. Primary language vocabulary test scores (X 4 ) Y The criterion variable (the one the administrator wishes to predict) are achievement test gains made between 2nd to 3rd grades (Y observed ). From the regression of the predictor variables (X 1, 2, 3, and 4 ) on the criterion variable (Y observed ), a prediction equation can be developed. 25 Types of Correlation Coefficients Variable 1 Continuous Rank Dichotomous Pearson r Continuous Correlation ratio Variable 2 Spearmans rho Rank Kendall s tau Biserial Tetrachoric Dichotomous Point Biserial Phi coefficient Scales of Measurement: Ratio or Interval (Continuous, e.g., scores on a test); Ordinal (Rank, e.g., class rankings); and Nominal or Categorical (Dichotomous, e.g., gender) Notes: Correlation ratio is used for nonlinear relationships (i.e., curvilinear correlations) Biserial and Tetrachoric are used when the dichotomy is artificial Point Biserial and Phi coefficient are used when the dichotomy is genuine or true. 26 Types of Statistical Correlations Partial Correlations If you have three variables and you wish to know how highly two of them are related when the mutual relationships with the third variable are taken out ( partialed out ), use partial correlation. Height r =.835 (A spurious correlation?) Age M.A. After partialing out Age this correlation dropped to.219. This is an example of statistical control. What might be another variable to partial out? What might be another way to control for the Age? 27 Descriptive Research 9

Types of Statistical Correlations Multiple Correlations If you have three variables and you wish to know how highly two of them, taken together, are related to the third, use multiple correlation. Reading Speed + Reading Achievement Phonological awareness In a typical multiple-correlation study, the first set of numbers represents measures of a criterion variable (e.g., reading achievement) and the other two sets of numbers are measures of predictors (e.g., reading speed and sound awareness). The multiple-correlation coefficient between the criterion variable and the two predictor variables will give an indication of the degree to which the two predictors, taken together, actually predict the criterion Questions: What is the difference between prediction and 28 causation? Next Meeting: After Spring Break Causal Comparative and Single Subject Research Read Educational Research Chapters 9 & 11. Portfolio Element #7 Due: Mini-proposal 3 29 Portfolio Activity # 7 Mini-proposal 3 Students will briefly describe a causal comparative research project relevant to one of their identified research topics. Chapter 9 provides guidance necessary to complete this mini-proposal. 30 Descriptive Research 10