When to Use a Particular Statistical Test



Similar documents
Descriptive Statistics

Additional sources Compilation of sources:

Introduction to Statistics and Quantitative Research Methods

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

II. DISTRIBUTIONS distribution normal distribution. standard scores

Introduction to Quantitative Methods

Multivariate Analysis. Overview

Statistics Review PSY379

Statistics. Measurement. Scales of Measurement 7/18/2012

Inferential Statistics. What are they? When would you use them?

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Overview Classes Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Working with data: Data analyses April 8, 2014

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

MASTER COURSE SYLLABUS-PROTOTYPE PSYCHOLOGY 2317 STATISTICAL METHODS FOR THE BEHAVIORAL SCIENCES

Module 3: Correlation and Covariance

11. Analysis of Case-control Studies Logistic Regression

UNIVERSITY OF NAIROBI

Data Analysis: Analyzing Data - Inferential Statistics

Chapter Eight: Quantitative Methods

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

research/scientific includes the following: statistical hypotheses: you have a null and alternative you accept one and reject the other

DATA ANALYSIS. QEM Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. Howard University

Moderation. Moderation

Predicting Successful Completion of the Nursing Program: An Analysis of Prerequisites and Demographic Variables

The Dummy s Guide to Data Analysis Using SPSS

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Elements of statistics (MATH0487-1)

10. Analysis of Longitudinal Studies Repeat-measures analysis

Statistics Graduate Courses

Multiple Regression: What Is It?

January 26, 2009 The Faculty Center for Teaching and Learning

Simple Linear Regression

DISCRIMINANT FUNCTION ANALYSIS (DA)

Multivariate Analysis of Variance (MANOVA)

Survey Data Analysis. Qatar University. Dr. Kenneth M.Coleman - University of Michigan

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Elementary Statistics Sample Exam #3

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

STA-201-TE. 5. Measures of relationship: correlation (5%) Correlation coefficient; Pearson r; correlation and causation; proportion of common variance

UNIVERSITY of MASSACHUSETTS DARTMOUTH Charlton College of Business Decision and Information Sciences Fall 2010

End User Satisfaction With a Food Manufacturing ERP

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Description. Textbook. Grading. Objective

DATA COLLECTION AND ANALYSIS

Basic Concepts in Research and Data Analysis

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Fairfield Public Schools

Introduction to Statistics Used in Nursing Research

How To Understand Multivariate Models

Multivariate Logistic Regression

Data Analysis in SPSS. February 21, If you wish to cite the contents of this document, the APA reference for them would be

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Univariate Regression

Chapter 5 Analysis of variance SPSS Analysis of variance

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Organizing Your Approach to a Data Analysis

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

ANALYSING LIKERT SCALE/TYPE DATA, ORDINAL LOGISTIC REGRESSION EXAMPLE IN R.

Introduction to Longitudinal Data Analysis

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Simple Predictive Analytics Curtis Seare

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

CENTER FOR EQUAL OPPORTUNITY. Preferences at the Service Academies

Reporting Statistics in Psychology

Finding Supporters. Political Predictive Analytics Using Logistic Regression. Multivariate Solutions


Simple Linear Regression Inference

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

How To Find Out How Different Groups Of People Are Different

MULTIPLE REGRESSION WITH CATEGORICAL DATA

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

SPSS Tests for Versions 9 to 13

Association Between Variables

Statistical Analysis RSCH 665 Eagle Vision Classroom - Blended Course Syllabus

How to Get More Value from Your Survey Data

Quantitative Methods for Finance

STATISTICS FOR PSYCHOLOGISTS

SPSS Explore procedure

Directions for using SPSS

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

1 Theory: The General Linear Model

Multinomial Logistic Regression

Introduction to Regression and Data Analysis

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén Table Of Contents

Moderator and Mediator Analysis

Sun Li Centre for Academic Computing

Microsoft Azure Machine learning Algorithms

Recommend Continued CPS Monitoring. 63 (a) 17 (b) 10 (c) (d) 20 (e) 25 (f) 80. Totals/Marginal

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Students' Opinion about Universities: The Faculty of Economics and Political Science (Case Study)

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Transcription:

When to Use a Particular Statistical Test Central Tendency Univariate Descriptive Mode the most commonly occurring value 6 people with ages 21, 22, 21, 23, 19, 21 - mode = 21 Median the center value the formula is N+1 2 6 people with ages 21, 22, 24, 23, 19, 21 line them up in order form lowest to highest 19, 21, 21, 22, 23, 24 and take the center value - mode =21.5 Mean the mathematical average the formula is X/N mean age = age of person one + age of person two + age of person three, etc./number of people Variance a measure of how spread out a distribution is it is computed as the average squared deviation of each number from its mean Standard Deviation how much scores deviate from the mean it is the square root of the variance it is the most commonly used measure of spread

Differences of Groups Bi- and Multivariate Inferential Statistical Tests Chi Square compares observed frequencies to expected frequencies Is the distribution of sex and voting behavior due to chance or is there a difference between the sexes on voting behavior? t-test looks at differences between two groups on some variable of interest the IV must have only two groups (male/female, undergrad/grad) Do males and females differ in the amount of hours they spend shopping in a given month? ANOVA tests the significance of group differences between two or more groups the IV has two or more categories only determines that there is a difference between groups, but doesn t tell which is different Do SAT scores differ for low-, middle-, and high-income students? ANCOVA same as ANOVA, but adds control of one or more covariates that may influence the DV Do SAT scores differ for low-, middle-, and high-income students after controlling for single/dual parenting? MANOVA same as ANOVA, but you can study two or more related DVs while controlling for the correlation between the DV if the DVs are not correlated, then separate ANOVAs are appropriate Does ethnicity affect reading achievement, math achievement, and overall scholastic th achievement among 6 graders? MANCOVA same as MANOVA, but adds control of one or more covariates that may influence the DV

Does ethnicity affect reading achievement, math achievement, and overall scholastic th achievement among 6 graders after controlling for social class? Relationships Correlation used with two variables to determine a relationship/association do two variables covary? does not distinguish between independent and dependent variables Amount of damage to a house on fire and number of firefighters at the fire Multiple Regression used with several independent variables and one dependent variable used for prediction it identifies the best set of predictor variables you can enter many IVs and it tells you which are best predictors by looking at all of them at the same time in sequential regression the computer adds the variables one at a time based on the amount of variance they account for IVs drug use, alcohol use, child abuse DV. suicidal tendencies Path Analysis looks at direct and indirect effects of predictor variables used for relationships/causality Child abuse causes drug use which leads to suicidal tendencies. Group Membership Logistic Regression like multiple regression, but the DV is a dichotomous variable logistic regression estimates the odds probability of the DV occurring as the values of the IVs change What are the odds of a suicide occurring at various levels of alcohol use?

Statistical Analyses Independent Variables Dependent Variables Control Variables Question Answered by the Statistic # of # of Data Type IVs DVs Type of Data Chi square 1 categorical 1 categorical 0 Do differences exist between groups? t-test 1 dichotomous 1 0 Do differences exist between 2 groups on one DV? ANOVA 1 + categorical 1 0 Do differences exist between 2 or more groups on one DV? ANCOVA 1 + categorical 1 1 + Do differences exist between 2 or more groups after controlling for CVs on one DV? MANOVA 1 + categorical 2 + 0 Do differences exist between 2 or more groups on multiple DVs? MANCOVA 1 + categorical 2 + 1 + Do differences exist between 2 or more groups after controlling for CVs on multiple Dvs? Correlation 1 dichotomous or How strongly and in what direction (i.e., +, -) are 1 0 the IV and DV related? How much variance in the DV is accounted for by Multiple dichotomous or linear combination of the IVs? Also, how strongly 2 + 1 0 regression related to the DV is the beta coefficient for each IV? Path analysis 2 + 1 + 0 Logistic Regression 1 + categorical or 1 dichotomous 0 What are the direct and indirect effects of predictor variables on the DV? What is the odds probability of the DV occurring as the values of the IVs change?

Statistics Decision Tree Research Question Number and type of DV Number and type of IV Covariates Test Goal of Analysis nominal or higher 1 nominal or higher chi square determine if difference between croups 1 dichotomous t-test Group differences 2+ 1 categorical 2+ categorical 1 categorical 2+ categorical one-way anova 1+ one-way ancova factorial anova 1+ factorial ancova 1+ 1+ one-way manova one-way mancova factorial manova factorial mancova determine significance of mean group differences create linear combo of Dvs to maximize mean group differences Degree of relationship 1 2+ bivariate correlation multiple regression determine relationship/ prediction linear combination to predict the DV 1+ 2+ path analysis estimate causal relations among variables Prediction of group membership dichotomous 2+ nominal or higher logistic regression create linear combo of IVs of the log odds of being in one group

When trying to decide what test to use, ask youself the following... Am I interested in...?: description (association) - correlations, factor analysis, path analysis explanation (prediction) - regression, logistic regression, discriminant analysis intervention (group differences) - t-test, anova, manova, chi square Do I need longitudinal data or is cross-sectional data sufficient for my purpose? Do my hypotheses involve the investigation of change, growth, or the timing of an event? If longitudinal data is necessary, how many data points are needed? (We do not cover these techniques in this class, but your major advisor can direct you to the appropriate procedure.) Is my dependent variable nominal, ordinal, interval, or ratio? nominal - chi square, logistic regression dichotomous - logistic regression ordinal - chi square interval/ratio - correlation, multiple regression, path analysis, t-test, anova, manova, discriminant analysis Do I have moderating or mediating variables? A moderating relationship can be thought of as an interaction. It occurs when the relationship between variables A and B depends on the level of C. A=marital satisfaction B=risk of divorce C=amount of resources High Low Resources Risk of Divorce High Resources Low Low High Marital Satisfaction When resources are low, marital satisfaction doesn t affect divorce, but at high resources, marital satisfaction predicts a greater risk of divorce. A mediating relationship can be thought of as an intervening relationship. It is one in which the path relating A to C is mediated by a third variable (B). We all know that older drivers, up to a point, are safer than younger drivers. But I'm sure that we

don t think that the aging (some would say deterioration) of the body, or the mere passage of time, somehow leads to safer driving. What happens, as all right thinking people will agree, is that age leads to wisdom, and wisdom leads to safer drivers. Hence wisdom is the mediating variable that explains the correlation between age and safe driving. (Forget the part about the decline in driving related to being way old, that creates a curvilinear relationship and we re not going there.) Leerkes and Crockenberg (1999) were studying the relationship between how a new mother was raised by her own mother 20+ years before (A=maternal care) and the new mother's level of selfefficacy as a mother (C=self-efficacy). The idea being that if your mother showed high levels of maternal care toward you, you would feel more confident of your ability to mother your own child. Indeed, the correlation between Maternal Care and Self-Efficacy was.272, which is significant at p <.01. But Leerkes expected that this relationship was mediated by self-esteem, such that if you had good maternal care, you will have good self-esteem (B=self-esteem), and if you have good self-esteem, that will, in turn, lead you to have high self-efficacy. A C B When A (maternal care) and C (self-efficacy) are entered into the regression equation, A predicts C. When B (self-esteem) is added to the equation, if B predicts C and the A-C relationship declines in value, that s a mediating/intervening relationship.