Multiple Linear Regression Tutorial: RSCH-8250 Advanced Quantitative Reasoning. Charles T. Diebold, Ph.D. July 11, 2013 (revised October 3, 2013)
|
|
- Collin Rich
- 7 years ago
- Views:
Transcription
1 Multiple Linear Regression Tutorial: RSCH-8250 Advanced Quantitative Reasoning Charles T. Diebold, Ph.D. July 11, 2013 (revised October 3, 2013) How to cite this document: Diebold, C. T. (2013, October 3). Multiple linear regression tutorial: RSCH-8250 advanced quantitative reasoning. Available from Assignment and Tutorial Introduction... 2 Section 1: SPSS Specification of the Assignment... 2 Section 2: Annotated Example SPSS Output, Write Up Guide, and Sample APA Tables... 5 Descriptive Statistics... 5 Bivariate Correlation Matrix... 6 Regression Method... 6 Model Summary... 7 ANOVA F Test of the Omnibus Regression... 7 Coefficients Output... 8 Results Write Up Guide Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 1 of 11
2 Multiple Linear Regression Tutorial: RSCH-8250 Advanced Quantitative Reasoning Assignment and Tutorial Introduction This tutorial is intended to assist RSCH8250 students in completing the Week 6 application assignment. I recommend that you use this tutorial as your first line of instruction; then, if you have time, study the textbook chapter and other resources noted in the classroom. 3 rd edition of Field textbook: Chapter 7 in the Field textbook, Smart Alex's Task #1 on p. 262, using the Supermodel.sav SPSS dataset. 4 th edition of Field textbook: Chapter 8 in the Field textbook, Smart Alex's Task #4 on p. 355, using the Supermodel.sav SPSS dataset. The objective of the exercise is to conduct and interpret a standard multiple regression, including assessment of multicollinearity. The tutorial contains two sections. Section 1 provides step-by-step graphic user interface (GUI) screenshots for specifying the assignment in SPSS. If you follow the steps you will produce correct SPSS output. Section 2 presents and interprets output for a different set of variables, and includes a results write up guide, and sample APA style tables (the variables and data in Section 2 are made up and do not reflect real research). Section 1: SPSS Specification of the Assignment The assignment asks you to regress the per day salary of models (SALARY) on model s age (AGE), number of years having worked as a model (YEARS), and a rating of the model s attractiveness (BEAUTY). The capitalized words are the respective variable names in the Supermodel.sav SPSS dataset. Open the dataset, The Variable View screenshot is shown below. There are four variables in the dataset, corresponding to the four variable described in the previous paragraph. Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 2 of 11
3 Go to Analyze Descriptive Statistics Descriptives. The Descriptives dialogue box appears (below left). Select all four variables and move into the Variable(s) box (below right). Click the OK button, which will produce output with the minimum, maximum, mean, and standard deviation values for each variable. Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 3 of 11
4 Go to Analyze Regression Linear as shown below. The Linear Regression dialogue box appears (below left). We want to predict salary, so it is the dependent variable; click on it to highlight, then click the arrow next to the Dependent box, which will move salary into the box. The other three variables are being used to predict salary, so they are independent variables; select and move each one into the Independent(s) box. When done, it should look like the screen below right. Below the Independent(s) box is the word Method and a dropdown box with different ways to specify the entry of the predictors. For this assignment, leave it as Enter, which will force all predictors into the analysis at the same time, regardless of whether they are statistically significant. The Enter method represents what is referred to as a standard regression. Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 4 of 11
5 In the Linear Regression dialogue box (see previous screenshot), there is a column of buttons along the upper right. Click the Statistics button; a new dialogue box will appear. Click the boxes so that checkmarks appear for each of the elements as shown at left. For the purposes of this assignment, there is no need to examine the dialogue boxes for Plots, Save, Options, or Bootstrap. Even though Field discusses some regression diagnostics, these are (except for multicollinearity) beyond the level of this course. So, once you have specified the statistics at left click the Continue button, which will return you to the Linear Regression dialogue, in which clicking the OK button will run the analysis and produce adequate output for the assignment. Example output is shown and interpreted in the next section. Section 2: Annotated Example SPSS Output, Write Up Guide, and Sample APA Tables The example output shown below uses variables different from the Week 6 assignment. The purpose is to explain key elements of the output, point out what to focus on, and demonstrate how to interpret and report the results in APA statistical style. The criterion (aka dependent variable, what we are trying to predict) is overall grade point average (GPA) of 9 th grade students. The predictors (aka independent variables) are intelligence quotient (IQ), grade earned in an English course (ENGG), and a measure of attention deficit (ADDSC). Descriptive Statistics As shown in the descriptive statistics output (from the DESCRIPTIVES procedure in SPSS), data had been collected on 216 individuals. The minimum, maximum, mean, and standard deviation of each variable are provided. Reporting on the operationalization of each variable and the observed values in the sample give the reader insight into the variable being analyzed. For example: Attention deficit was measured on a scale of 0 to 100 with higher scores indicating more pronounced attention deficit symptomatology. In the sample of 9 th grade students, attention deficit scores ranged from to with a mean of (SD = 10.45). The regression procedure will also produce a descriptive statistics table, but it does not include the minimum and maximum values. Descriptive Statistics N Minimum Maximum Mean Std. Deviation GPA IQ ENGG ADDSC Valid N (listwise) 216 Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 5 of 11
6 Bivariate Correlation Matrix From the regression output, the correlations table indicates the bivariate correlations and one-tailed p values of each pair of variables. There was a statistically significant inverse relationship between GPA and attention deficit score, r(214) = -.542, p <.001 one-tailed, indicating that as attention deficit increased, GPA tended to decrease. You can similarly report the other two bivariate correlations with GPA, but keep in mind that these are just descriptive because the focus is on the multiple regression results. As FYI, the 214 in the parenthesis in the example above is the df value. For correlations, the df value is N 2. The table below indicates that N = 216, so df = = 214. Correlations Pearson Correlation Sig. (1-tailed) N GPA ENGG ADDSC IQ GPA ENGG ADDSC IQ GPA ENGG ADDSC IQ GPA ENGG ADDSC IQ Regression Method The output below simply informs us that all three variables were entered simultaneously, which is what had to happen because we had specified the Enter method. In the results write up you just need to identify the method used. Such as: The purpose of the standard regression analysis was to examine the combined and relative effects of 9 th grade students IQ, English grade, and attention deficit score in predicting overall GPA. The term standard regression means that all predictors were entered simultaneously. Two other common methods are statistical regression (aka stepwise regression) in which variables enter according to level of significance, and sequential regression (aka hierarchical regression) in which the analyst decides and specifies the order of entry of each variable. Variables Entered/Removed a Model Variables Entered Variables Removed Method 1 IQ, ENGG, ADDSC b. Enter a. Dependent Variable: GPA b. All requested variables entered. Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 6 of 11
7 Model Summary For a standard regression the information highlighted in yellow in the Model Summary output is relevant (the other information is somewhat redundant to what we will pull out of the ANOVA output that follows). In this example, the three predictors combined explained 62.9% of the variance in overall GPA. R 2 is the sample result, and adjusted R 2 is an estimate for the population; the sample result is typically reported. Model Summary Model R R Square Adjusted R Square Std. Error of the Estimate Change Statistics R Square Change F Change df1 df2 Sig. F Change a a. Predictors: (Constant), IQ, ENGG, ADDSC ANOVA F Test of the Omnibus Regression The ANOVA output provides the test of statistical significance of the regression. In this example, the combined effect of student s IQ, English grade, and attention deficit score statistically significantly predicted overall GPA, F(3, 212) = , p <.001, R 2 =.63. The output shows.000 in the Sig column, but probability cannot be zero, so in such cases, report as p <.001 (which is APA style), do not report p =.000. To be clear, ignore Dr. Morrow s reporting of p =.000 in her videos and, instead, follow APA style. FYI for the inquisitive: The regression sum of squares of is the explained variance in GPA. The residual sum of squares of is the variance in GPA that was not explained by the three predictors. The sum of these is the total sum of squares. If you divide the total sum of squares by the regression sum of squares you get the proportion of variance explained, which is R 2 = =.629. R, which in this example is.793, is the correlation between predicted GPA and actual GPA. That is, if you saved the predicted GPA scores from the regression analysis and then did a correlation between those predicted scores and the original GPA scores that we were predicting, the correlation would be.793. For multiple regression, a R value of.14 is considered a small effect,.36 a medium effect, and.51 a large effect; these correspond to the R 2 values of.02,.13, and.26, respectively. ANOVA a Model Sum of Squares df Mean Square F Sig. Regression b 1 Residual Total a. Dependent Variable: GPA b. Predictors: (Constant), IQ, ENGG, ADDSC Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 7 of 11
8 Coefficients Output The Coefficients output details the effects of each predictor while holding the other predictors constant (or, said another way, while controlling for the other predictors). That is, standard multiple regression is not about the isolated effect of a predictor on the criterion, but the effect of a predictor on the criterion while simultaneously considering the effect of the other predictors and the correlations among the predictors. Correlations. To illustrate this point, look at the zero-order and part correlations columns. The zero-order is the simple correlation between a predictor and the criterion. For example, the simple correlation between English grade and overall GPA is shown as.746, which is the same as was shown in the previous correlations output. The part correlation (aka semipartial correlation, which is the more common term in the literature, and is the term I will use hereafter) indexes the unique relationship between the predictor and criterion that none of the other predictors explains. That is, when the predictors are correlated, some of the variance in the criterion is explained by more than one predictor; the semipartial correlation filters out any shared explanation by predictors, leaving only each predictor s unique contribution. If predictors are correlated, the semipartial will always be smaller than the simple zero-order correlation (if predictors are uncorrelated, a rare event, the zero-order and semipartial will be equal). In this example, the semipartial correlation between English grade and overall GPA is.563, much less than the simple correlation of.746. The semipartial squared (sr 2 ), a commonly reported effect size, is the proportion of variance in the criterion uniquely accounted for by the predictor; so, English grade uniquely accounted for 31.7% of the variance in overall GPA. The interpretation of the partial correlation is not as straightforward as the semipartial correlation. In addition to the unique variance accounted for, the partial correlation attributes to each predictor its relative proportion of explained variance in the criterion that is shared with other predictors. When predictors are correlated, the partial correlation will always be higher than the semipartial correlation. Relative Importance of Predictors. If interested in rank ordering the relative importance of each predictor, such is best arranged by using sr 2 or the absolute value of the semipartial correlations (Tabachnick & Fidell, 2007). In this example, order of variable importance is English grade (sr 2 =.317), IQ (sr 2 =.017), then attention deficit score (sr 2 =.013). Coefficients a Model Unstandardized Standardized t Sig. 95.0% Confidence Interval Correlations Collinearity Coefficients Coefficients for B Statistics B Std. Error Beta Lower Bound Upper Bound Zeroorder Partial Part Tolerance VIF (Constant) ENGG ADDSC IQ a. Dependent Variable: GPA Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 8 of 11
9 Unstandardized and Standardized Coefficients. The unstandardized coefficient, B, indexes the amount of raw score change in the criterion for a 1-unit raw score change in the predictor. For example, while holding the other predictors constant, if English grade increases from 2.0 to 3.0 (or 2.4 to 3.4. or 1.7 to 2.7, in other words, any 1-unit increase), overall GPA is predicted to increase by.584 points (with 95% CI from.498 to.669). Similarly, if IQ increases 10 points, overall GPA is expected to increase.110 points (i.e., the B weight is.011 for a 1 point increase, so for a 10 point increase it would be 10 x.011 =.110). From these two examples, you should be able to determine how to report the relationship between attention deficit score and overall GPA. The generic unstandardized regression equation is: Y = B 0 + B 1 X 1 + B 2 X 2 + B k X k Y is the predicted value of the criterion (aka dependent variable), B 0 is the constant, and the numbered Bs and Xs represent each predictor. Contextualized to this example, the unstandardized equation is: Overall GPA = (English grade) -.013(attention deficit score) +.011(IQ) The equation can be used to predict overall GPA for specific values of each predictor. For example, if a student had an English grade of 2.7, an attention deficit score of 65, and an IQ of 105, predicted overall GPA would be: (2.7) -.013(65) +.011(105) = The standardized coefficient, β (pronounced beta), indexes the standard unit change in the criterion for a 1-standard deviation change in the predictor. For a 1 standard deviation increase in attention deficit score, overall GPA is predicted to decrease by.156 standard deviations. Predictor Significance Tests. A t test is used to determine the statistical significance of each predictor. Technically, the t test determines if the B coefficient is different from 0. The t value is equal to the B coefficient divided by its standard error (SE). For English grade, t = = 13.58, which is within rounding error of the t value shown in the output (for the computation to be accurate, the B and SE values need to be known to several more decimal places than shown in the output). The t value is evaluated at the error degrees of freedom (df) value, which is N k - 1, where k is the number of predictor variables. For IQ one might report: While holding the effects of the other predictors constant, the effect of IQ on predicting overall GPA was statistically significant, t(212) = 3.16, p =.002, sr 2 =.017, uniquely accounting for 1.7% of the variance in overall GPA. For each 1-point increase in IQ, overall GPA was expected to increase.011 points (95% CI from.004 to.019). Similar statements should be made for the other predictors. Collinearity. Multicollinearity exists if one predictor is highly predicted by the set of other predictors, which can be the case when highly correlated with just one of the other predictors in the set. In the last two columns of the Coefficients output (see previous page) the tolerance and variance inflation factor (VIF) values can be examined to assess multicollinearity. Tolerance values greater than.1 and VIF values greater than 10 are often cited as cutoffs. Cohen, Cohen, West, and Aiken (2003) demonstrated how these cutoffs are not particularly useful in identifying potential multicollinearity issues, requiring bivariate correlations in excess of.90. The Help information within the SPSS software recommends tolerance of.5 and VIF of 2 as cutoffs for further examination of any multicollinearity effects. Correlations between predictors of.70 or higher can cause issues. Though there are other possible causes unrelated to multicollinearity, a common manifestation of multicollinearity is different signs (i.e., + or -) for a predictor s zero-order correlation and its β weight. In such cases, if multicollinearity is the culprit, the regression analysis is invalid. Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 9 of 11
10 Results Write Up Guide Begin the write up by describing the context of the research and the variables. If known, state how each variable was operationalized, for example: Overall GPA was measured on the traditional 4-point scale from 0 (F) to 4 (A), or Satisfaction was measured on a 5-point likerttype scale from 1 (not at all satisfied) to 5 (extremely satisfied). Please pay attention to APA style for reporting scale anchors (see p. 91 and p. 105 in the 6 th edition of the APA Manual). Report descriptive statistics such as minimum, maximum, mean, and standard deviation for each metric variable. For nominal variables, report percentage for each level of the variable, for example: Of the total sample (N = 150) there were 40 (26.7%) males and 110 (73.3%) females. Keep in mind that a sentence that includes information in parentheticals must still be a sentence (and make sense) if the parentheticals are removed. For example, the one above without parentheticals is still a sentence and makes sense: Of the total sample there were 40 males and 110 females. State the purpose of the analysis or provide the guiding research question(s). If you use research questions, do not craft them such that they can be answered with a yes or no. Instead, craft them so that they will have a quantitative answer. For example: What is the strength and direction of relationship between X and Y? or What is the difference in group means on X between males and females? Present null and alternative hypothesis sets applicable to the analysis. For regression there would be a hypothesis set for the overall result (i.e., the combined effect of the predictors) and a hypothesis set for each predictor while controlling for or holding constant the effects of the other predictors. State assumptions or other considerations for the analysis, and report the actual statistical result for relevant tests. For this course, the only regression consideration that needs to be presented and discussed is for multicollinearity. Even if violated, you must still report and interpret the remaining results. Report and interpret the overall regression results. Report and interpret the results of each predictor. Be sure to include the actual statistical results in text examples were provided within the annotated output section of this tutorial. Don t forget to interpret the results (e.g., as IQ increased, overall GPA was predicted to increase; based on semipartial correlations, variable x was the most important predictor of y; etc.). Draw conclusions about rejecting or failing to reject each null. If needed, summarize the results, without statistics, in a concluding sentence or paragraph. Provide APA style tables appropriate to the analysis. Do not use SPSS output, it is not in APA style. Example APA tables for a multiple regression are shown below using the results from the example output in this tutorial. Although one would typically not duplicate information in text and tables, it is important to demonstrate competence in both ways of reporting the results; so, you cannot just provide tables, you must also report the relevant statistical results within the textual write up. Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 10 of 11
11 Example APA Tables for Standard Regression Table 1 Means, Standard Deviations, and Intercorrelations for Overall GPA and Predictor Variables IQ, English Grade, and Attention Deficit Score Variable M SD Overall GPA IQ < English grade <.001 < Attention deficit score <.001 <.001 <.001 Note. Upper diagonal contains correlation coefficients. Lower diagonal contains p values. Table 2 Standard Regression Summary for IQ, English Grade, and Attention Deficit Scores Predicting Overall GPA Variable B 95% CI β sr p Constant [-0.666, 1.620] IQ [0.004, 0.019] English grade [0.498, 0.669] <.001 Attention deficit score [-0.022, ] Note. CI = confidence interval for B; sr = semipartial correlation (aka, part correlation). Charles T. Diebold, Ph.D., 7/11/13, 10/03/13. All Rights Reserved. Page 11 of 11
Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
More informationMultiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.
Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear. In the main dialog box, input the dependent variable and several predictors.
More informationSPSS Guide: Regression Analysis
SPSS Guide: Regression Analysis I put this together to give you a step-by-step guide for replicating what we did in the computer lab. It should help you run the tests we covered. The best way to get familiar
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationMultiple Regression. Page 24
Multiple Regression Multiple regression is an extension of simple (bi-variate) regression. The goal of multiple regression is to enable a researcher to assess the relationship between a dependent (predicted)
More informationThis chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
More informationSimple Linear Regression, Scatterplots, and Bivariate Correlation
1 Simple Linear Regression, Scatterplots, and Bivariate Correlation This section covers procedures for testing the association between two continuous variables using the SPSS Regression and Correlate analyses.
More informationModerator and Mediator Analysis
Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,
More informationCorrelation and Regression Analysis: SPSS
Correlation and Regression Analysis: SPSS Bivariate Analysis: Cyberloafing Predicted from Personality and Age These days many employees, during work hours, spend time on the Internet doing personal things,
More informationFactor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models
Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis
More informationChapter 13 Introduction to Linear Regression and Correlation Analysis
Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing
More informationFalse. Model 2 is not a special case of Model 1, because Model 2 includes X5, which is not part of Model 1. What she ought to do is estimate
Sociology 59 - Research Statistics I Final Exam Answer Key December 6, 00 Where appropriate, show your work - partial credit may be given. (On the other hand, don't waste a lot of time on excess verbiage.)
More informationDirections for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
More informationINTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)
INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of
More informationUNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
More informationSPSS Explore procedure
SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,
More informationMultiple Regression Using SPSS
Multiple Regression Using SPSS The following sections have been adapted from Field (2009) Chapter 7. These sections have been edited down considerably and I suggest (especially if you re confused) that
More informationJanuary 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
More informationMultiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
More informationModeration. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
More informationChapter 5 Analysis of variance SPSS Analysis of variance
Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationSPSS-Applications (Data Analysis)
CORTEX fellows training course, University of Zurich, October 2006 Slide 1 SPSS-Applications (Data Analysis) Dr. Jürg Schwarz, juerg.schwarz@schwarzpartners.ch Program 19. October 2006: Morning Lessons
More informationHow to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
More information1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96
1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationDidacticiel - Études de cas
1 Topic Regression analysis with LazStats (OpenStat). LazStat 1 is a statistical software which is developed by Bill Miller, the father of OpenStat, a wellknow tool by statisticians since many years. These
More informationLinear Models in STATA and ANOVA
Session 4 Linear Models in STATA and ANOVA Page Strengths of Linear Relationships 4-2 A Note on Non-Linear Relationships 4-4 Multiple Linear Regression 4-5 Removal of Variables 4-8 Independent Samples
More informationDoing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:
Doing Multiple Regression with SPSS Multiple Regression for Data Already in Data Editor Next we want to specify a multiple regression analysis for these data. The menu bar for SPSS offers several options:
More informationOne-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate
1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationSCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES
SCHOOL OF HEALTH AND HUMAN SCIENCES Using SPSS Topics addressed today: 1. Differences between groups 2. Graphing Use the s4data.sav file for the first part of this session. DON T FORGET TO RECODE YOUR
More informationThe Dummy s Guide to Data Analysis Using SPSS
The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests
More informationScatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
More informationChapter 23. Inferences for Regression
Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily
More informationChapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data.
Chapter 15 Mixed Models A flexible approach to correlated data. 15.1 Overview Correlated data arise frequently in statistical analyses. This may be due to grouping of subjects, e.g., students within classrooms,
More informationPredictability Study of ISIP Reading and STAAR Reading: Prediction Bands. March 2014
Predictability Study of ISIP Reading and STAAR Reading: Prediction Bands March 2014 Chalie Patarapichayatham 1, Ph.D. William Fahle 2, Ph.D. Tracey R. Roden 3, M.Ed. 1 Research Assistant Professor in the
More informationMultivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine
2 - Manova 4.3.05 25 Multivariate Analysis of Variance What Multivariate Analysis of Variance is The general purpose of multivariate analysis of variance (MANOVA) is to determine whether multiple levels
More informationAssociation Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
More informationData exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
More informationChapter 2 Probability Topics SPSS T tests
Chapter 2 Probability Topics SPSS T tests Data file used: gss.sav In the lecture about chapter 2, only the One-Sample T test has been explained. In this handout, we also give the SPSS methods to perform
More informationINTRODUCTION TO MULTIPLE CORRELATION
CHAPTER 13 INTRODUCTION TO MULTIPLE CORRELATION Chapter 12 introduced you to the concept of partialling and how partialling could assist you in better interpreting the relationship between two primary
More informationDEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
More informationPremaster Statistics Tutorial 4 Full solutions
Premaster Statistics Tutorial 4 Full solutions Regression analysis Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression = 125,000 + 150. b. What is the prediction for
More informationEPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST
EPS 625 INTERMEDIATE STATISTICS The Friedman test is an extension of the Wilcoxon test. The Wilcoxon test can be applied to repeated-measures data if participants are assessed on two occasions or conditions
More informationUnit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
More informationIndependent t- Test (Comparing Two Means)
Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent
More informationStepwise Regression. Chapter 311. Introduction. Variable Selection Procedures. Forward (Step-Up) Selection
Chapter 311 Introduction Often, theory and experience give only general direction as to which of a pool of candidate variables (including transformed variables) should be included in the regression model.
More informationCourse Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.
SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed
More informationIntroduction to Analysis of Variance (ANOVA) Limitations of the t-test
Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only
More informationAn introduction to IBM SPSS Statistics
An introduction to IBM SPSS Statistics Contents 1 Introduction... 1 2 Entering your data... 2 3 Preparing your data for analysis... 10 4 Exploring your data: univariate analysis... 14 5 Generating descriptive
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More informationChapter 7: Simple linear regression Learning Objectives
Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -
More informationBinary Logistic Regression
Binary Logistic Regression Main Effects Model Logistic regression will accept quantitative, binary or categorical predictors and will code the latter two in various ways. Here s a simple model including
More informationCOMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
More informationMULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis
Journal of tourism [No. 8] MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM Assistant Ph.D. Erika KULCSÁR Babeş Bolyai University of Cluj Napoca, Romania Abstract This paper analysis
More informationUNDERSTANDING THE TWO-WAY ANOVA
UNDERSTANDING THE e have seen how the one-way ANOVA can be used to compare two or more sample means in studies involving a single independent variable. This can be extended to two independent variables
More informationAn Introduction to Path Analysis. nach 3
An Introduction to Path Analysis Developed by Sewall Wright, path analysis is a method employed to determine whether or not a multivariate set of nonexperimental data fits well with a particular (a priori)
More informationTwo Related Samples t Test
Two Related Samples t Test In this example 1 students saw five pictures of attractive people and five pictures of unattractive people. For each picture, the students rated the friendliness of the person
More informationUNDERSTANDING THE INDEPENDENT-SAMPLES t TEST
UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly
More informationMulticollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015
Multicollinearity Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ Last revised January 13, 2015 Stata Example (See appendices for full example).. use http://www.nd.edu/~rwilliam/stats2/statafiles/multicoll.dta,
More informationDDBA 8438: The t Test for Independent Samples Video Podcast Transcript
DDBA 8438: The t Test for Independent Samples Video Podcast Transcript JENNIFER ANN MORROW: Welcome to The t Test for Independent Samples. My name is Dr. Jennifer Ann Morrow. In today's demonstration,
More informationPsyc 250 Statistics & Experimental Design. Correlation Exercise
Psyc 250 Statistics & Experimental Design Correlation Exercise Preparation: Log onto Woodle and download the Class Data February 09 dataset and the associated Syntax to create scale scores Class Syntax
More informationSPSS Guide How-to, Tips, Tricks & Statistical Techniques
SPSS Guide How-to, Tips, Tricks & Statistical Techniques Support for the course Research Methodology for IB Also useful for your BSc or MSc thesis March 2014 Dr. Marijke Leliveld Jacob Wiebenga, MSc CONTENT
More informationData Analysis for Marketing Research - Using SPSS
North South University, School of Business MKT 63 Marketing Research Instructor: Mahmood Hussain, PhD Data Analysis for Marketing Research - Using SPSS Introduction In this part of the class, we will learn
More informationIBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA
CALIFORNIA STATE UNIVERSITY, LOS ANGELES INFORMATION TECHNOLOGY SERVICES IBM SPSS Statistics 20 Part 4: Chi-Square and ANOVA Summer 2013, Version 2.0 Table of Contents Introduction...2 Downloading the
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationCHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA
CHAPTER 14 ORDINAL MEASURES OF CORRELATION: SPEARMAN'S RHO AND GAMMA Chapter 13 introduced the concept of correlation statistics and explained the use of Pearson's Correlation Coefficient when working
More information1.1. Simple Regression in Excel (Excel 2010).
.. Simple Regression in Excel (Excel 200). To get the Data Analysis tool, first click on File > Options > Add-Ins > Go > Select Data Analysis Toolpack & Toolpack VBA. Data Analysis is now available under
More informationChapter Four. Data Analyses and Presentation of the Findings
Chapter Four Data Analyses and Presentation of the Findings The fourth chapter represents the focal point of the research report. Previous chapters of the report have laid the groundwork for the project.
More informationChapter 7 Factor Analysis SPSS
Chapter 7 Factor Analysis SPSS Factor analysis attempts to identify underlying variables, or factors, that explain the pattern of correlations within a set of observed variables. Factor analysis is often
More informationThere are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
More informationRegression Analysis (Spring, 2000)
Regression Analysis (Spring, 2000) By Wonjae Purposes: a. Explaining the relationship between Y and X variables with a model (Explain a variable Y in terms of Xs) b. Estimating and testing the intensity
More informationDISCRIMINANT FUNCTION ANALYSIS (DA)
DISCRIMINANT FUNCTION ANALYSIS (DA) John Poulsen and Aaron French Key words: assumptions, further reading, computations, standardized coefficents, structure matrix, tests of signficance Introduction Discriminant
More information5. Correlation. Open HeightWeight.sav. Take a moment to review the data file.
5. Correlation Objectives Calculate correlations Calculate correlations for subgroups using split file Create scatterplots with lines of best fit for subgroups and multiple correlations Correlation The
More informationAn analysis method for a quantitative outcome and two categorical explanatory variables.
Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that
More informationCorrelational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
More informationHYPOTHESIS TESTING WITH SPSS:
HYPOTHESIS TESTING WITH SPSS: A NON-STATISTICIAN S GUIDE & TUTORIAL by Dr. Jim Mirabella SPSS 14.0 screenshots reprinted with permission from SPSS Inc. Published June 2006 Copyright Dr. Jim Mirabella CHAPTER
More informationAnalysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk
Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
More informationIBM SPSS Statistics for Beginners for Windows
ISS, NEWCASTLE UNIVERSITY IBM SPSS Statistics for Beginners for Windows A Training Manual for Beginners Dr. S. T. Kometa A Training Manual for Beginners Contents 1 Aims and Objectives... 3 1.1 Learning
More informationCALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
More informationTHE KRUSKAL WALLLIS TEST
THE KRUSKAL WALLLIS TEST TEODORA H. MEHOTCHEVA Wednesday, 23 rd April 08 THE KRUSKAL-WALLIS TEST: The non-parametric alternative to ANOVA: testing for difference between several independent groups 2 NON
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More informationExamining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish
Examining Differences (Comparing Groups) using SPSS Inferential statistics (Part I) Dwayne Devonish Statistics Statistics are quantitative methods of describing, analysing, and drawing inferences (conclusions)
More informationWhen to use Excel. When NOT to use Excel 9/24/2014
Analyzing Quantitative Assessment Data with Excel October 2, 2014 Jeremy Penn, Ph.D. Director When to use Excel You want to quickly summarize or analyze your assessment data You want to create basic visual
More informationSimple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
More informationMain Effects and Interactions
Main Effects & Interactions page 1 Main Effects and Interactions So far, we ve talked about studies in which there is just one independent variable, such as violence of television program. You might randomly
More informationGood luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:
Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More informationX X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)
CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.
More information1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 Introduce moderated multiple regression Continuous predictor continuous predictor Continuous predictor categorical predictor Understand
More informationChapter 7. Comparing Means in SPSS (t-tests) Compare Means analyses. Specifically, we demonstrate procedures for running Dependent-Sample (or
1 Chapter 7 Comparing Means in SPSS (t-tests) This section covers procedures for testing the differences between two means using the SPSS Compare Means analyses. Specifically, we demonstrate procedures
More informationSTATISTICS FOR PSYCHOLOGISTS
STATISTICS FOR PSYCHOLOGISTS SECTION: STATISTICAL METHODS CHAPTER: REPORTING STATISTICS Abstract: This chapter describes basic rules for presenting statistical results in APA style. All rules come from
More informationPart 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
More informationSPSS Resources. 1. See website (readings) for SPSS tutorial & Stats handout
Analyzing Data SPSS Resources 1. See website (readings) for SPSS tutorial & Stats handout Don t have your own copy of SPSS? 1. Use the libraries to analyze your data 2. Download a trial version of SPSS
More informationMULTIPLE REGRESSION WITH CATEGORICAL DATA
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 86 MULTIPLE REGRESSION WITH CATEGORICAL DATA I. AGENDA: A. Multiple regression with categorical variables. Coding schemes. Interpreting
More informationA Basic Guide to Analyzing Individual Scores Data with SPSS
A Basic Guide to Analyzing Individual Scores Data with SPSS Step 1. Clean the data file Open the Excel file with your data. You may get the following message: If you get this message, click yes. Delete
More informationBusiness Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
More informationSPSS TUTORIAL & EXERCISE BOOK
UNIVERSITY OF MISKOLC Faculty of Economics Institute of Business Information and Methods Department of Business Statistics and Economic Forecasting PETRA PETROVICS SPSS TUTORIAL & EXERCISE BOOK FOR BUSINESS
More information