Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences

Size: px
Start display at page:

Download "Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences"

Transcription

1 Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences Third Edition Jacob Cohen (deceased) New York University Patricia Cohen New York State Psychiatric Institute and Columbia University College of Physicians and Surgeons Stephen G. West Arizona State University Leona S. Aiken Arizona State University leq LAWRENCE ERLBAUM ASSOCIATES, PUBLISHERS 2003 Mahwah, New Jersey London

2 Contents Preface Chapter 1: Introduction 1.1 Multiple Regression/Correlation as a General Data-Analytic System Overview Testing Hypotheses Using Multiple Regression/Correlation: Some Examples Multiple Regression/Correlation in Prediction Models A Comparison of Multiple Regression/Correlation and Analysis of Variance Approaches Historical Background Hypothesis Testing and Effect Sizes Multiple Regression/Correlation and the Complexity of Behavioral Science Multiplicity of Influences Correlation Among Research Factors and Partialing Form of Information Shape of Relationship General and Conditional Relationships Orientation of the Book Nonmathematical Applied Data-Analytic Inference Orientation and Specification Error Computation, the Computer, and Numerical Results Computation 14

3 viii CONTENTS Numerical Results: Reporting and Rounding Signiflcance Tests, Confidence Intervals, and Appendix Tables The Spectrum of Behavioral Science Plan for the Book Content Structure: Numbering of Sections, Tables, andequations Summary 18 Chapter 2: Bivariate Correlation and Regression 2.1 Tabular and Graphic Representations of Relationships The Index of Linear Correlation Between Two Variables: The Pearson Product Moment Correlation Coefficient Standard Scores: Making Units Comparable The Product Moment Correlation as a Function of Differences Between z Scores Alternative Formulas for the Product Moment Correlation Coefficient r as the Average Product of z Scores Raw Score Formulas for r Point Biserial r Phi (<)>) Coefficient Rank Correlation Regression Coefficients: Estimating yfromx Regression Toward the Mean The Standard Error of Estimate and Measures of the Strength of Association Summary of Definitions and Interpretations Statistical Inference With Regression and Correlation Coefficients Assumptions Underlying Statistical Inference With Byx, B 0, F and r XY Estimation With Confidence Intervals Null Hypothesis Signiflcance Tests (NHSTs) Confidence Limits and Null Hypothesis Signiflcance Testing Precision and Power Precision of Estimation Power of Null Hypothesis Signiflcance Tests Factors Affecting the Size of r The Distributions of X and Y The Reliability of the Variables Restriction of Range Part-Whole Correlations Ratio or Index Variables Curvilinear Relationships Summary 62

4 Chapter 3: Multiple Regression/Correlation With Two or More Independent Variables 3.1 Introduction: Regression and Causal Models What Is a Cause? Diagrammatic Representation of Causal Models Regression With Two Independent Variables Measures of Association With Two Independent Variables Multiple R and R Semipartial Correlation Coefficients and Increments to R Partial Correlation Coefficients Patterns of Association Between Y and Two Independent Variables Direct and Indirect Effects Partial Redundancy Suppression in Regression Models Spurious Effects and Entirely Indirect Effects Multiple Regression/Correlation With k Independent Variables Introduction: Components of the Prediction Equation Partial Regression Coefficients R, R 2, and Shrunken R sr and sr pr andpr Example of Interpretation of Partial Coefficients Statistical Inference With k Independent Variables Standard Errors and Confidence Intervals for B andß Confidence Intervals for R Confidence Intervals for Differences Between Independent R 2 s Statistical Tests on Multiple and Partial Coefficients Statistical Precision and Power Analysis Introduction: Research Goals and the Null Hypothesis The Precision and Power ofr Precision and Power Analysis for Partial Coefficients Using Multiple Regression Equations in Prediction Prediction of Y for a New Observation Correlation of Individual Variables With Predicted Values Cross-Validation and Unit Weighting Multicollinearity Summary 99

5 X CONTENTS Chapter 4: Data Visualization, Exploration, and Assumption Checking: Diagnosing and Solving Regression Problems I 4.1 Introduction Some Useful Graphical Displays of the Original Data Univariate Displays Bivariate Displays Correlation and Scatterplot Matrices Assumptions and Ordinary Least Squares Regression Assumptions Underlying Multiple Linear Regression Ordinary Least Squares Estimation Detecting Violations of Assumptions Form of the Relationship Omitted Independent Variables Measurement Error Homoscedasticity of Residuais Nonindependence of Residuais Normality of Residuais Remedies: Alternative Approaches When Problems Are Detected Form of the Relationship Inclusion of All Relevant Independent Variables Measurement Error in the Independent Variables Nonconstant Variance Nonindependence of Residuais Summary 150 Chapter 5: Data-Analytic Strategies Using Multiple Regression/Correlation 5.1 Research Questions Answered by Correlations and Their Squares Net Contribution to Prediction Indicesof Differential Validity Comparisons of Predictive Utility Attribution of a Fraction of the XY Relationship to a Third Variable Which of Two Variables Accounts for More of the XY Relationship? Are the Various Squared Correlations in One Population Different From Those in Another Given the Same Variables? Research Questions Answered by B Or ß Regression Coefficients as Reflections of Causal Effects Alternative Approaches to Making ß ra Substantively Meaningful Are the Effects of a Set of Independent Variables on Two Different Outcomes in a Sample Different? 157

6 5.2.4 What Are the Reciprocal Effects of Two Variables on One Another? Hierarchical Analysis Variables in Multiple Regression/ Correlation Causal Priority and the Removal of Confounding Variables Research Relevance Examination of Alternative Hierarchical Sequences of Independent Variable Sets Stepwise Regression The Analysis of Sets of Independent Variables TypesofSets The Simultaneous and Hierarchical Analyses of Sets Variance Proportions for Sets and the Ballantine Again B and ß Coefficients for Variables Within Sets Significance Testing for Sets Application in Hierarchical Analysis Application in Simultaneous Analysis Using Computer Output to Determine Statistical Significance An Alternative F Test: Using Model 2 Error Estimate From the Final Model Power Analysis for Sets Determining n* for the F Test of sr\ with Model 1 or Model 2 Error Estimating the Population sr 2 Values Setting Power for n* Reconciling Differentn*s Power as a Function of n Tactics of Power Analysis Statistical Inference Strategy in Multiple Regression/ Correlation Controlling and Balancing Type I and Type II Errors in Inference Less Is More LeastlsLast Adaptation of Fisher's Protected t Test Statistical Inference and the Stage of Scientific Investigations Summary 190 CONTENTS Chapter 6: Quantitative Scales, Curvilinear Reiationships, and Transformations Introduction What Do We Mean by Linear Regression? 193

7 CONTENTS Linearity in the Variables and Linear Multiple Regression Four Approaches to Examining Nonlinear Relationships in Multiple Regression 195 Power Polynomials Method An Example: Quadratic Fit Centering Predictors in Polynomial Equations Relationship of Test of Significance of Highest Order Coefficient and Gain in Prediction Interpreting Polynomial Regression Results Another Example: A Cubic Fit Strategy and Limitations More Complex Equations 213 Orthogonal Polynomials The Cubic Example Revisited Unequal n and Unequal Intervals Applications and Discussion 220 Nonlinear Transformations Purposes of Transformation and the Nature of Transformations The Conceptual Basis of Transformations and Model Checking Before and After Transformation Is It Always Ideal to Transform? Logarithms and Exponents; Additive and Proportional Relationships Linearizing Relationships Linearizing Relationships Based on Strong Theoretical Models Linearizing Relationships Based on Weak Theoretical Models Empirically Driven Transformations in the Absence of Strong or Weak Models Empirically Driven Transformation for Linearization: The Ladder of Re-expression and the Bulging Rule Empirically Driven Transformation for Linearization in the Absence of Models: Box-Cox Family of Power Transformations on Y Empirically Driven Transformation for Linearization in the Absence of Models: Box-Tidwell Family of Power Transformations on X Linearization of Relationships With Correlations: Fisher z' Transform of r Transformations That Linearize Relationships for Counts and Proportions Variance Stabilizing Transformations and Alternatives for Treatment of Heteroscedasticity Transformations to Normalize Variables Diagnostics Following Transformation 247

8 CONTENTS XIII Measuring and Comparing Model Fit Second-Order Polynomial Numerical Example Revisited When to Transform and the Choice of Transformation Nonlinear Regression Nonparametric Regression Summary 253 Chapter 7: Interactions Among Continuous Variables Introduction Interactions Versus Additive Effects Conditional First-Order Effects in Equations Containing Interactions Centering Predictors and the Interpretation of Regression Coefficients in Equations Containing Interactions Regression with Centered Predictors Relationship Between Regression Coefficients in the Uncentered and Centered Equations Centered Equations With No Interaction Essential Versus Nonessential Multicollinearity Centered Equations With Interactions The Highest Order Interaction in the Centered Versus Uncentered Equation Do Not Center Y A Recommendation for Centering Simple Regression Equations and Simple Slopes Plotting Interactions Moderator Variables Simple Regression Equations Overall Regression Coefficient and Simple Slope at the Mean Simple Slopes From Uncentered Versus Centered Equations Are Identical Linear by Linear Interactions Interpreting Interactions in Multiple Regression and Analysis of Variance Post Hoc Probing of Interactions Standard Error of Simple Slopes Equation Dependence of Simple Slopes and Their Standard Errors Tests of Significance of Simple Slopes Confidence Intervals Around Simple Slopes A Numerical Example The Uncentered Regression Equation Revisited First-Order Coefficients in Equations Without and With Interactions Interpretation and the Range of Data 282

9 XIV CONTENTS 7.5 Standardized Estimates for Equations Containing Interactions Interactions as Partialed Effects: Building Regression Equations With Interactions Patteras of First-Order and Interactive Effects Three Theoretically Meaningful Patterns of First-Order and Interaction Effects Ordinal Versus Disordinal Interactions Three-Predictor Interactions in Multiple Regression Curvilinear by Linear Interactions Interactions Among Sets of Variables Issues in the Detection of Interactions: Reliability, Predictor Distributions, Model Specification Variable Reliability and Power to Detect Interactions Sampling Designs to Enhance Power to Detect Interactions Optimal Design Difficulty in Distinguishing Interactions Versus Curvilinear Effects Summary 300 Chapter 8: Categorical or Nominal Independent Variables Introduction Categories as a Set of Independent Variables The Representation of Categories or Nominal Scales Dummy-Variable Coding Coding the Groups Pearson Correlations of Dummy Variables With Y Correlations Among Dummy-Coded Variables Multiple Correlation of the Dummy-Variable Set Wim Regression Coefficients for Dummy Variables Partial and Semipartial Correlations for Dummy Variables Dummy-Variable Multiple Regression/Correlation and One-Way Analysis of Variance A Cautionary Note: Dummy-Variable-Like Coding Systems Dummy-Variable Coding When Groups Are Not Mutually Exclusive Unweighted Effects Coding Introduction: Unweighted and Weighted Effects Coding Constructing Unweighted Effects Codes The R 2 and the r K s for Unweighted Effects Codes Regression Coefficients and Other Partial Effects in Unweighted Code Sets 325

10 CONTENTS XV 8.4 Weighted Effects Coding Selection Considerations for Weighted Effects Coding Constructing Weighted Effects The R 2 and R 2 for Weighted Effects Codes Interpretation and Testing of B With Unweighted Codes Contrast Coding Considerations in the Selection of a Contrast Coding Scheine Constructing Contrast Codes The R 2 and R Partial Regression Coefficients Statistical Power and the Choice of Contrast Codes Nonsense Coding Coding Schemes in the Context of Other Independent Variables Combining Nominal and Continuous Independent Variables Calculating Adjusted Means for Nominal Independent Variables Adjusted Means for Combinations of Nominal and Quantitative Independent Variables Adjusted Means for More Than Two Groups and Alternative Coding Methods Multiple Regression/Correlation With Nominal Independent Variables and the Analysis of Covariance Summary 351 Chapter 9: Interactions With Categorical Variables Nominal Scale by Nominal Scale Interactions The 2 by 2 Design Regression Analyses of Multiple Sets of Nominal Variables With More Than Two Categories Interactions Involving More Than Two Nominal Scales An Example of Three Nominal Scales Coded by Alternative Methods Interactions Among Nominal Scales in Which Not All Combinations Are Considered What If the Categories for One or More Nominal "Scales" Are Not Mutually Exclusive? Consideration of pr, ß, and Variance Proportions for Nominal Scale Interaction Variables Summary of Issues and Recommendations for Interactions Among Nominal Scales Nominal Scale by Continuous Variable Interactions A Reminder on Centering 375

11 XVi CONTENTS Interactions of a Continuous Variable With Dummy-Variable Coded Groups Interactions Using Weighted or Unweighted Effects Codes Interactions With a Contrast-Coded Nominal Scale Interactions Coded to Estimate Simple Slopes of Groups Categorical Variable Interactions With Nonlinear Effects of Scaled Independent Variables Interactions of a Scale With Two or More Categorical Variables Summary 388 Chapter 10: Outliers and Multicoilinearity: Diagnosing and Solving Regression Problems II Introduction Outliers: Introduction and Illustration Detecting Outliers: Regression Diagnostics Extremity on the Independent Variables: Leverage Extremity on Y: Discrepancy Influence on the Regression Estimates Location of Outlying Points and Diagnostic Statistics Summary and Suggestions Sources of Outliers and Possible Remedial Actions Sources of Outliers Remedial Actions Multicoilinearity Exact Cpllinearity Multicoilinearity: A Numerical Illustration Measures of the Degree of Multicoilinearity Remedies for Multicoilinearity Model Respecification Collection of Additional Data Ridge Regression Principal Components Regression Summary of Multicoilinearity Considerations Summary 430 Chapter 11: Missing Data Basic Issues in Handling Missing Data Minimize Missing Data Types of Missing Data Traditional Approaches to Missing Data 433

12 CONTENTS xvii 11.2 Missing Data in Nominal Scales Coding Nominal Scale X for Missing Data Missing Data on Two Dichotomies Estimation Using the EM Algorithm Missing Data in Quantitative Scales Available Alternatives Imputation of Values for Missing Cases Modeling Solutions to Missing Data in Scaled Variables An Illustrative Comparison of Alternative Methods RulesofThumb Summary 450 Chapter 12: Multiple Regression/Correlation and Causal Models Introduction Limits on the Current Discussion and the Relationship Between Causal Analysis and Analysis of Covariance Theories and Multiple Regression/Correlation Models That Estimate and Test Them Kinds of Variables in Causal Models Regression Models as Causal Models Models Without Reciprocal Causation Direct and Indirect Effects Path Analysis and Path Coefficients Hierarchical Analysis and Reduced Form Equations Partial Causal Models and the Hierarchical Analysis of Sets Testing Model Elements Models With Reciprocal Causation Identification and Overidentification Just Identified Models Overidentification Underidentification Latent Variable Models An Example of a Latent Variable Model How Latent Variables Are Estimated Fixed and Free Estimates in Latent Variable Models Goodness-of-Fit Tests of Latent Variable Models Latent Variable Models and the Correction for Attenuation Characteristics of Data Sets That Make Latent Variable Analysis the Method of Choice A Review of Causal Model and Statistical Assumptions 475

13 XVÜi CONTENTS Specification Error Identification Error Comparisons of Causal Models Nested Models Longitudinal Data in Causal Models Summary 477 Chapter 13: Alternative Regression Models: Logistic, Poisson Regression, and the Generalized Linear Model Ordinary Least Squares Regression Revisited Three Characteristics of Ordinary Least Squares Regression The Generalized Linear Model Relationship of Dichotomous and Count Dependent Variables Y to a Predictor Dichotomous Outcomes and Logistic Regression Extending Linear Regression: The Linear Probability Model and Discriminant Analysis The Nonlinear Transformation From Predictor to Predicted Scores: Probit and Logistic Transformation The Logistic Regression Equation Numerical Example: Three Forms of the Logistic Regression Equation Understanding the Coefficients for the Predictor in Logistic Regression Multiple Logistic Regression Numerical Example Confidence Intervals on Regression Coefficients and Odds Ratios Estimation of the Regression Model: Maximum Likelihood Deviances: Indices of Overall Fit of the Logistic Regression Model Multiple R 2 Analogs in Logistic Regression Testing Significance of Overall Model Fit: The Likelihood Ratio Test and the Test of Model Deviance x 2 Test for the Significance of a Single Predictor in a Multiple Logistic Regression Equation Hierarchical Logistic Regression: Likelihood Ratio x 2 Test for the Significance of a Set of Predictors Above and Beyond Another Set Akaike's Information Criterion and the Bayesian Information Criterion for Model Comparison Some Treachery in Variable Scaling and Interpretation of the Odds Ratio 509

14 CONTENTS xix Regression Diagnostics in Logistic Regression Sparseness of Data Classification of Cases Extensions of Logistic Regression to Multiple Response Categories: Polytomous Logistic Regression and Ordinal Logistic Regression Polytomous Logistic Regression Nested Dichotomies Ordinal Logistic Regression Models for Count Data: Poisson Regression and Alternatives Linear Regression Applied to Count Data Poisson Probability Distribution Poisson Regression Analysis Overdispersion and Alternative Models Independence of Observations Sources on Poisson Regression Füll Circle: Parallels Between Logistic and Poisson Regression, and the Generalized Linear Model Parallels Between Poisson and Logistic Regression The Generalized Linear Model Revisited Summary 535 Chapter 14: Random Coefficient Regression and Multilevel Models Clustering Within Data Sets Clustering, Alpha Inflation, and the Intraclass Correlation Estimating the Intraclass Correlation Analysis of Clustered Data With Ordinary Least Squares Approaches Numerical Example, Analysis of Clustered Data With Ordinary Least Squares Regression The Random Coefficient Regression Model Random Coefficient Regression Model and Multilevel Data Structure Ordinary Least Squares (Fixed Effects) Regression Revisited Fixed and Random Variables Clustering and Hierarchically Structured Data Structure of the Random Coefficient Regression Model Level 1 Equations Level 2 Equations Mixed Model Equation for Random Coefficient Regression Variance Components New Parameters in the Multilevel Model Variance Components and Random Coefficient Versus Ordinary Least Squares (Fixed Effects) Regression 549

15 XX CONTENTS Parameters of the Random Coefficient Regression Model: Fixed and Random Effects Numerical Example: Analysis of Clustered Data With Random Coefficient Regression Unconditional Cell Means Model and the Intraclass Correlation Testing the Fixed and Random Parts of the Random Coefficient Regression Model Clustering as a Meaningful Aspect of the Data Multilevel Modeling With a Predictor at Level Level 1 Equations Revised Level 2 Equations Mixed Model Equation With Level 1 Predictor and Level 2 Predictor of Intercept and Slope and the Cross-Level Interaction An Experimental Design as a Multilevel Data Structure: Combining Experimental Manipulation With Individual Differences Numerical Example: Multilevel Analysis Estimation of the Multilevel Model Parameters: Fixed Effects, Variance Components, and Level 1 Equations Fixed Effects and Variance Components An Equation for Each G/oup: Empirical Bayes Estimates of Level 1 Coefficients Statistical Tests in Multilevel Models Fixed Effects Variance Components Some Model Specification Issues The Same Variable at Two Levels Centering in Multilevel Models Statistical Power of Multilevel Models Choosing Between the Fixed Effects Model and the Random Coefficient Model Sources on Multilevel Modeling Multilevel Models Applied to Repeated Measures Data Summary 567 Chapter 15: Longitudinal Regression Methods 15.1 Introduction Chapter Goals Purposes of Gathering Data on Multiple Occasions Analyses of Two-Time-Point Data Change or Regressed Change? Alternative Regression Models for Effects Over a Single Unit of Time Three- or Four-Time-Point Data Repeated Measure Analysis of Variance 573

16 Multiple Error Terms in Repeated Measure Analysis of Variance Trend Analysis in Analysis of Variance Repeated Measure Analysis of Variance in Which Time Is Not the Issue Multilevel Regression of Individual Changes Over Time Patterns of Individual Change Over Time Adding Other Fixed Predictors to the Model Individual Differences in Variation Around Individual Slopes Alternative Developmental Models and Error Structures Alternative Link Functions for Predicting Y From Time Unbalanced Data: Variable Timing and Missing Data Latent Growth Models: Structural Equation Model Representation of Multilevel Data Estimation of Changes in True Scores Representation of Latent Growth Models in Structural Equation Model Diagrams Comparison of Multilevel Regression and Structural Equation Model Analysis of Change Time Varying Independent Variables Survival Analysis Regression Analysis of Time Until Outcome and the Problem of Censoring Extension to Time-Varying Independent Variables Extension to Multiple Episode Data Extension to a Categorical Outcome: Event-History Analysis Time Series Analysis 600 / Units of Observation in Time Series Analyses Time Series Analyses Applications Time Effects in Time Series Extension of Time Series Analyses to Multiple Units or Subjects Dynamic System Analysis Statistical Inference and Power Analysis in Longitudinal Analyses Summary 605 CONTENTS Chapter 16: Multiple Dependent Variables: Set Correlation Introduction to Ordinary Least Squares Treatment of Multiple Dependent Variables Set Correlation Analysis 608

17 XXII CONTENTS Canonical Analysis Elements of Set Correlation Measures of Multivariate Association R Y,x> me Proportion of Generalized Variance Ty x and P Y,x> Proportions of Additive Variance Partialing in Set Correlation Frequent Reasons for Partialing Variable Sets From the Basic Sets The Five Types of Association Between Basic Y and X Sets Tests of Statistical Significance and Statistical Power Testing the Null Hypothesis Estimators of the Population R YJC, T\j, and P 2 YJi Guarding Against Type I Error Inflation Statistical Power Analysis in Set Correlation Comparison of Set Correlation With Multiple Analysis of Variance New Analytic Possibilities With Set Correlation Illustrative Examples A Simple Whole Association A Multivariate Analysis of Partial Variance A Hierarchical Analysis of a Quantitative Set and Its Unique Components Bipartial Association Among Three Sets Summary 627 APPENDICES Appendix 1: The Mathematical Basis for Multiple Regression/Correlation and Identification of the Inverse Matrix Elements 631 ALI Alternative Matrix Methods 634 AI.2 Determinants 634 Appendix 2: Determination of the Inverse Matrix and Applications Thereof 636 A2.1 Hand Calculation of the Multiple Regression/Correlation Problem 636 A2.2 Testing the Difference Between Partial ßs and Bs From the Same Sample 640 A2.3 Testing the Difference Between ßs for Different Dependent Variables From a Single Sample 642

18 CONTENTS XXÜi Appendix Tables 643 Table A t Values for a =.01,.05 (Two Tailed) 643 Table B z! Transformation of r 644 Table C Normal Distribution 645 Table D F Values for a =.01, Table E L Values for a =.01, Table F Power of Significance Test of r at a =.01,.05 (Two Tailed) 652 Table G n* to Detect r by t Test at a =.01,.05 (Two Tailed) 654 References Glossary Statistical Symbols and Abbreviations Author Index Subject Index i

Applied Regression Analysis and Other Multivariable Methods

Applied Regression Analysis and Other Multivariable Methods THIRD EDITION Applied Regression Analysis and Other Multivariable Methods David G. Kleinbaum Emory University Lawrence L. Kupper University of North Carolina, Chapel Hill Keith E. Muller University of

More information

How To Understand Multivariate Models

How To Understand Multivariate Models Neil H. Timm Applied Multivariate Analysis With 42 Figures Springer Contents Preface Acknowledgments List of Tables List of Figures vii ix xix xxiii 1 Introduction 1 1.1 Overview 1 1.2 Multivariate Models

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Multivariate Statistical Inference and Applications

Multivariate Statistical Inference and Applications Multivariate Statistical Inference and Applications ALVIN C. RENCHER Department of Statistics Brigham Young University A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim

More information

Elements of statistics (MATH0487-1)

Elements of statistics (MATH0487-1) Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -

More information

Regression Modeling Strategies

Regression Modeling Strategies Frank E. Harrell, Jr. Regression Modeling Strategies With Applications to Linear Models, Logistic Regression, and Survival Analysis With 141 Figures Springer Contents Preface Typographical Conventions

More information

Algebra 1 Course Information

Algebra 1 Course Information Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through

More information

Univariate and Multivariate Methods PEARSON. Addison Wesley

Univariate and Multivariate Methods PEARSON. Addison Wesley Time Series Analysis Univariate and Multivariate Methods SECOND EDITION William W. S. Wei Department of Statistics The Fox School of Business and Management Temple University PEARSON Addison Wesley Boston

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm

Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Correlational Research

Correlational Research Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.

More information

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection

Adequacy of Biomath. Models. Empirical Modeling Tools. Bayesian Modeling. Model Uncertainty / Selection Directions in Statistical Methodology for Multivariable Predictive Modeling Frank E Harrell Jr University of Virginia Seattle WA 19May98 Overview of Modeling Process Model selection Regression shape Diagnostics

More information

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS

CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Examples: Regression And Path Analysis CHAPTER 3 EXAMPLES: REGRESSION AND PATH ANALYSIS Regression analysis with univariate or multivariate dependent variables is a standard procedure for modeling relationships

More information

The Basic Two-Level Regression Model

The Basic Two-Level Regression Model 2 The Basic Two-Level Regression Model The multilevel regression model has become known in the research literature under a variety of names, such as random coefficient model (de Leeuw & Kreft, 1986; Longford,

More information

(and sex and drugs and rock 'n' roll) ANDY FIELD

(and sex and drugs and rock 'n' roll) ANDY FIELD DISCOVERING USING SPSS STATISTICS THIRD EDITION (and sex and drugs and rock 'n' roll) ANDY FIELD CONTENTS Preface How to use this book Acknowledgements Dedication Symbols used in this book Some maths revision

More information

11. Analysis of Case-control Studies Logistic Regression

11. Analysis of Case-control Studies Logistic Regression Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2015 Examinations Aim The aim of the Probability and Mathematical Statistics subject is to provide a grounding in

More information

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)

Overview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and

More information

Multivariate Analysis. Overview

Multivariate Analysis. Overview Multivariate Analysis Overview Introduction Multivariate thinking Body of thought processes that illuminate the interrelatedness between and within sets of variables. The essence of multivariate thinking

More information

APPLIED MISSING DATA ANALYSIS

APPLIED MISSING DATA ANALYSIS APPLIED MISSING DATA ANALYSIS Craig K. Enders Series Editor's Note by Todd D. little THE GUILFORD PRESS New York London Contents 1 An Introduction to Missing Data 1 1.1 Introduction 1 1.2 Chapter Overview

More information

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-

More information

College Readiness LINKING STUDY

College Readiness LINKING STUDY College Readiness LINKING STUDY A Study of the Alignment of the RIT Scales of NWEA s MAP Assessments with the College Readiness Benchmarks of EXPLORE, PLAN, and ACT December 2011 (updated January 17, 2012)

More information

How To Understand The Theory Of Probability

How To Understand The Theory Of Probability Graduate Programs in Statistics Course Titles STAT 100 CALCULUS AND MATR IX ALGEBRA FOR STATISTICS. Differential and integral calculus; infinite series; matrix algebra STAT 195 INTRODUCTION TO MATHEMATICAL

More information

Statistical Rules of Thumb

Statistical Rules of Thumb Statistical Rules of Thumb Second Edition Gerald van Belle University of Washington Department of Biostatistics and Department of Environmental and Occupational Health Sciences Seattle, WA WILEY AJOHN

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1) CORRELATION AND REGRESSION / 47 CHAPTER EIGHT CORRELATION AND REGRESSION Correlation and regression are statistical methods that are commonly used in the medical literature to compare two or more variables.

More information

SAS Software to Fit the Generalized Linear Model

SAS Software to Fit the Generalized Linear Model SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling

More information

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA

CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA We Can Early Learning Curriculum PreK Grades 8 12 INSIDE ALGEBRA, GRADES 8 12 CORRELATED TO THE SOUTH CAROLINA COLLEGE AND CAREER-READY FOUNDATIONS IN ALGEBRA April 2016 www.voyagersopris.com Mathematical

More information

Organizing Your Approach to a Data Analysis

Organizing Your Approach to a Data Analysis Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING

More information

MATHEMATICAL METHODS OF STATISTICS

MATHEMATICAL METHODS OF STATISTICS MATHEMATICAL METHODS OF STATISTICS By HARALD CRAMER TROFESSOK IN THE UNIVERSITY OF STOCKHOLM Princeton PRINCETON UNIVERSITY PRESS 1946 TABLE OF CONTENTS. First Part. MATHEMATICAL INTRODUCTION. CHAPTERS

More information

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

HLM software has been one of the leading statistical packages for hierarchical

HLM software has been one of the leading statistical packages for hierarchical Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush

More information

Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@

Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Developing Risk Adjustment Techniques Using the SAS@ System for Assessing Health Care Quality in the lmsystem@ Yanchun Xu, Andrius Kubilius Joint Commission on Accreditation of Healthcare Organizations,

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document

More information

Multiple Regression: What Is It?

Multiple Regression: What Is It? Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in

More information

Big Ideas in Mathematics

Big Ideas in Mathematics Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards

More information

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19 PREFACE xi 1 INTRODUCTION 1 1.1 Overview 1 1.2 Definition 1 1.3 Preparation 2 1.3.1 Overview 2 1.3.2 Accessing Tabular Data 3 1.3.3 Accessing Unstructured Data 3 1.3.4 Understanding the Variables and Observations

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel

More information

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University

Practical. I conometrics. data collection, analysis, and application. Christiana E. Hilmer. Michael J. Hilmer San Diego State University Practical I conometrics data collection, analysis, and application Christiana E. Hilmer Michael J. Hilmer San Diego State University Mi Table of Contents PART ONE THE BASICS 1 Chapter 1 An Introduction

More information

Computer-Aided Multivariate Analysis

Computer-Aided Multivariate Analysis Computer-Aided Multivariate Analysis FOURTH EDITION Abdelmonem Af if i Virginia A. Clark and Susanne May CHAPMAN & HALL/CRC A CRC Press Company Boca Raton London New York Washington, D.C Contents Preface

More information

BookTOC.txt. 1. Functions, Graphs, and Models. Algebra Toolbox. Sets. The Real Numbers. Inequalities and Intervals on the Real Number Line

BookTOC.txt. 1. Functions, Graphs, and Models. Algebra Toolbox. Sets. The Real Numbers. Inequalities and Intervals on the Real Number Line College Algebra in Context with Applications for the Managerial, Life, and Social Sciences, 3rd Edition Ronald J. Harshbarger, University of South Carolina - Beaufort Lisa S. Yocco, Georgia Southern University

More information

Moderation. Moderation

Moderation. Moderation Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation

More information

Moderator and Mediator Analysis

Moderator and Mediator Analysis Moderator and Mediator Analysis Seminar General Statistics Marijtje van Duijn October 8, Overview What is moderation and mediation? What is their relation to statistical concepts? Example(s) October 8,

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA

CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA Examples: Mixture Modeling With Longitudinal Data CHAPTER 8 EXAMPLES: MIXTURE MODELING WITH LONGITUDINAL DATA Mixture modeling refers to modeling with categorical latent variables that represent subpopulations

More information

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing! MATH BOOK OF PROBLEMS SERIES New from Pearson Custom Publishing! The Math Book of Problems Series is a database of math problems for the following courses: Pre-algebra Algebra Pre-calculus Calculus Statistics

More information

SPSS Explore procedure

SPSS Explore procedure SPSS Explore procedure One useful function in SPSS is the Explore procedure, which will produce histograms, boxplots, stem-and-leaf plots and extensive descriptive statistics. To run the Explore procedure,

More information

South Carolina College- and Career-Ready (SCCCR) Probability and Statistics

South Carolina College- and Career-Ready (SCCCR) Probability and Statistics South Carolina College- and Career-Ready (SCCCR) Probability and Statistics South Carolina College- and Career-Ready Mathematical Process Standards The South Carolina College- and Career-Ready (SCCCR)

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Supplementary PROCESS Documentation

Supplementary PROCESS Documentation Supplementary PROCESS Documentation This document is an addendum to Appendix A of Introduction to Mediation, Moderation, and Conditional Process Analysis that describes options and output added to PROCESS

More information

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program

Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department/Academic Unit: Public Health Sciences Degree Program: Biostatistics Collaborative Program Department of Mathematics and Statistics Degree Level Expectations, Learning Outcomes, Indicators of

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:

More information

QUANTITATIVE METHODS. for Decision Makers. Mik Wisniewski. Fifth Edition. FT Prentice Hall

QUANTITATIVE METHODS. for Decision Makers. Mik Wisniewski. Fifth Edition. FT Prentice Hall Fifth Edition QUANTITATIVE METHODS for Decision Makers Mik Wisniewski Senior Research Fellow, Department of Management Science, University of Strathclyde Business School FT Prentice Hall FINANCIAL TIMES

More information

Contrast Coding in Multiple Regression Analysis: Strengths, Weaknesses, and Utility of Popular Coding Structures

Contrast Coding in Multiple Regression Analysis: Strengths, Weaknesses, and Utility of Popular Coding Structures Journal of Data Science 8(2010), 61-73 Contrast Coding in Multiple Regression Analysis: Strengths, Weaknesses, and Utility of Popular Coding Structures Matthew J. Davis Texas A&M University Abstract: The

More information

Overview... 2. Accounting for Business (MCD1010)... 3. Introductory Mathematics for Business (MCD1550)... 4. Introductory Economics (MCD1690)...

Overview... 2. Accounting for Business (MCD1010)... 3. Introductory Mathematics for Business (MCD1550)... 4. Introductory Economics (MCD1690)... Unit Guide Diploma of Business Contents Overview... 2 Accounting for Business (MCD1010)... 3 Introductory Mathematics for Business (MCD1550)... 4 Introductory Economics (MCD1690)... 5 Introduction to Management

More information

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest

Analyzing Intervention Effects: Multilevel & Other Approaches. Simplest Intervention Design. Better Design: Have Pretest Analyzing Intervention Effects: Multilevel & Other Approaches Joop Hox Methodology & Statistics, Utrecht Simplest Intervention Design R X Y E Random assignment Experimental + Control group Analysis: t

More information

When to Use a Particular Statistical Test

When to Use a Particular Statistical Test When to Use a Particular Statistical Test Central Tendency Univariate Descriptive Mode the most commonly occurring value 6 people with ages 21, 22, 21, 23, 19, 21 - mode = 21 Median the center value the

More information

Introduction to Longitudinal Data Analysis

Introduction to Longitudinal Data Analysis Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction

More information

096 Professional Readiness Examination (Mathematics)

096 Professional Readiness Examination (Mathematics) 096 Professional Readiness Examination (Mathematics) Effective after October 1, 2013 MI-SG-FLD096M-02 TABLE OF CONTENTS PART 1: General Information About the MTTC Program and Test Preparation OVERVIEW

More information

Section 3 Part 1. Relationships between two numerical variables

Section 3 Part 1. Relationships between two numerical variables Section 3 Part 1 Relationships between two numerical variables 1 Relationship between two variables The summary statistics covered in the previous lessons are appropriate for describing a single variable.

More information

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics

MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS. Biostatistics MEU. INSTITUTE OF HEALTH SCIENCES COURSE SYLLABUS title- course code: Program name: Contingency Tables and Log Linear Models Level Biostatistics Hours/week Ther. Recite. Lab. Others Total Master of Sci.

More information

DISCRIMINANT FUNCTION ANALYSIS (DA)

DISCRIMINANT FUNCTION ANALYSIS (DA) DISCRIMINANT FUNCTION ANALYSIS (DA) John Poulsen and Aaron French Key words: assumptions, further reading, computations, standardized coefficents, structure matrix, tests of signficance Introduction Discriminant

More information

Curriculum - Doctor of Philosophy

Curriculum - Doctor of Philosophy Curriculum - Doctor of Philosophy CORE COURSES Pharm 545-546.Pharmacoeconomics, Healthcare Systems Review. (3, 3) Exploration of the cultural foundations of pharmacy. Development of the present state of

More information

Multivariate Logistic Regression

Multivariate Logistic Regression 1 Multivariate Logistic Regression As in univariate logistic regression, let π(x) represent the probability of an event that depends on p covariates or independent variables. Then, using an inv.logit formulation

More information

Overview of Factor Analysis

Overview of Factor Analysis Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,

More information

GRADES 7, 8, AND 9 BIG IDEAS

GRADES 7, 8, AND 9 BIG IDEAS Table 1: Strand A: BIG IDEAS: MATH: NUMBER Introduce perfect squares, square roots, and all applications Introduce rational numbers (positive and negative) Introduce the meaning of negative exponents for

More information

The Statistics Tutor s Quick Guide to

The Statistics Tutor s Quick Guide to statstutor community project encouraging academics to share statistics support resources All stcp resources are released under a Creative Commons licence The Statistics Tutor s Quick Guide to Stcp-marshallowen-7

More information

Interpretation of Somers D under four simple models

Interpretation of Somers D under four simple models Interpretation of Somers D under four simple models Roger B. Newson 03 September, 04 Introduction Somers D is an ordinal measure of association introduced by Somers (96)[9]. It can be defined in terms

More information

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity

More information

Introduction to Data Analysis in Hierarchical Linear Models

Introduction to Data Analysis in Hierarchical Linear Models Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM

More information

Poisson Models for Count Data

Poisson Models for Count Data Chapter 4 Poisson Models for Count Data In this chapter we study log-linear models for count data under the assumption of a Poisson error structure. These models have many applications, not only to the

More information

1 Theory: The General Linear Model

1 Theory: The General Linear Model QMIN GLM Theory - 1.1 1 Theory: The General Linear Model 1.1 Introduction Before digital computers, statistics textbooks spoke of three procedures regression, the analysis of variance (ANOVA), and the

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance

More information

Correlation and Regression Analysis: SPSS

Correlation and Regression Analysis: SPSS Correlation and Regression Analysis: SPSS Bivariate Analysis: Cyberloafing Predicted from Personality and Age These days many employees, during work hours, spend time on the Internet doing personal things,

More information

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents

Linda K. Muthén Bengt Muthén. Copyright 2008 Muthén & Muthén www.statmodel.com. Table Of Contents Mplus Short Courses Topic 2 Regression Analysis, Eploratory Factor Analysis, Confirmatory Factor Analysis, And Structural Equation Modeling For Categorical, Censored, And Count Outcomes Linda K. Muthén

More information

Teaching Multivariate Analysis to Business-Major Students

Teaching Multivariate Analysis to Business-Major Students Teaching Multivariate Analysis to Business-Major Students Wing-Keung Wong and Teck-Wong Soon - Kent Ridge, Singapore 1. Introduction During the last two or three decades, multivariate statistical analysis

More information

Logistic regression modeling the probability of success

Logistic regression modeling the probability of success Logistic regression modeling the probability of success Regression models are usually thought of as only being appropriate for target variables that are continuous Is there any situation where we might

More information

Data analysis process

Data analysis process Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression

Data Mining and Data Warehousing. Henryk Maciejewski. Data Mining Predictive modelling: regression Data Mining and Data Warehousing Henryk Maciejewski Data Mining Predictive modelling: regression Algorithms for Predictive Modelling Contents Regression Classification Auxiliary topics: Estimation of prediction

More information

TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2

TABLE OF CONTENTS. About Chi Squares... 1. What is a CHI SQUARE?... 1. Chi Squares... 1. Hypothesis Testing with Chi Squares... 2 About Chi Squares TABLE OF CONTENTS About Chi Squares... 1 What is a CHI SQUARE?... 1 Chi Squares... 1 Goodness of fit test (One-way χ 2 )... 1 Test of Independence (Two-way χ 2 )... 2 Hypothesis Testing

More information

Module 4 - Multiple Logistic Regression

Module 4 - Multiple Logistic Regression Module 4 - Multiple Logistic Regression Objectives Understand the principles and theory underlying logistic regression Understand proportions, probabilities, odds, odds ratios, logits and exponents Be

More information

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS

MATH. ALGEBRA I HONORS 9 th Grade 12003200 ALGEBRA I HONORS * Students who scored a Level 3 or above on the Florida Assessment Test Math Florida Standards (FSA-MAFS) are strongly encouraged to make Advanced Placement and/or dual enrollment courses their first choices

More information

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus

Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Auxiliary Variables in Mixture Modeling: 3-Step Approaches Using Mplus Tihomir Asparouhov and Bengt Muthén Mplus Web Notes: No. 15 Version 8, August 5, 2014 1 Abstract This paper discusses alternatives

More information

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship)

Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) 1 Calculating, Interpreting, and Reporting Estimates of Effect Size (Magnitude of an Effect or the Strength of a Relationship) I. Authors should report effect sizes in the manuscript and tables when reporting

More information

Module 3: Correlation and Covariance

Module 3: Correlation and Covariance Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis

More information