Multivariate Analysis. Overview
|
|
- Rhoda Smith
- 8 years ago
- Views:
Transcription
1 Multivariate Analysis Overview
2 Introduction Multivariate thinking Body of thought processes that illuminate the interrelatedness between and within sets of variables. The essence of multivariate thinking is to expose the inherent structure and meaning revealed within these sets of variables through application and interpretation of various statistical methods
3 Why the multivariate approach? Big idea- multiple response outcomes With univariate analyses we have just one dependent variable of interest Although any analysis of data involving more than one variable could be seen as multivariate, we typically reserve the term for multiple dependent variables So MV analysis is an extension of UV ones, or conversely, many of the UV analyses are special cases of MV ones
4 Why MV over the univariate approach? Complexity The subject/data studied may be more complex than what univariate methods can offer in terms of analysis Reality In some cases it would be inappropriate to conduct univariate analysis as the data/research demand a multivariate analysis
5 Why MV over the univariate approach? Experimental data Although experimental research can be and often is multivariate, typically subjects are assigned to groups and the manipulations regard corresponding changes to a single outcome Different doses of caffeine test performance Causality is more easily deduced Non-experimental data Likewise survey/inventory data might be analyzed in univariate fashion, but typically it will require the multivariate approach to solve the questions stemming from it Correlational
6 Why not MV? In the past the computations were overwhelming even with smaller datasets, and so MV analyses were typically avoided Now this is not a problem but there are still reasons to not do a MV analysis
7 Why not MV? Ambiguity MV analysis may result in a less clear understanding of the data E.g. group differences on a linear combination of DVs (Manova) Differences are easily interpreted in a univariate sense Ambiguity because of ignorance of the technique is not a valid reason however Unnecessary complexity Just because SEM looks neat/is popular doesn t mean you have to do one, or that it is the best way to answer your research question No free lunch MV analyses come with their own rules and assumptions that may make analysis difficult or not as strong
8 Multivariate Pros and Cons Summary Advantages of using a multivariate statistic Richer realistic design Looks at phenomena in an overarching way (provides multiple levels of analysis) Each method differs in amount or type of Independent Variables (IVs) and DVs Can help control for Type I Error Disadvantages Larger Ns are often required More difficult to interpret Less known about the robustness of assumptions
9 Primary purposes of MV analysis Prediction and explanation Determining structure
10 Prediction The goal in most research situations is to be able to predict outcomes based on prior information E.g. given a person s gender and region, what will their attitude be on some social issue? Given a number of variables how well can we predict group membership? Explanation Which variables are most important in the prediction of some outcome? In many cases this is end goal of an analysis, though a very problematic one
11 A caveat regarding explanation Determining variable importance can be a suspect endeavor Something that might be deemed a statistically significant variable may not make the cut had the study been conducted again Depending on a number of factors, results may be sample specific i.e. you may not see the same ordering next time
12 Structure A different goal in MV analysis is to determine the structure of the data Is there an underlying dimension that can describe the data in a simpler fashion? Methods involve classification and/or data reduction Latent variables (constructs) Example: Observed variables Giddiness, Silliness, Irrationality, Possessiveness and Misunderstanding reduced to the underlying construct of Love Interest may be in reducing variables (Factor analysis), emphasis on group membership (Cluster analysis), stimulus structure (MDS) etc.
13 Prediction and Structure Both prediction and structure may be the goal of analysis SEM and path analysis How well does the model fit the data?
14 Multivariate Themes Multiple Theories and Hypotheses Multiple Empirical Studies Multiple Measures Multiplicity Theme Multiple considerations at all levels of focus, with greater multiplicity generally leading to greater reliability, validity, and generalization: Multiple Time Points Multiple Controls Multiple Samples Practical Implications Multiple Statistical Methods
15 Multivariate Themes Variance Systematic Random Central Themes All multivariate methods focus on these central themes: Covariance Ratio of Variances Linear Combinations (e.g., Components, Factors) Interpretation Themes Big picture and Specific levels Macro-Assessment (e.g., Significance Test & Effect Size) Micro-Assessment (e.g., Examining Means or Weights)
16 Things to consider Initial variable choice Comes down to: Familiarity with previous research Instrument used Expertise with field of study Common sense Much of the hard work consists of developing a plan of attack and deciding on how to study the problem
17 Initial Examination of Data Preliminary analysis A thorough initial examination of the data is not only required but also necessary for a full understanding of any research Such initial analyses provide a better grasp of what is happening in the data and may inform the MV analysis to a certain extent However, in the MV case, if the actual goal is interpretation of the UV analyses (as one often sees in MANOVA), the MV analysis is unwarranted
18 More to consider Intro now, more details as we discuss each method Assumptions important for inferences beyond the sample Normality: Basic assumption of General Linear Model; concerned with an elliptical pattern of residuals for the data Skewness: Distribution of scores is tilted (asymmetrical) Direction established by tail greater skewness = less normality Kurtosis: Degree of peakedness of data 3 Types: leptokurtic (thin); mesokurtic (normal); platykurtic (flattened)
19 More to consider Linearity Data forms a relatively straight oval line when plotted Homoscedasticity variance of 1 variable is equal at all levels of other variables understood through standard deviations across variables and scatter plots Referred to as homogeneity of variance in ANOVA methods Homogeneity of regression Regression slopes between covariate and DV are equal across groups of IV Do not want this statistic (F) to be significantly different if so, violation of assumption for (M)ANCOVA
20 More to consider Multicollinearity Correlation coefficient (r) between predictors is noticeably large Causes instability in the statistical procedure Can t differentiate which variables are contributing to outcome Singularity Redundant variables brings discriminant in equation to zero Orthogonality Allows no association among variables Not realistic in real world data May allow greater interpretability versus data that are too related
21 More to consider Outliers Effect mean (inflate/deflate) disguising true relationship Distort data create noise (error) lose power Transformations (log or square root) may be helpful with outliers Reshapes distribution creating a more normal distribution However you now have a scale with which you are unfamiliar and which you cannot generalize back to the original
22 Some distinctions Types of data Nominal/Categorical Ordinal Continuous Interval or Ratio The types of variables involved will say much about what analyses are going to be appropriate and/or how one might proceed with a particular analysis
23 Types of data One thing to keep in mind is that these distinctions are largely arbitrary One can dichotomize a continuous measure into categories A bad idea most of the time An ordinal measure (e.g. likert question) has a mean/construct that actually falls along a continuum How the data is to be considered is largely left to the researcher
24 Sample vs. Population In typical research we are rarely dealing with a population The goal in research is not to simply describe our data but to generalize to the real world from which the sample is taken This is the purpose of conducting inferential analyses which require certain assumptions to be met in order to be utilized Many analyses and data collection are for a variety of reasons (not good) sample-specific, and not much use to the scientific community Take care in the initial phase of research planning to help guard against such a situation
25 The linear combination of variables Whether of IVs or DVs, a linear combination of variables is often necessary to interpret the data This idea is essential to thinking multivariately MultReg Finding the linear combination of IVs that best predicts the DV Manova What linear combination of DVs maximizes the distinction between groups
26 How many variables Considerations Cost Availability Meaningfulness Theory For ease of understanding and efficiency we typically want the fewest number of variables that will explain the most Ockham s razor
27 Statistical power and effect size A problem that has plagued the social sciences is the lack of power to find subtle effects Some multivariate procedures will require relatively large amounts of data (e.g. SEM) Power and sample size are a required consideration before any attempt at research, multivariate or otherwise, though typically sample size will be determined by the practicalities and limitations of the research After the fact, emphasis should be placed on effect size and model fit, rather than p-values More later
28 The matrices of interest Data matrix What you see in SPSS or whatever program you re using Includes the cases and their corresponding values for the variables of interest Correlation matrix- R Contains information about the linear relationship between variables Standardized covariance Symmetrical Square cov r = xy ss x y Typically only the bottom portion is shown as the top portion is its mirror image and the diagonal contains all ones (each variable is perfectly correlated with itself)
29 The matrices of interest Variance/Covariance matrix - Σ Square and symmetrical Variance of each variable is on the diagonal, covariances with other variables on the offdiagonals In some cases you will have the option to use correlations or covariances as the unit of analysis, with some debate about which is better under what circumstances
30 The matrices of interest Sum of Squares and cross-products matrix - S Precursor to the Variance/Covariance matrix (the values before division by N-1) On the diagonal is a variable s sum of the squared deviations from its mean Off-diagonal elements are the sum of the products of the deviation scores for the two variables
31 Methods of analysis A host of methods are available to the researcher The kind of question asked will help guide one in choosing the appropriate analysis, however the data may be available to multiple methods, and almost always is
32 Degree of relationship Bivariate r The degree of linear relationship between two variables Partial and semi-partial Multiple R The relationship of a set of variables to another (dependent) variable Canonical R The grandaddy Relationship between sets of variables Methods are also available to assess the relationship among non-continuous variables E.g. Chi-square, Multiway Frequency Analysis
33 Group Differences Very popular research question in social sciences (too popular really) Is group A different from B? The answer is always yes, and with a large enough sample, statistically significantly so Anova and related Manova the multivariate counterpart Repeated measures
34 Predicting group membership Turning the group difference question the other way around Discriminant function analysis Logistic regression
35 Structure Data reduction and classification Cluster analysis Seeks to identify homogeneous subgroups of cases or variables based on some measure of distance Identify a set of groups in which within-group variation is minimized and between-group variation is maximized Principal components and Factor analysis Reduce a large number of variables to smaller Often used in psych for the development of inventories Structural equation modeling Where factor analysis and regression meet
36 Time course of events How long is it before some event occurs? How does a DV change over the course of time? The former question can be answered with survival/failure analysis Survival rates for disease Time before failure for a particular electronic part The latter is often examined with time-series analysis Many time periods are available for analysis E.g. monthly stock prices over the past five years Popular in the economics realm
37 Decision tree
38 Decision tree
39 Decision tree Although such guides may be useful, as mentioned before, multiple analyses may be appropriate for the data under consideration The best plan of attack is to have a well-defined research question, and collect data appropriate to the analysis that will best answer that question
40 Multivariate Methods: Quick Glance Organizational Chart based on: Type of Research Focus (Group differences or Correlational). Research Question IVs: Number and Scale # & Scale Method Research Focus IVs DVs Multivariate Number & Scale Number & Scale Method Group Differences 1+ categorical & continuous 1 continuous ANCOVA 1+ categorical 2+ continuous MANOVA 2+ continuous 1+ categorical DFA 1+categ or cont 1 categorical LR Correlational 2+ continuous 1 continuous MR 2+ continuous 2+ continuous CC - 2+ continuous PCA & FA Note: Scale and number of Independent (IV) and Dependent (DV) categorical or continuous variables. + indicates 1 or more; ANCOVA = Analysis of Covariance; MANOVA = Multivariate Analysis of Variance; DFA = Discriminant Function Analysis; LR=Logistic Regression; MR = Multiple Regression; CC = Canonical Correlation; PCA/FA = Principal Components/Factor Analysis
41 Summary of Methods The multivariate methods we will look at are a set of tools for analyzing multiple variables in an integrated and powerful way. They allow the examination of richer and perhaps more realistic designs than can be assessed with traditional univariate methods that only analyze one outcome variable and usually just one or two independent variables (IVs) Compared to univariate methods, multivariate methods allow us to analyze a complex array of variables, providing greater assurance that we can come to some synthesizing conclusions with less error and more validity than if we were to analyze variables in isolation.
DISCRIMINANT FUNCTION ANALYSIS (DA)
DISCRIMINANT FUNCTION ANALYSIS (DA) John Poulsen and Aaron French Key words: assumptions, further reading, computations, standardized coefficents, structure matrix, tests of signficance Introduction Discriminant
More informationWhen to Use a Particular Statistical Test
When to Use a Particular Statistical Test Central Tendency Univariate Descriptive Mode the most commonly occurring value 6 people with ages 21, 22, 21, 23, 19, 21 - mode = 21 Median the center value the
More informationMultivariate Analysis of Variance (MANOVA)
Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction
More informationBusiness Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.
Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing
More informationCourse Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics
Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This
More informationModule 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
More informationHow To Understand Multivariate Models
Neil H. Timm Applied Multivariate Analysis With 42 Figures Springer Contents Preface Acknowledgments List of Tables List of Figures vii ix xix xxiii 1 Introduction 1 1.1 Overview 1 1.2 Multivariate Models
More informationData analysis process
Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis
More informationMultiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationProfile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:
Profile Analysis Introduction Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: ) Comparing the same dependent variables
More informationService courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.
Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are
More informationSPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011
SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011 Statistical techniques to be covered Explore relationships among variables Correlation Regression/Multiple regression Logistic regression Factor analysis
More informationIntroduction to Regression and Data Analysis
Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it
More informationIntroduction to Principal Components and FactorAnalysis
Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a
More informationMultivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
More informationLecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
More informationHow to Get More Value from Your Survey Data
Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2
More informationHLM software has been one of the leading statistical packages for hierarchical
Introductory Guide to HLM With HLM 7 Software 3 G. David Garson HLM software has been one of the leading statistical packages for hierarchical linear modeling due to the pioneering work of Stephen Raudenbush
More information10. Analysis of Longitudinal Studies Repeat-measures analysis
Research Methods II 99 10. Analysis of Longitudinal Studies Repeat-measures analysis This chapter builds on the concepts and methods described in Chapters 7 and 8 of Mother and Child Health: Research methods.
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationMultivariate analyses
14 Multivariate analyses Learning objectives By the end of this chapter you should be able to: Recognise when it is appropriate to use multivariate analyses (MANOVA) and which test to use (traditional
More informationUnivariate Regression
Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is
More informationSimple Predictive Analytics Curtis Seare
Using Excel to Solve Business Problems: Simple Predictive Analytics Curtis Seare Copyright: Vault Analytics July 2010 Contents Section I: Background Information Why use Predictive Analytics? How to use
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationMultivariate Statistical Inference and Applications
Multivariate Statistical Inference and Applications ALVIN C. RENCHER Department of Statistics Brigham Young University A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim
More informationHYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION
HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate
More informationWhy Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
More informationIntroduction to Longitudinal Data Analysis
Introduction to Longitudinal Data Analysis Longitudinal Data Analysis Workshop Section 1 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section 1: Introduction
More informationCase Study in Data Analysis Does a drug prevent cardiomegaly in heart failure?
Case Study in Data Analysis Does a drug prevent cardiomegaly in heart failure? Harvey Motulsky hmotulsky@graphpad.com This is the first case in what I expect will be a series of case studies. While I mention
More informationMultivariate Analysis of Variance (MANOVA)
Chapter 415 Multivariate Analysis of Variance (MANOVA) Introduction Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In ANOVA, differences among various
More informationREVIEWING THREE DECADES WORTH OF STATISTICAL ADVANCEMENTS IN INDUSTRIAL-ORGANIZATIONAL PSYCHOLOGICAL RESEARCH
1 REVIEWING THREE DECADES WORTH OF STATISTICAL ADVANCEMENTS IN INDUSTRIAL-ORGANIZATIONAL PSYCHOLOGICAL RESEARCH Nicholas Wrobel Faculty Sponsor: Kanako Taku Department of Psychology, Oakland University
More informationII. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
More informationSilvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com
SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING
More informationCommon factor analysis
Common factor analysis This is what people generally mean when they say "factor analysis" This family of techniques uses an estimate of common variance among the original variables to generate the factor
More informationSTATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
More informationFactor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models
Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis
More informationCorrelational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
More informationSimple linear regression
Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between
More informationWeek 1. Exploratory Data Analysis
Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam
More informationJanuary 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
More informationIntroduction to Data Analysis in Hierarchical Linear Models
Introduction to Data Analysis in Hierarchical Linear Models April 20, 2007 Noah Shamosh & Frank Farach Social Sciences StatLab Yale University Scope & Prerequisites Strong applied emphasis Focus on HLM
More informationDimensionality Reduction: Principal Components Analysis
Dimensionality Reduction: Principal Components Analysis In data mining one often encounters situations where there are a large number of variables in the database. In such situations it is very likely
More informationMissing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13
Missing Data: Part 1 What to Do? Carol B. Thompson Johns Hopkins Biostatistics Center SON Brown Bag 3/20/13 Overview Missingness and impact on statistical analysis Missing data assumptions/mechanisms Conventional
More informationMathematics within the Psychology Curriculum
Mathematics within the Psychology Curriculum Statistical Theory and Data Handling Statistical theory and data handling as studied on the GCSE Mathematics syllabus You may have learnt about statistics and
More informationDescription. Textbook. Grading. Objective
EC151.02 Statistics for Business and Economics (MWF 8:00-8:50) Instructor: Chiu Yu Ko Office: 462D, 21 Campenalla Way Phone: 2-6093 Email: kocb@bc.edu Office Hours: by appointment Description This course
More informationStatistics. Measurement. Scales of Measurement 7/18/2012
Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does
More informationAn analysis method for a quantitative outcome and two categorical explanatory variables.
Chapter 11 Two-Way ANOVA An analysis method for a quantitative outcome and two categorical explanatory variables. If an experiment has a quantitative outcome and two categorical explanatory variables that
More informationFairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
More informationPart 2: Analysis of Relationship Between Two Variables
Part 2: Analysis of Relationship Between Two Variables Linear Regression Linear correlation Significance Tests Multiple regression Linear Regression Y = a X + b Dependent Variable Independent Variable
More informationOrganizing Your Approach to a Data Analysis
Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize
More informationCanonical Correlation Analysis
Canonical Correlation Analysis LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the similarities and differences between multiple regression, factor analysis,
More informationSection Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini
NEW YORK UNIVERSITY ROBERT F. WAGNER GRADUATE SCHOOL OF PUBLIC SERVICE Course Syllabus Spring 2016 Statistical Methods for Public, Nonprofit, and Health Management Section Format Day Begin End Building
More informationA Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution
A Primer on Mathematical Statistics and Univariate Distributions; The Normal Distribution; The GLM with the Normal Distribution PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 4: September
More informationDescriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
More informationBasic Concepts in Research and Data Analysis
Basic Concepts in Research and Data Analysis Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...3 The Research Question... 3 The Hypothesis... 4 Defining the
More informationDirections for using SPSS
Directions for using SPSS Table of Contents Connecting and Working with Files 1. Accessing SPSS... 2 2. Transferring Files to N:\drive or your computer... 3 3. Importing Data from Another File Format...
More informationExploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016
and Principal Components Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016 Agenda Brief History and Introductory Example Factor Model Factor Equation Estimation of Loadings
More informationModeration. Moderation
Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation
More informationIntroduction to Statistics and Quantitative Research Methods
Introduction to Statistics and Quantitative Research Methods Purpose of Presentation To aid in the understanding of basic statistics, including terminology, common terms, and common statistical methods.
More informationExploratory Data Analysis. Psychology 3256
Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find
More informationExample: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
More informationModule 5: Multiple Regression Analysis
Using Statistical Data Using to Make Statistical Decisions: Data Multiple to Make Regression Decisions Analysis Page 1 Module 5: Multiple Regression Analysis Tom Ilvento, University of Delaware, College
More informationElements of statistics (MATH0487-1)
Elements of statistics (MATH0487-1) Prof. Dr. Dr. K. Van Steen University of Liège, Belgium December 10, 2012 Introduction to Statistics Basic Probability Revisited Sampling Exploratory Data Analysis -
More information[This document contains corrections to a few typos that were found on the version available through the journal s web page]
Online supplement to Hayes, A. F., & Preacher, K. J. (2014). Statistical mediation analysis with a multicategorical independent variable. British Journal of Mathematical and Statistical Psychology, 67,
More informationDesign & Analysis of Ecological Data. Landscape of Statistical Methods...
Design & Analysis of Ecological Data Landscape of Statistical Methods: Part 3 Topics: 1. Multivariate statistics 2. Finding groups - cluster analysis 3. Testing/describing group differences 4. Unconstratined
More informationApplied Multiple Regression/Correlation Analysis for the Behavioral Sciences
Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences Third Edition Jacob Cohen (deceased) New York University Patricia Cohen New York State Psychiatric Institute and Columbia University
More informationOverview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
More informationDESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS
DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics
More informationbusiness statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar
business statistics using Excel Glyn Davis & Branko Pecar OXFORD UNIVERSITY PRESS Detailed contents Introduction to Microsoft Excel 2003 Overview Learning Objectives 1.1 Introduction to Microsoft Excel
More informationRegression Analysis: A Complete Example
Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty
More information(and sex and drugs and rock 'n' roll) ANDY FIELD
DISCOVERING USING SPSS STATISTICS THIRD EDITION (and sex and drugs and rock 'n' roll) ANDY FIELD CONTENTS Preface How to use this book Acknowledgements Dedication Symbols used in this book Some maths revision
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level of Significance
More informationAnalysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk
Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
More informationAnalysis of Data. Organizing Data Files in SPSS. Descriptive Statistics
Analysis of Data Claudia J. Stanny PSY 67 Research Design Organizing Data Files in SPSS All data for one subject entered on the same line Identification data Between-subjects manipulations: variable to
More information4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"
Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses
More informationUNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)
UNDERSTANDING ANALYSIS OF COVARIANCE () In general, research is conducted for the purpose of explaining the effects of the independent variable on the dependent variable, and the purpose of research design
More informationDEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9
DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9 Analysis of covariance and multiple regression So far in this course,
More informationFactor Analysis. Chapter 420. Introduction
Chapter 420 Introduction (FA) is an exploratory technique applied to a set of observed variables that seeks to find underlying factors (subsets of variables) from which the observed variables were generated.
More informationChapter Eight: Quantitative Methods
Chapter Eight: Quantitative Methods RESEARCH DESIGN Qualitative, Quantitative, and Mixed Methods Approaches Third Edition John W. Creswell Chapter Outline Defining Surveys and Experiments Components of
More informationUsing Multivariate Statistics
/ K FIFTH EDITION 2008 AGI-Information Management Consultants May be used for personal purporses only or by libraries associated to dandelon.com network. Using Multivariate Statistics Barbara G. Tabachnick
More informationStatistics Review PSY379
Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses
More informationRunning head: SCHOOL COMPUTER USE AND ACADEMIC PERFORMANCE. Using the U.S. PISA results to investigate the relationship between
Computer Use and Academic Performance- PISA 1 Running head: SCHOOL COMPUTER USE AND ACADEMIC PERFORMANCE Using the U.S. PISA results to investigate the relationship between school computer use and student
More informationImproving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation
More informationTeaching Multivariate Analysis to Business-Major Students
Teaching Multivariate Analysis to Business-Major Students Wing-Keung Wong and Teck-Wong Soon - Kent Ridge, Singapore 1. Introduction During the last two or three decades, multivariate statistical analysis
More informationStatistics Graduate Courses
Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.
More informationDATA COLLECTION AND ANALYSIS
DATA COLLECTION AND ANALYSIS Quality Education for Minorities (QEM) Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. August 23, 2013 Objectives of the Discussion 2 Discuss
More informationAn analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression
Chapter 9 Simple Linear Regression An analysis appropriate for a quantitative outcome and a single quantitative explanatory variable. 9.1 The model behind linear regression When we are examining the relationship
More informationIntroduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
More informationAssumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model
Assumptions Assumptions of linear models Apply to response variable within each group if predictor categorical Apply to error terms from linear model check by analysing residuals Normality Homogeneity
More information11. Analysis of Case-control Studies Logistic Regression
Research methods II 113 11. Analysis of Case-control Studies Logistic Regression This chapter builds upon and further develops the concepts and strategies described in Ch.6 of Mother and Child Health:
More informationBill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1
Bill Burton Albert Einstein College of Medicine william.burton@einstein.yu.edu April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1 Calculate counts, means, and standard deviations Produce
More informationData Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
More informationFactors affecting online sales
Factors affecting online sales Table of contents Summary... 1 Research questions... 1 The dataset... 2 Descriptive statistics: The exploratory stage... 3 Confidence intervals... 4 Hypothesis tests... 4
More informationOverview Classes. 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7)
Overview Classes 12-3 Logistic regression (5) 19-3 Building and applying logistic regression (6) 26-3 Generalizations of logistic regression (7) 2-4 Loglinear models (8) 5-4 15-17 hrs; 5B02 Building and
More informationAppendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP. Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study.
Appendix G STATISTICAL METHODS INFECTIOUS METHODS STATISTICAL ROADMAP Prepared in Support of: CDC/NCEH Cross Sectional Assessment Study Prepared by: Centers for Disease Control and Prevention National
More informationAn introduction to Value-at-Risk Learning Curve September 2003
An introduction to Value-at-Risk Learning Curve September 2003 Value-at-Risk The introduction of Value-at-Risk (VaR) as an accepted methodology for quantifying market risk is part of the evolution of risk
More informationCurriculum - Doctor of Philosophy
Curriculum - Doctor of Philosophy CORE COURSES Pharm 545-546.Pharmacoeconomics, Healthcare Systems Review. (3, 3) Exploration of the cultural foundations of pharmacy. Development of the present state of
More information