Exploratory Factor Analysis Brian Habing - University of South Carolina - October 15, 2003
|
|
|
- Loren Hodge
- 10 years ago
- Views:
Transcription
1 Exploratory Factor Analysis Brian Habing - University of South Carolina - October 15, 2003 FA is not worth the time necessary to understand it and carry it out. -Hills, 1977 Factor analysis should not be used in most practical situations. -Chatfield and Collins, 1980, pg. 89. At the present time, factor analysis still maintains the flavor of an art, and no single strategy should yet be "chiseled into stone". -Johnson and Wichern, 2002, pg The number of articles ISI Web of Science shows with the topic Factor Analsysis for the week ending October 11, The number of FA articles in 2003 through October 15 th. References: Chatfield, C. & Collins, A. J. (1980). Introduction to Multivariate Analysis. London: Chapman and Hall. Hair, J.F. Jr., Anderson, R.E., Tatham, R.L., & Black, W.C. (1998). Multivariate Data Analysis, (5 th Edition). Upper Saddle River, NJ: Prentice Hall. Hatcher, L. (1994). A Step-by-Step Approach to Using the SAS System for Factor Analysis and Structural Equation Modeling. Cary, NC: SAS Institute Inc. Hills, M. (1977). Book Review, Applied Statistics, 26, Johnson, D.E. (1998). Applied Multivariate Methods for Data Analysis. Pacific Grove, CA: Brooks/Cole Publishing. Johnson, R.A. & Wichern, D.W. (2002). Applied Multivariate Statistical Analyses (5 th Edition). Upper Saddle River, NJ: Prentice Hall. Mardia, K.V., Kent, J.T., & Bibby, J.M. (1979). Multivariate Analysis. New York: Academic Press. Sharma, S. (1996). Applied Multivariate Techniques. United States: John Wiley & Sons. Stevens, J. (2002). Applied Multivariate Statistics for the Social Sciences (4 th Edition). Mahwah, NJ: Lawrence Erlbaum Associates. Thompson, B. (2000). Q-Technique Factor Analysis: One Variation on the Two-Mode Factor Analysis of Variables. In Grimm, L.G. & Yarnold, P., Reading and Understanding More Multivariate Statistics. Washington, D.C.: American Psychological Association.
2 Goal: To explain the variance in the observed variables in terms of underlying latent factors. The Model: The following assumes that the p observed variables (the X i ) that have been measured for each of the n subjects have been standardized. X1 = a11f1+ a1 mfm + e1 X2 = a21f1+ a2mfm + e2 X = a F + a F + e p p1 1 pm m p The F j are the m common factors, the e i are the p specific errors, and the a ij are the factor pxm factor loadings. The F j have mean zero and standard deviation one, and are generally assumed to be independent. (We will assume this orthogonality below, but it is not true for oblique rotations.) The e i are also independent and the F j and e i are mutually independent of each other. In matrix form this can be written as: which is equivalent to X = A F + e p 1 p m m 1 p 1 T Σ= AA + cov( e) where Σ pxp is the correlation matrix of X px1. Since the errors are assumed to be independent, cov(e) should be a pxp diagonal matrix. This implies that: Var( X ) = a + Var( e ) i m 2 ij j= 1 The sum of X i 's squared factor loadings is called its communality (the variance it has in common with the other variables through the common factors). The i th error variance is called the specificity of X i (the variance that is specific to variable i). Path Diagram: Factor Analytic models are often represented by path diagrams. Each latent variable is represented by a circle, and each manifest variable is represented by a square. An arrow indicates causality which can get to be a pretty complicated subject in some models! i The diagram at the left is for a 2-factor model of 3 variables (m=2, p=3), where the first variable loads only on factor 1 (a 12 =0), the second variable loads on both factors, and the third variable loads only on factor 2 (a 31 =0).
3 Types of Factor Analysis: Some authors refer to several different types of factor analysis, such as R- Factor Analysis or Q-Factor Analysis. These simply refer to what is serving as the variables (the columns of the data set) and what is serving as the observations (the rows). Columns (what the factors explain) Rows (measured by the columns) R variables participants Q participants variables O occasions variables P variables occasions T occasions participants S participants occasions Thompson (2000, pg. 211) What we have been doing so far has been R factor analysis, we have been looking for latent factors that lie behind the variables. This would, for example, let us group different test questions that seem to be measuring the same underlying construct. In Q factor analysis we would be looking for factors that seem to be underlying the examinees and be asking how many types of people are there. The math is the same, but the terminology and goals can be different. Everything we have below refers to R factor analysis. Is the data appropriate? Correlation - It doesn't make sense to use factor analysis if the different variables are unrelated (why model common factors if they have nothing in common?) Rule of thumb: A substantial number of correlations are > 0.3. Some other methods also include: Bartlett's test of sphericity, the measuring of sampling adequacy (MSA), the anti-image correlation matrix, and the Kaiser-Meyer-Olkin measure (KMO). I haven t seen that much is gained by using these others. Multivariate Normality In order to use maximum likelihood estimation or to perform any of the tests of hypotheses, it is necessary for the data to be multivariate normal. This is not required for fitting the model using principal components or principal factor methods. Sample Size: A general rule of thumb is that you should have at least 50 observations and at least 5 times as many observations as variables. Stevens (2002, pg. 395) summarizes some results that are a bit more specific and backed by simulations. The number of observations required for factors to be reliable depend on the data. In particular on how well the variables load on the different factors. A factor is reliable if it has: 3 or more variables with loadings of 0.8 and any n 4 or more variables with loadings of 0.6 and any n 10 or more variables with loadings of 0.4 and n 150 Factors with only a few loadings require n 300 Obviously this doesn't cover every case, but it does give some guidance.
4 How many factors? There are several rules for determining how many factors are appropriate for your data. Mardia, Kent, and Bibby (1979, pg. 258) point out that there is a limit to how many factors you can have and actually end up with a model that is simpler than what your raw data. The quantity s is the difference between the number of unique values in your data's pxp correlation matrix and the number of parameters in the m factor model: 1 1 s = p m p+ m ( ) ( ) It only makes sense to perform a factor analysis if s>0, and some programs will not let you estimate the factor analytic model if it is not true. Even though you could always exactly fit a 5 factor model to 5 variables using principal components and no specificity, s>0 only for one or two factors. The minimum number of variables required for different numbers of factors are: # Factors Variables Required In general this is not something we worry about too much since we usually want to have a much smaller number of factors than variables. Also, recall that we want several variables loading on each factor before we can actually trust that factor to be meaningful anyway. Kaiser's Criterion / Eigen Value > 1 Take as many factors as there are eigenvalues > 1 for the correlation matrix. Hair, et.al. (1998, pg. 103) reports that this rule is good if there are 20 to 50 variables, but it tends to take too few if there are <20 variables, and to many if there are >50. Stevens (2002, pg. 389) reports that it tends to take too many if there are > 40 variables and their communalities are around 0.4. It tends to be accurate with variables and their communalities are around 0.7. Scree Plot Take the number of factors corresponding to the last eigenvalue before they start to level off. Hair, et.al. (1998, pg. 104) reports that it tends to keep one or more factors more than Kaiser's criterion. Stevens (2002, pg. 390) reports that both Kaiser and Scree are accurate if n>250 and communalities 0.6. Fixed % of Variance Explained Keep as many factors as are required to explain 60%, 70%, 80-85%, or 95%. There is no general consensus and one should check what is common in your field. It seems reasonable that any decent model should have at least 50% of the variance in the variables explained by the common factors. A priori If you have a hypothesis about the number of factors that should underlie the data, then that is probably a good (at least minimum) number to use. In practice there is no single best rule to use and a combination of them is often used, so if you have no a priori hypothesis check all three and use the closest thing to a majority decision. There are also a variety of other methods out there that are very popular with some authors: Minimum average partial correlation, parallel analysis, and modified parallel analysis are three of them. Three that require multivariate normality are: likelihood ratio test, AIC (Akaike's information criterion), and SBC (Schwartz s Bayesian criterion).
5 Methods for Fitting the Factor Analysis Model: Principal Components Factor Analysis Just take the first m loadings from the principal components solution and multiply by the square root of the corresponding eigen value. This is not really appropriate since it attempts to explain all of the variance in the variables and not just the common variance. It therefore will often have highly correlated errors. However, Hair, et.al. (1998, pg. 103) reports that it often gives similar results to other methods if there are 30 variables or if most variables have communalities > 0.6. Principal Factor Factor Analysis (a.k.a. Principal Axis Factoring and sometimes even Principal Components Factoring!) Come up with initial estimates of the communality for each variable and replace the diagonals in the correlation matrix with those. Then do principal components and take the first m loadings. Because you have taken out the specificity the error matrix should be much closer to a diagonal matrix. There are various initial estimates used for the initial communalities: the absolute value of the maximum correlation of that variable with any of the others, the squared multiple correlation coefficient for predicting that variable from the others in multiple regression, and the corresponding diagonal element from the inverse of the correlation matrix. There seems to be no agreement on which is best but the first is a slight bit easier to program. Iterated Principal Factor Factor Analysis - This begins the same way as the principal factor method, but you use the fitted model to get better estimates of the communalities, and then you keep repeating the process. This method will often fail to fit because of the Heywood case (estimated communalities > 1, negative error variances) though. Maximum Likelihood Factor Analysis Requires the assumption of multivariate normality and is difficult to program. It does allow for various tests of hypotheses however. Other Methods Alpha Factoring, Image Factoring, Harris Factoring, Rao's Canonical Factoring, Unweighted Least Squares I would recommend using the iterated principal factor solution if it does not have impossible communalities or error variances. If the iterated method fails then I would use the non-iterated principal factor method if the error covariances do not seem unreasonably large. If that too fails you either need more factors, or the factor analytic model is possibly inappropriate for your data set. Rotation: It is often considered best to use an orthogonal rotation because then all of the mathematical representations we've used so far will work. Two of the most popular are: Varimax Maximizes the sum of the squared factor loadings across the columns. This tends to force each variable to load highly on as few factors as possible. Ideally it will cause each variable to load on only one factor and thus point out good summed scores that could be made. If there is an overall underlying general factor that is working on all of the variables this rotation will not find it. (e.g. The math test has geometry, trigonometry, and algebra but does it also have an overall general math ability too?) Quartimax Whereas varimax focuses on the columns, quartimax focuses on the rows. Sharma (1996, pg. 120) reports that: it tends to have all of the variables load highly on one factor, and then each variable will tend to load highly on one other factor and near zero on the rest; unlike varimax this method will tend to keep the general factor. It sometimes doesn t seem to differ much from the initial solution. There are also oblique rotations that don't keep the independence between the factors. Popular ones include Promax, Oblimin, and Orthoblique. Interpret these with caution!
6 Interpretation: Once you have your factor loadings matrix it is necessary to try and interpret the factors. It is common to indicate which of the loadings are actually significant by underlining or circling them (and possibly erasing the non-significant ones). Significant is measured in two ways. Practical Significance - Are the factor loadings large enough so that the factors actually have a meaningful effect on the variables. (e.g. Roughly speaking, if two things only have a correlation of 0.2 it means that the one only explains 4% of the variation in the other!) Hair et.al. (1998, pg. 111) recommends the following guidelines for practical significance: ±0.3 Minimal ±0.4 More Important ±0.5 Practically Significant Statistical Significance - We also want the loading to be statistically significantly different from zero. (e.g. If the loading is 0.3, but the confidence interval for it is 0.2 to 0.8 do we care?) Stevens (2003, pg. 294) reports the following rules of thumb based on sample size n loading Further Guidance Johnson (1996, pg. 156) recommends not including factors with only one significant loading. Hatcher (1994, pg. 73) reports that we want at least 3 variables loading on each factor and preferably more. This is also related to a result discussed earlier in the section on sample size. Finally, it seems reasonable if you are using varimax to remove variables which load heavily on more than one factor. Factor Scores: Exploratory factor analysis helps us to gain an understanding of the processes underlying our variables. We might also want to come up with estimates for how each of the observations rates on these unobservable (latent) factors. Surrogate Variable Choose a single variable that loads highly on that factor and use its value. It is very simple to calculate and simple to explain too simple Estimate Factor Scores Use a statistical method to actually estimate the values of the F j for each observation. Mardia, et.al. (1979, pg. 274) reports that if multivariate normality holds then Bartlett's weighted least squares method is unbiased, and Thompson's regression method will be biased but have a smaller overall error. Factor scores however are difficult to interpret (what does it mean to take a complicated weighted average of the observed values?) and the exact values estimated depend on the sample. Summed Scores If each variable loads on only a single factor, then make subscores for each factor by summing up all of the variables that load on that factor. This is the compromise solution and is often the one used in practice when designing and analyzing questionnaires. Some Final Steps: Johnson and Wichern (2002, pg. 517) suggest the following to see if your solution seems reasonable: plot the factor scores against each other to look for suspicious observations and for large data sets, split them in half and perform factor analysis on each half to see if the solution is stable.
A Brief Introduction to SPSS Factor Analysis
A Brief Introduction to SPSS Factor Analysis SPSS has a procedure that conducts exploratory factor analysis. Before launching into a step by step example of how to use this procedure, it is recommended
4. There are no dependent variables specified... Instead, the model is: VAR 1. Or, in terms of basic measurement theory, we could model it as:
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in the relationships among the variables--factors are linear constructions of the set of variables; the critical source
2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) F 2 X 4 U 4
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) 3. Univariate and multivariate
Overview of Factor Analysis
Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,
Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA
PROC FACTOR: How to Interpret the Output of a Real-World Example Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA ABSTRACT THE METHOD This paper summarizes a real-world example of a factor
Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models
Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis
FACTOR ANALYSIS NASC
FACTOR ANALYSIS NASC Factor Analysis A data reduction technique designed to represent a wide range of attributes on a smaller number of dimensions. Aim is to identify groups of variables which are relatively
Common factor analysis
Common factor analysis This is what people generally mean when they say "factor analysis" This family of techniques uses an estimate of common variance among the original variables to generate the factor
Exploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016
and Principal Components Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016 Agenda Brief History and Introductory Example Factor Model Factor Equation Estimation of Loadings
Factor Analysis. Chapter 420. Introduction
Chapter 420 Introduction (FA) is an exploratory technique applied to a set of observed variables that seeks to find underlying factors (subsets of variables) from which the observed variables were generated.
A Brief Introduction to Factor Analysis
1. Introduction A Brief Introduction to Factor Analysis Factor analysis attempts to represent a set of observed variables X 1, X 2. X n in terms of a number of 'common' factors plus a factor which is unique
Factor Analysis. Factor Analysis
Factor Analysis Principal Components Analysis, e.g. of stock price movements, sometimes suggests that several variables may be responding to a small number of underlying forces. In the factor model, we
5.2 Customers Types for Grocery Shopping Scenario
------------------------------------------------------------------------------------------------------- CHAPTER 5: RESULTS AND ANALYSIS -------------------------------------------------------------------------------------------------------
Statistics in Psychosocial Research Lecture 8 Factor Analysis I. Lecturer: Elizabeth Garrett-Mayer
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
Introduction to Principal Components and FactorAnalysis
Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a
PRINCIPAL COMPONENT ANALYSIS
1 Chapter 1 PRINCIPAL COMPONENT ANALYSIS Introduction: The Basics of Principal Component Analysis........................... 2 A Variable Reduction Procedure.......................................... 2
Factor Analysis. Advanced Financial Accounting II Åbo Akademi School of Business
Factor Analysis Advanced Financial Accounting II Åbo Akademi School of Business Factor analysis A statistical method used to describe variability among observed variables in terms of fewer unobserved variables
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Factor Analysis. Sample StatFolio: factor analysis.sgp
STATGRAPHICS Rev. 1/10/005 Factor Analysis Summary The Factor Analysis procedure is designed to extract m common factors from a set of p quantitative variables X. In many situations, a small number of
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
Multivariate Analysis (Slides 13)
Multivariate Analysis (Slides 13) The final topic we consider is Factor Analysis. A Factor Analysis is a mathematical approach for attempting to explain the correlation between a large set of variables
Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red)
Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red) The following DATA procedure is to read input data. This will create a SAS dataset named CORRMATR
Practical Considerations for Using Exploratory Factor Analysis in Educational Research
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to
T-test & factor analysis
Parametric tests T-test & factor analysis Better than non parametric tests Stringent assumptions More strings attached Assumes population distribution of sample is normal Major problem Alternatives Continue
APPRAISAL OF FINANCIAL AND ADMINISTRATIVE FUNCTIONING OF PUNJAB TECHNICAL UNIVERSITY
APPRAISAL OF FINANCIAL AND ADMINISTRATIVE FUNCTIONING OF PUNJAB TECHNICAL UNIVERSITY In the previous chapters the budgets of the university have been analyzed using various techniques to understand the
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In
STA 4107/5107. Chapter 3
STA 4107/5107 Chapter 3 Factor Analysis 1 Key Terms Please review and learn these terms. 2 What is Factor Analysis? Factor analysis is an interdependence technique (see chapter 1) that primarily uses metric
The Effectiveness of Ethics Program among Malaysian Companies
2011 2 nd International Conference on Economics, Business and Management IPEDR vol.22 (2011) (2011) IACSIT Press, Singapore The Effectiveness of Ethics Program among Malaysian Companies Rabiatul Alawiyah
Factor analysis. Angela Montanari
Factor analysis Angela Montanari 1 Introduction Factor analysis is a statistical model that allows to explain the correlations between a large number of observed correlated variables through a small number
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
How to report the percentage of explained common variance in exploratory factor analysis
UNIVERSITAT ROVIRA I VIRGILI How to report the percentage of explained common variance in exploratory factor analysis Tarragona 2013 Please reference this document as: Lorenzo-Seva, U. (2013). How to report
Exploratory Factor Analysis: rotation. Psychology 588: Covariance structure and factor models
Exploratory Factor Analysis: rotation Psychology 588: Covariance structure and factor models Rotational indeterminacy Given an initial (orthogonal) solution (i.e., Φ = I), there exist infinite pairs of
Exploratory Factor Analysis
Exploratory Factor Analysis Definition Exploratory factor analysis (EFA) is a procedure for learning the extent to which k observed variables might measure m abstract variables, wherein m is less than
Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression
Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression Saikat Maitra and Jun Yan Abstract: Dimension reduction is one of the major tasks for multivariate
Psychology 7291, Multivariate Analysis, Spring 2003. SAS PROC FACTOR: Suggestions on Use
: Suggestions on Use Background: Factor analysis requires several arbitrary decisions. The choices you make are the options that you must insert in the following SAS statements: PROC FACTOR METHOD=????
An introduction to. Principal Component Analysis & Factor Analysis. Using SPSS 19 and R (psych package) Robin Beaumont [email protected].
An introduction to Principal Component Analysis & Factor Analysis Using SPSS 19 and R (psych package) Robin Beaumont [email protected] Monday, 23 April 2012 Acknowledgment: The original version
What Are Principal Components Analysis and Exploratory Factor Analysis?
Statistics Corner Questions and answers about language testing statistics: Principal components analysis and exploratory factor analysis Definitions, differences, and choices James Dean Brown University
Topic 10: Factor Analysis
Topic 10: Factor Analysis Introduction Factor analysis is a statistical method used to describe variability among observed variables in terms of a potentially lower number of unobserved variables called
To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method.
Factor Analysis in SPSS To conduct a Factor Analysis, start from the Analyze menu. This procedure is intended to reduce the complexity in a set of data, so we choose Data Reduction from the menu. And the
UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST
UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly
Exploratory Factor Analysis
Exploratory Factor Analysis ( 探 索 的 因 子 分 析 ) Yasuyo Sawaki Waseda University JLTA2011 Workshop Momoyama Gakuin University October 28, 2011 1 Today s schedule Part 1: EFA basics Introduction to factor
FACTOR ANALYSIS. Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables.
FACTOR ANALYSIS Introduction Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables Both methods differ from regression in that they don t have
James E. Bartlett, II is Assistant Professor, Department of Business Education and Office Administration, Ball State University, Muncie, Indiana.
Organizational Research: Determining Appropriate Sample Size in Survey Research James E. Bartlett, II Joe W. Kotrlik Chadwick C. Higgins The determination of sample size is a common task for many organizational
Introduction to Matrix Algebra
Psychology 7291: Multivariate Statistics (Carey) 8/27/98 Matrix Algebra - 1 Introduction to Matrix Algebra Definitions: A matrix is a collection of numbers ordered by rows and columns. It is customary
Applications of Structural Equation Modeling in Social Sciences Research
American International Journal of Contemporary Research Vol. 4 No. 1; January 2014 Applications of Structural Equation Modeling in Social Sciences Research Jackson de Carvalho, PhD Assistant Professor
Teaching Multivariate Analysis to Business-Major Students
Teaching Multivariate Analysis to Business-Major Students Wing-Keung Wong and Teck-Wong Soon - Kent Ridge, Singapore 1. Introduction During the last two or three decades, multivariate statistical analysis
9.2 User s Guide SAS/STAT. The FACTOR Procedure. (Book Excerpt) SAS Documentation
SAS/STAT 9.2 User s Guide The FACTOR Procedure (Book Excerpt) SAS Documentation This document is an individual chapter from SAS/STAT 9.2 User s Guide. The correct bibliographic citation for the complete
PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA
PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA ABSTRACT The decision of whether to use PLS instead of a covariance
Factor Analysis Using SPSS
Factor Analysis Using SPSS The theory of factor analysis was described in your lecture, or read Field (2005) Chapter 15. Example Factor analysis is frequently used to develop questionnaires: after all
Simple Regression Theory II 2010 Samuel L. Baker
SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the
Multivariate Analysis of Variance (MANOVA): I. Theory
Gregory Carey, 1998 MANOVA: I - 1 Multivariate Analysis of Variance (MANOVA): I. Theory Introduction The purpose of a t test is to assess the likelihood that the means for two groups are sampled from the
UNDERSTANDING ANALYSIS OF COVARIANCE (ANCOVA)
UNDERSTANDING ANALYSIS OF COVARIANCE () In general, research is conducted for the purpose of explaining the effects of the independent variable on the dependent variable, and the purpose of research design
Statistics for Business Decision Making
Statistics for Business Decision Making Faculty of Economics University of Siena 1 / 62 You should be able to: ˆ Summarize and uncover any patterns in a set of multivariate data using the (FM) ˆ Apply
Exploratory Factor Analysis
Introduction Principal components: explain many variables using few new variables. Not many assumptions attached. Exploratory Factor Analysis Exploratory factor analysis: similar idea, but based on model.
Research Methodology: Tools
MSc Business Administration Research Methodology: Tools Applied Data Analysis (with SPSS) Lecture 02: Item Analysis / Scale Analysis / Factor Analysis February 2014 Prof. Dr. Jürg Schwarz Lic. phil. Heidi
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST
UNDERSTANDING THE DEPENDENT-SAMPLES t TEST A dependent-samples t test (a.k.a. matched or paired-samples, matched-pairs, samples, or subjects, simple repeated-measures or within-groups, or correlated groups)
Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree of PhD of Engineering in Informatics
INTERNATIONAL BLACK SEA UNIVERSITY COMPUTER TECHNOLOGIES AND ENGINEERING FACULTY ELABORATION OF AN ALGORITHM OF DETECTING TESTS DIMENSIONALITY Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree
Chapter 6: Multivariate Cointegration Analysis
Chapter 6: Multivariate Cointegration Analysis 1 Contents: Lehrstuhl für Department Empirische of Wirtschaftsforschung Empirical Research and und Econometrics Ökonometrie VI. Multivariate Cointegration
A Comparison of Variable Selection Techniques for Credit Scoring
1 A Comparison of Variable Selection Techniques for Credit Scoring K. Leung and F. Cheong and C. Cheong School of Business Information Technology, RMIT University, Melbourne, Victoria, Australia E-mail:
Application of discriminant analysis to predict the class of degree for graduating students in a university system
International Journal of Physical Sciences Vol. 4 (), pp. 06-0, January, 009 Available online at http://www.academicjournals.org/ijps ISSN 99-950 009 Academic Journals Full Length Research Paper Application
A Beginner s Guide to Factor Analysis: Focusing on Exploratory Factor Analysis
Tutorials in Quantitative Methods for Psychology 2013, Vol. 9(2), p. 79-94. A Beginner s Guide to Factor Analysis: Focusing on Exploratory Factor Analysis An Gie Yong and Sean Pearce University of Ottawa
Penalized regression: Introduction
Penalized regression: Introduction Patrick Breheny August 30 Patrick Breheny BST 764: Applied Statistical Modeling 1/19 Maximum likelihood Much of 20th-century statistics dealt with maximum likelihood
SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Chapter 7 Factor Analysis SPSS
Chapter 7 Factor Analysis SPSS Factor analysis attempts to identify underlying variables, or factors, that explain the pattern of correlations within a set of observed variables. Factor analysis is often
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012
Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts
Review Jeopardy. Blue vs. Orange. Review Jeopardy
Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?
Indices of Model Fit STRUCTURAL EQUATION MODELING 2013
Indices of Model Fit STRUCTURAL EQUATION MODELING 2013 Indices of Model Fit A recommended minimal set of fit indices that should be reported and interpreted when reporting the results of SEM analyses:
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
Factor Analysis: Statnotes, from North Carolina State University, Public Administration Program. Factor Analysis
Factor Analysis Overview Factor analysis is used to uncover the latent structure (dimensions) of a set of variables. It reduces attribute space from a larger number of variables to a smaller number of
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
Conducting Exploratory and Confirmatory Factor Analyses for Competency in Malaysia Logistics Companies
Conducting Exploratory and Confirmatory Factor Analyses for Competency in Malaysia Logistics Companies Dazmin Daud Faculty of Business and Information Science, UCSI University, Kuala Lumpur, MALAYSIA.
Multivariate Statistical Inference and Applications
Multivariate Statistical Inference and Applications ALVIN C. RENCHER Department of Statistics Brigham Young University A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim
Dimensionality Reduction: Principal Components Analysis
Dimensionality Reduction: Principal Components Analysis In data mining one often encounters situations where there are a large number of variables in the database. In such situations it is very likely
Principal Component Analysis
Principal Component Analysis Principle Component Analysis: A statistical technique used to examine the interrelations among a set of variables in order to identify the underlying structure of those variables.
Questionnaire Evaluation with Factor Analysis and Cronbach s Alpha An Example
Questionnaire Evaluation with Factor Analysis and Cronbach s Alpha An Example - Melanie Hof - 1. Introduction The pleasure writers experience in writing considerably influences their motivation and consequently
Correlational Research
Correlational Research Chapter Fifteen Correlational Research Chapter Fifteen Bring folder of readings The Nature of Correlational Research Correlational Research is also known as Associational Research.
DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION
DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION Analysis is the key element of any research as it is the reliable way to test the hypotheses framed by the investigator. This
Factor Rotations in Factor Analyses.
Factor Rotations in Factor Analyses. Hervé Abdi 1 The University of Texas at Dallas Introduction The different methods of factor analysis first extract a set a factors from a data set. These factors are
Chapter 1 Introduction. 1.1 Introduction
Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations
ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE. School of Mathematical Sciences
! ROCHESTER INSTITUTE OF TECHNOLOGY COURSE OUTLINE FORM COLLEGE OF SCIENCE School of Mathematical Sciences New Revised COURSE: COS-MATH-252 Probability and Statistics II 1.0 Course designations and approvals:
A Basic Guide to Analyzing Individual Scores Data with SPSS
A Basic Guide to Analyzing Individual Scores Data with SPSS Step 1. Clean the data file Open the Excel file with your data. You may get the following message: If you get this message, click yes. Delete
Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.
Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C
Applied Regression Analysis and Other Multivariable Methods
THIRD EDITION Applied Regression Analysis and Other Multivariable Methods David G. Kleinbaum Emory University Lawrence L. Kupper University of North Carolina, Chapel Hill Keith E. Muller University of
Manifold Learning Examples PCA, LLE and ISOMAP
Manifold Learning Examples PCA, LLE and ISOMAP Dan Ventura October 14, 28 Abstract We try to give a helpful concrete example that demonstrates how to use PCA, LLE and Isomap, attempts to provide some intuition
We discuss 2 resampling methods in this chapter - cross-validation - the bootstrap
Statistical Learning: Chapter 5 Resampling methods (Cross-validation and bootstrap) (Note: prior to these notes, we'll discuss a modification of an earlier train/test experiment from Ch 2) We discuss 2
Multivariate Analysis
Table Of Contents Multivariate Analysis... 1 Overview... 1 Principal Components... 2 Factor Analysis... 5 Cluster Observations... 12 Cluster Variables... 17 Cluster K-Means... 20 Discriminant Analysis...
Multivariate Normal Distribution
Multivariate Normal Distribution Lecture 4 July 21, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Lecture #4-7/21/2011 Slide 1 of 41 Last Time Matrices and vectors Eigenvalues
Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
GRADES 7, 8, AND 9 BIG IDEAS
Table 1: Strand A: BIG IDEAS: MATH: NUMBER Introduce perfect squares, square roots, and all applications Introduce rational numbers (positive and negative) Introduce the meaning of negative exponents for
Best Practices in Exploratory Factor Analysis: Four Recommendations for Getting the Most From Your Analysis
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to
What is Rotating in Exploratory Factor Analysis?
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to
