Factor Analysis. Sample StatFolio: factor analysis.sgp
|
|
|
- Charles Andrews
- 9 years ago
- Views:
Transcription
1 STATGRAPHICS Rev. 1/10/005 Factor Analysis Summary The Factor Analysis procedure is designed to extract m common factors from a set of p quantitative variables X. In many situations, a small number of common factors may be able to represent a large percentage of the variability in the original variables. The ability to express the covariances amongst the variables in terms of a small number of meaningful factors often leads to important insights about the data being analyzed. This procedure supports both principal components and classical factor analysis. Factor loadings may be extracted from either the sample covariance or sample correlation matrix. The initial loadings may be rotated using either varimax, equimax, or quartimax rotation. Sample StatFolio: factor analysis.sgp Sample Data: The file 93cars.sf6 contains information on 6 variables for n = 93 makes and models of automobiles, taken from Lock (1993). The table below shows a partial list of the data in that file: Make Model Engine Horsepower Fuel Passengers Length Size Tank Acura Integra Acura Legend Audi Audi BMW 535i Buick Century Buick LeSabre Buick Roadmaster Buick Riviera Cadillac DeVille Cadillac Seville Chevrolet Cavalier It is desired to perform a factor analysis on the following variables: Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight 005 by StatPoint, Inc. Factor Analysis - 1
2 A matrix plot of the data is shown below: STATGRAPHICS Rev. 1/10/005 Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight As might be expected, the variables are highly correlated, since most are related to vehicle size. Data Input The data input dialog box requests the names of the columns containing the data: Data: either the original observations or the sample covariance matrix Σˆ. If entering the original observations, enter p numeric columns containing the n values for each column of X. 005 by StatPoint, Inc. Factor Analysis -
3 STATGRAPHICS Rev. 1/10/005 If entering the sample covariance matrix, enter p numeric columns containing the p values for each column of Σˆ. If the covariance matrix is entered, some of the tables and plots will not be available. Point Labels: optional labels for each observation. Select: subset selection. Statistical Model The goal of a factor analysis is to characterize the p variables in X in terms of a small number m of common factors F, which impact all of the variables, and a set of errors or specific factors ε, which affect only a single X variable. Following Johnson and Wichern (00), the orthogonal common factor model expresses the observed variables as X X X 1 µ 1 = l11f1 + l1f l1 mfm + ε1 µ = l F l F l F m m + ε p µ = l... + p p1 F1 + l pf + + l pmfm ε p (1) In matrix notation, X µ = LF + ε () where µ is a vector of means and L is called the factor loading matrix. It is assumed that the common factors and specific factors are all independent of one another. To avoid ambiguity in scaling, the variances of the common factors are assumed to equal 1, while the covariance matrix of the specific factors Ψ is a diagonal matrix with diagonal elements Ψ j. The covariance matrix Σ of the original observations X is related to the factor loading matrix by Σ = L L + Ψ (3) An important result of the above model is the relationship between the variances of the original X variables and the variances of the derived factors. In particular, Var( X ) = l 1 + l l + Ψ (4) j j j jm j This variance is expressed as the sum of two quantities: 1. the communality: l + l + + l j1 j... jm. the specific variance: Ψ j The communality is the variance attributable to factors that all the X variables have in common, while the specific variance is specific to a single factor. It should also be noted that the factor loadings L are not unique. Multiplication by any orthogonal matrix yields another acceptable set of factor loadings. Subsequent to the initial factor 005 by StatPoint, Inc. Factor Analysis - 3
4 STATGRAPHICS Rev. 1/10/005 extraction, it is therefore common to rotate the factor loadings until they can be most easily interpreted. Analysis Summary The Analysis Summary table is shown below: Factor Analysis Data variables: Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight Data input: observations Number of complete cases: 8 Missing value treatment: listwise Standardized: yes Type of factoring: principal components Number of factors extracted: Factor Analysis Factor Percent of Cumulative Number Eigenvalue Variance Percentage Initial Variable Communality Engine Size 1.0 Horsepower 1.0 Fueltank 1.0 Passengers 1.0 Length 1.0 Wheelbase 1.0 Width 1.0 U Turn Space 1.0 Rear seat 1.0 Luggage 1.0 Weight 1.0 Displayed in the table are: Data variables: the names of the p input columns. 005 by StatPoint, Inc. Factor Analysis - 4
5 STATGRAPHICS Rev. 1/10/005 Data input: either observations or matrix, depending upon whether the input columns contain the original observations or the sample covariance matrix. Number of complete cases: the number of cases n for which none of the observations were missing. Missing value treatment: how missing values were treated in estimating the covariance or correlation matrix. If listwise, the estimates were based on complete cases only. If pairwise, all pairs of non-missing data values were used to obtain the estimates. Standardized: yes if the analysis was based on the correlation matrix. No if it was based on the covariance matrix. Type of factoring: either principal components if the factor extraction was done directly on the sample covariance or correlation matrix, or classical if the diagonal elements were adjusted using estimates of the communalities. Number of components extracted: the number of components m extracted from the data. This number is based on the settings on the Analysis Options dialog box. A table is also displayed showing information for each of the p possible factors: Factor number: the factor number j, from 1 to p. Eigenvalue: the eigenvalue of the estimated covariance or correlation matrix, λˆ j, after adjusting for the estimated communalities if using the classical method. Percentage of variance: the percent of the total estimated population variance represented by this factor, equal to 100 ˆ λ1 ˆ λ j % ˆ λ... ˆ + + λ + m (5) Cumulative percentage: the cumulative percentage of the total estimated population variance accounted for by the first j factors. Initial communality: the initial communality used in the calculations, either input by the user or estimated from the sample covariances or correlations. In the example, the first m = factors account for over 84% of the overall variance amongst the 11 variables. 005 by StatPoint, Inc. Factor Analysis - 5
6 Analysis Options STATGRAPHICS Rev. 1/10/005 Missing Values Treatment: method of handling missing values when estimating the sample covariances or correlations. Specify Listwise to use only cases with no missing values for any of the input variables. Specify Pairwise to use all pairs of observations in which neither value is missing. Standardize: Check this box to base the analysis on the sample correlation matrix rather than the sample covariance matrix. This corresponds to standardizing each input variable before calculating the covariances, by subtracting its mean and dividing by its standard deviation. Type of Factoring: Select principal components to extract the factors directly from the covariance or correlation matrix. Select classical to replace the diagonal elements with estimated communalities. If using the classical method, you can specify the communalities by pressing the Communalities button or let the program use an iterative procedure to estimate them. Rotation: the method used to rotate the factor loading matrix after it has been extracted. Varimax rotation maximizes the variance of the squared loadings in each column. Quartimax maximizes the variance of the squared loadings in each row. Equimax attempts to achieve a balance between rows and columns. Extract By: the criterion used to determine the number of factors to extract. Minimum Eigenvalue: if extracting by magnitude of the eigenvalues, the smallest eigenvalue for which a factor will be extracted. Number of Factors: if extracting by number of factors, the number k. 005 by StatPoint, Inc. Factor Analysis - 6
7 There are also two buttons that access additional dialog boxes: Estimation Button STATGRAPHICS Rev. 1/10/005 These fields control the iterations used in: 1. The classical method of factor extraction. Estimated communalities are revised until the proportional change in their sum is less than the stopping criterion, or the maximum iterations is reached.. Rotation of the factor loadings. The stopping criteria apply to the variance of the squared elements on the diagonal of the factor loading matrix. Communalities Button When using the classical estimation method, you may specify a column containing the communalities instead of having the program estimate them iteratively. 005 by StatPoint, Inc. Factor Analysis - 7
8 STATGRAPHICS Rev. 1/10/005 Scree Plot The Scree Plot can be very helpful in determining the number of factors to extract. By default, it plots the size of the eigenvalues corresponding to each of the p possible factors: 8 Scree Plot Eigenvalue Factor A horizontal line is added at the minimum eigenvalue specified on the Analysis Options dialog box. In the plot above, note that only the first factors have large eigenvalues. Pane Options Plot: value plotted on the vertical axis. 005 by StatPoint, Inc. Factor Analysis - 8
9 STATGRAPHICS Rev. 1/10/005 Extraction Statistics The Extraction Statistics pane shows the estimated value of the coefficients l for each factor extracted, before any rotation is applied: Factor Loading Matrix Before Rotation Factor Factor 1 Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight Estimated Specific Variable Communality Variance Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight Also displayed are the communalities and the specific variances. The weights within each column often have interesting interpretations. In the example, note that the weights in the first column are all approximately the same. This implies that the first component is basically an average of all of the input variables. The second component is weighted most heavily in a positive direction on the number of Passengers, the Rear Seat room, and the amount of Luggage space, and in a negative direction on Horsepower. It likely differentiates amongst the different types of vehicles. Note also that U Turn Space and Luggage have a larger specific variance than the others, implying that they are not as well accounted for by the two extracted factors. 005 by StatPoint, Inc. Factor Analysis - 9
10 STATGRAPHICS Rev. 1/10/005 Rotation Statistics The Rotation Statistics pane shows the estimated value of the coefficients l after the requested rotation is applied: Factor Loading Matrix After Varimax Rotation Factor Factor 1 Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight Estimated Specific Variable Communality Variance Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight Note that the rotation has substantially decreased the loading of Passengers, Rear seat, and Luggage on the first factor and made them the dominant variables in the second factor. The second factor seems to distinguish large family vehicles such as minivans and SUV s from the other automobiles. 005 by StatPoint, Inc. Factor Analysis - 10
11 STATGRAPHICS Rev. 1/10/005 D and 3D Scatterplots These plots display the values of or 3 selected factors for each of the n cases, after rotation Scatterplot Factor Dodge Stealth Factor 1 It is useful to examine any points far away from the others, such as the highlighted Dodge Stealth, which has a very low value for the second factor. An interesting variation of this plot is one in which the variables are coded according to another column, such as the type of vehicle: Plot of SCORE_ vs SCORE_1 SCORE_ SCORE_1 Type Compact Large Midsize Small Sporty To produce the above plot: 1. Press the Save Results button and save the Factor Scores to new columns of the datasheet.. Select the X-Y Plot procedure from the top menu and input the new columns. 3. Select Analysis Options and specify Type in the Point Codes field. It is now clear that the first factor is related to the size of the vehicle, while the second factor separates the sporty cars from the others. 005 by StatPoint, Inc. Factor Analysis - 11
12 Pane Options STATGRAPHICS Rev. 1/10/005 Specify the factors to plot on each axis. Factor Scores The Factor Scores pane displays the values of the rotated factor scores for each of the n cases. Table of Factor Scores Factor Factor Row Label 1 1 Integra Legend i Century LeSabre Roadmaster Riviera The factor scores show where each observation falls with respect to the extracted factors. Factor Score Coefficients The table of Factor Score Coefficients shows the coefficients used to create the factor scores from the original variables. Factor Score Coefficients Factor Factor 1 Engine Size Horsepower Fueltank Passengers Length Wheelbase Width U Turn Space Rear seat Luggage Weight If the sample covariance matrix S has been factored, then the coefficients are the leading terms multiplying the deviation of each variable from its mean in 005 by StatPoint, Inc. Factor Analysis - 1
13 ˆ ˆ 1 f j = L S ( x j x) STATGRAPHICS Rev. 1/10/005 (6) If the correlation matrix R has been factored, then the coefficients are the leading terms multiplying the standardized values of each variable according in fˆ = Lˆ R (7) 1 j z j D and 3D Factor Plots The Factor Plots show the location of each variable in the space of or 3 selected factors: Plot of Factor Loadings Rear seat Passengers Luggage Factor Wheelbase Length U Turn SpaceWidth Engine Size Weight Fueltank 0 Horsepower Factor 1 Variables furthest from the reference lines at 0 make the largest contribution to the factors. Save Results The following results may be saved to the datasheet: 1. Eigenvalues the m eigenvalues.. Factor Matrix m columns, each containing p estimates of the coefficients l before rotation. 3. Rotated Factor Matrix m columns, each containing p estimates of the coefficients l after rotation. 4. Transition Matrix the m by m matrix that multiplies the original factor loadings to yield the rotated factor loadings. 5. Communalities the p estimated communalities after rotation. 6. Specific variances the p specific variances after rotation. 005 by StatPoint, Inc. Factor Analysis - 13
14 STATGRAPHICS Rev. 1/10/ Factor Scores m columns, each containing n values corresponding to the extracted factors. 8. Factor Score Coefficients m columns, each containing the p values of the factor score coefficients. 005 by StatPoint, Inc. Factor Analysis - 14
Multiple Box-and-Whisker Plot
Multiple Summary The Multiple procedure creates a plot designed to illustrate important features of a numeric data column when grouped according to the value of a second variable. The box-and-whisker plot
Factor Analysis. Chapter 420. Introduction
Chapter 420 Introduction (FA) is an exploratory technique applied to a set of observed variables that seeks to find underlying factors (subsets of variables) from which the observed variables were generated.
Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red)
Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red) The following DATA procedure is to read input data. This will create a SAS dataset named CORRMATR
To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method.
Factor Analysis in SPSS To conduct a Factor Analysis, start from the Analyze menu. This procedure is intended to reduce the complexity in a set of data, so we choose Data Reduction from the menu. And the
A Brief Introduction to SPSS Factor Analysis
A Brief Introduction to SPSS Factor Analysis SPSS has a procedure that conducts exploratory factor analysis. Before launching into a step by step example of how to use this procedure, it is recommended
Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA
PROC FACTOR: How to Interpret the Output of a Real-World Example Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA ABSTRACT THE METHOD This paper summarizes a real-world example of a factor
Factor Analysis. Advanced Financial Accounting II Åbo Akademi School of Business
Factor Analysis Advanced Financial Accounting II Åbo Akademi School of Business Factor analysis A statistical method used to describe variability among observed variables in terms of fewer unobserved variables
Chapter 7 Factor Analysis SPSS
Chapter 7 Factor Analysis SPSS Factor analysis attempts to identify underlying variables, or factors, that explain the pattern of correlations within a set of observed variables. Factor analysis is often
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
There are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
Exploratory Factor Analysis Brian Habing - University of South Carolina - October 15, 2003
Exploratory Factor Analysis Brian Habing - University of South Carolina - October 15, 2003 FA is not worth the time necessary to understand it and carry it out. -Hills, 1977 Factor analysis should not
Exploratory Factor Analysis
Introduction Principal components: explain many variables using few new variables. Not many assumptions attached. Exploratory Factor Analysis Exploratory factor analysis: similar idea, but based on model.
Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models
Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis
FACTOR ANALYSIS NASC
FACTOR ANALYSIS NASC Factor Analysis A data reduction technique designed to represent a wide range of attributes on a smaller number of dimensions. Aim is to identify groups of variables which are relatively
Scatter Plots with Error Bars
Chapter 165 Scatter Plots with Error Bars Introduction The procedure extends the capability of the basic scatter plot by allowing you to plot the variability in Y and X corresponding to each point. Each
Multivariate Analysis
Table Of Contents Multivariate Analysis... 1 Overview... 1 Principal Components... 2 Factor Analysis... 5 Cluster Observations... 12 Cluster Variables... 17 Cluster K-Means... 20 Discriminant Analysis...
Factor Analysis Using SPSS
Psychology 305 p. 1 Factor Analysis Using SPSS Overview For this computer assignment, you will conduct a series of principal factor analyses to examine the factor structure of a new instrument developed
Common factor analysis
Common factor analysis This is what people generally mean when they say "factor analysis" This family of techniques uses an estimate of common variance among the original variables to generate the factor
Advanced Statistical Methods in Insurance
Advanced Statistical Methods in Insurance 7. Multivariate Data All Pairwise Scattergrams Iris Data Set: 3 Species 50 Cases of each with p=4 measurements per case 2 Hudec & Schlögl 1 3-d Scatterplots iris[,
Multivariate Analysis (Slides 13)
Multivariate Analysis (Slides 13) The final topic we consider is Factor Analysis. A Factor Analysis is a mathematical approach for attempting to explain the correlation between a large set of variables
Dimensionality Reduction: Principal Components Analysis
Dimensionality Reduction: Principal Components Analysis In data mining one often encounters situations where there are a large number of variables in the database. In such situations it is very likely
Exploratory Factor Analysis: rotation. Psychology 588: Covariance structure and factor models
Exploratory Factor Analysis: rotation Psychology 588: Covariance structure and factor models Rotational indeterminacy Given an initial (orthogonal) solution (i.e., Φ = I), there exist infinite pairs of
Exploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016
and Principal Components Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016 Agenda Brief History and Introductory Example Factor Model Factor Equation Estimation of Loadings
Factor Analysis. Factor Analysis
Factor Analysis Principal Components Analysis, e.g. of stock price movements, sometimes suggests that several variables may be responding to a small number of underlying forces. In the factor model, we
Statistics for Business Decision Making
Statistics for Business Decision Making Faculty of Economics University of Siena 1 / 62 You should be able to: ˆ Summarize and uncover any patterns in a set of multivariate data using the (FM) ˆ Apply
Introduction to Principal Components and FactorAnalysis
Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a
PRINCIPAL COMPONENT ANALYSIS
1 Chapter 1 PRINCIPAL COMPONENT ANALYSIS Introduction: The Basics of Principal Component Analysis........................... 2 A Variable Reduction Procedure.......................................... 2
4. There are no dependent variables specified... Instead, the model is: VAR 1. Or, in terms of basic measurement theory, we could model it as:
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in the relationships among the variables--factors are linear constructions of the set of variables; the critical source
2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) F 2 X 4 U 4
1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) 3. Univariate and multivariate
Overview of Factor Analysis
Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,
Review Jeopardy. Blue vs. Orange. Review Jeopardy
Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?
Least-Squares Intersection of Lines
Least-Squares Intersection of Lines Johannes Traa - UIUC 2013 This write-up derives the least-squares solution for the intersection of lines. In the general case, a set of lines will not intersect at a
Introduction to Principal Component Analysis: Stock Market Values
Chapter 10 Introduction to Principal Component Analysis: Stock Market Values The combination of some data and an aching desire for an answer does not ensure that a reasonable answer can be extracted from
Principal Component Analysis
Principal Component Analysis Principle Component Analysis: A statistical technique used to examine the interrelations among a set of variables in order to identify the underlying structure of those variables.
DATA ANALYSIS II. Matrix Algorithms
DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
Factor Rotations in Factor Analyses.
Factor Rotations in Factor Analyses. Hervé Abdi 1 The University of Texas at Dallas Introduction The different methods of factor analysis first extract a set a factors from a data set. These factors are
15.062 Data Mining: Algorithms and Applications Matrix Math Review
.6 Data Mining: Algorithms and Applications Matrix Math Review The purpose of this document is to give a brief review of selected linear algebra concepts that will be useful for the course and to develop
Data analysis process
Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis
SPC Response Variable
SPC Response Variable This procedure creates control charts for data in the form of continuous variables. Such charts are widely used to monitor manufacturing processes, where the data often represent
Exercise 1.12 (Pg. 22-23)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
Introduction to Matrix Algebra
Psychology 7291: Multivariate Statistics (Carey) 8/27/98 Matrix Algebra - 1 Introduction to Matrix Algebra Definitions: A matrix is a collection of numbers ordered by rows and columns. It is customary
Create Charts in Excel
Create Charts in Excel Table of Contents OVERVIEW OF CHARTING... 1 AVAILABLE CHART TYPES... 2 PIE CHARTS... 2 BAR CHARTS... 3 CREATING CHARTS IN EXCEL... 3 CREATE A CHART... 3 HOW TO CHANGE THE LOCATION
Data Analysis Tools. Tools for Summarizing Data
Data Analysis Tools This section of the notes is meant to introduce you to many of the tools that are provided by Excel under the Tools/Data Analysis menu item. If your computer does not have that tool
Principal Component Analysis
Principal Component Analysis ERS70D George Fernandez INTRODUCTION Analysis of multivariate data plays a key role in data analysis. Multivariate data consists of many different attributes or variables recorded
Reliability Analysis
Measures of Reliability Reliability Analysis Reliability: the fact that a scale should consistently reflect the construct it is measuring. One way to think of reliability is that other things being equal,
Econometrics Simple Linear Regression
Econometrics Simple Linear Regression Burcu Eke UC3M Linear equations with one variable Recall what a linear equation is: y = b 0 + b 1 x is a linear equation with one variable, or equivalently, a straight
How To Use Statgraphics Centurion Xvii (Version 17) On A Computer Or A Computer (For Free)
Statgraphics Centurion XVII (currently in beta test) is a major upgrade to Statpoint's flagship data analysis and visualization product. It contains 32 new statistical procedures and significant upgrades
Factor Analysis and Structural equation modelling
Factor Analysis and Structural equation modelling Herman Adèr Previously: Department Clinical Epidemiology and Biostatistics, VU University medical center, Amsterdam Stavanger July 4 13, 2006 Herman Adèr
Two Correlated Proportions (McNemar Test)
Chapter 50 Two Correlated Proportions (Mcemar Test) Introduction This procedure computes confidence intervals and hypothesis tests for the comparison of the marginal frequencies of two factors (each with
Statistics in Psychosocial Research Lecture 8 Factor Analysis I. Lecturer: Elizabeth Garrett-Mayer
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
T-test & factor analysis
Parametric tests T-test & factor analysis Better than non parametric tests Stringent assumptions More strings attached Assumes population distribution of sample is normal Major problem Alternatives Continue
STA 4107/5107. Chapter 3
STA 4107/5107 Chapter 3 Factor Analysis 1 Key Terms Please review and learn these terms. 2 What is Factor Analysis? Factor analysis is an interdependence technique (see chapter 1) that primarily uses metric
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In
Session 7 Bivariate Data and Analysis
Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares
Data exploration with Microsoft Excel: analysing more than one variable
Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical
This chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
Topic 10: Factor Analysis
Topic 10: Factor Analysis Introduction Factor analysis is a statistical method used to describe variability among observed variables in terms of a potentially lower number of unobserved variables called
Psychology 7291, Multivariate Analysis, Spring 2003. SAS PROC FACTOR: Suggestions on Use
: Suggestions on Use Background: Factor analysis requires several arbitrary decisions. The choices you make are the options that you must insert in the following SAS statements: PROC FACTOR METHOD=????
STATISTICA Formula Guide: Logistic Regression. Table of Contents
: Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary
Multivariate Analysis of Variance (MANOVA)
Chapter 415 Multivariate Analysis of Variance (MANOVA) Introduction Multivariate analysis of variance (MANOVA) is an extension of common analysis of variance (ANOVA). In ANOVA, differences among various
January 26, 2009 The Faculty Center for Teaching and Learning
THE BASICS OF DATA MANAGEMENT AND ANALYSIS A USER GUIDE January 26, 2009 The Faculty Center for Teaching and Learning THE BASICS OF DATA MANAGEMENT AND ANALYSIS Table of Contents Table of Contents... i
AP STATISTICS 2010 SCORING GUIDELINES
2010 SCORING GUIDELINES Question 4 Intent of Question The primary goals of this question were to (1) assess students ability to calculate an expected value and a standard deviation; (2) recognize the applicability
Instructions for SPSS 21
1 Instructions for SPSS 21 1 Introduction... 2 1.1 Opening the SPSS program... 2 1.2 General... 2 2 Data inputting and processing... 2 2.1 Manual input and data processing... 2 2.2 Saving data... 3 2.3
CORRELATION ANALYSIS
CORRELATION ANALYSIS Learning Objectives Understand how correlation can be used to demonstrate a relationship between two factors. Know how to perform a correlation analysis and calculate the coefficient
FACTOR ANALYSIS. Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables.
FACTOR ANALYSIS Introduction Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables Both methods differ from regression in that they don t have
Name: Date: Use the following to answer questions 2-3:
Name: Date: 1. A study is conducted on students taking a statistics class. Several variables are recorded in the survey. Identify each variable as categorical or quantitative. A) Type of car the student
A Solution Manual and Notes for: Exploratory Data Analysis with MATLAB by Wendy L. Martinez and Angel R. Martinez.
A Solution Manual and Notes for: Exploratory Data Analysis with MATLAB by Wendy L. Martinez and Angel R. Martinez. John L. Weatherwax May 7, 9 Introduction Here you ll find various notes and derivations
Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information.
Excel Tutorial Below is a very brief tutorial on the basic capabilities of Excel. Refer to the Excel help files for more information. Working with Data Entering and Formatting Data Before entering data
Risk Decomposition of Investment Portfolios. Dan dibartolomeo Northfield Webinar January 2014
Risk Decomposition of Investment Portfolios Dan dibartolomeo Northfield Webinar January 2014 Main Concepts for Today Investment practitioners rely on a decomposition of portfolio risk into factors to guide
Group Theory and Molecular Symmetry
Group Theory and Molecular Symmetry Molecular Symmetry Symmetry Elements and perations Identity element E - Apply E to object and nothing happens. bject is unmoed. Rotation axis C n - Rotation of object
[1] Diagonal factorization
8.03 LA.6: Diagonalization and Orthogonal Matrices [ Diagonal factorization [2 Solving systems of first order differential equations [3 Symmetric and Orthonormal Matrices [ Diagonal factorization Recall:
Dealing with Data in Excel 2010
Dealing with Data in Excel 2010 Excel provides the ability to do computations and graphing of data. Here we provide the basics and some advanced capabilities available in Excel that are useful for dealing
STATISTICS AND DATA ANALYSIS IN GEOLOGY, 3rd ed. Clarificationof zonationprocedure described onpp. 238-239
STATISTICS AND DATA ANALYSIS IN GEOLOGY, 3rd ed. by John C. Davis Clarificationof zonationprocedure described onpp. 38-39 Because the notation used in this section (Eqs. 4.8 through 4.84) is inconsistent
A Demonstration of Hierarchical Clustering
Recitation Supplement: Hierarchical Clustering and Principal Component Analysis in SAS November 18, 2002 The Methods In addition to K-means clustering, SAS provides several other types of unsupervised
Multiple regression - Matrices
Multiple regression - Matrices This handout will present various matrices which are substantively interesting and/or provide useful means of summarizing the data for analytical purposes. As we will see,
Statgraphics Getting started
Statgraphics Getting started The aim of this exercise is to introduce you to some of the basic features of the Statgraphics software. Starting Statgraphics 1. Log in to your PC, using the usual procedure
How To Cluster
Data Clustering Dec 2nd, 2013 Kyrylo Bessonov Talk outline Introduction to clustering Types of clustering Supervised Unsupervised Similarity measures Main clustering algorithms k-means Hierarchical Main
The ith principal component (PC) is the line that follows the eigenvector associated with the ith largest eigenvalue.
More Principal Components Summary Principal Components (PCs) are associated with the eigenvectors of either the covariance or correlation matrix of the data. The ith principal component (PC) is the line
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model
Overview of Violations of the Basic Assumptions in the Classical Normal Linear Regression Model 1 September 004 A. Introduction and assumptions The classical normal linear regression model can be written
One-Way ANOVA using SPSS 11.0. SPSS ANOVA procedures found in the Compare Means analyses. Specifically, we demonstrate
1 One-Way ANOVA using SPSS 11.0 This section covers steps for testing the difference between three or more group means using the SPSS ANOVA procedures found in the Compare Means analyses. Specifically,
Exploratory Factor Analysis
Exploratory Factor Analysis Definition Exploratory factor analysis (EFA) is a procedure for learning the extent to which k observed variables might measure m abstract variables, wherein m is less than
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected]
Analysing Questionnaires using Minitab (for SPSS queries contact -) [email protected] Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
Descriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation
Nonlinear Iterative Partial Least Squares Method
Numerical Methods for Determining Principal Component Analysis Abstract Factors Béchu, S., Richard-Plouet, M., Fernandez, V., Walton, J., and Fairley, N. (2016) Developments in numerical treatments for
Figure 1.1 Vector A and Vector F
CHAPTER I VECTOR QUANTITIES Quantities are anything which can be measured, and stated with number. Quantities in physics are divided into two types; scalar and vector quantities. Scalar quantities have
A Brief Introduction to Factor Analysis
1. Introduction A Brief Introduction to Factor Analysis Factor analysis attempts to represent a set of observed variables X 1, X 2. X n in terms of a number of 'common' factors plus a factor which is unique
SAS Software to Fit the Generalized Linear Model
SAS Software to Fit the Generalized Linear Model Gordon Johnston, SAS Institute Inc., Cary, NC Abstract In recent years, the class of generalized linear models has gained popularity as a statistical modeling
" Y. Notation and Equations for Regression Lecture 11/4. Notation:
Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through
SF2940: Probability theory Lecture 8: Multivariate Normal Distribution
SF2940: Probability theory Lecture 8: Multivariate Normal Distribution Timo Koski 24.09.2015 Timo Koski Matematisk statistik 24.09.2015 1 / 1 Learning outcomes Random vectors, mean vector, covariance matrix,
Introduction to Microsoft Excel 2007/2010
to Microsoft Excel 2007/2010 Abstract: Microsoft Excel is one of the most powerful and widely used spreadsheet applications available today. Excel's functionality and popularity have made it an essential
Least Squares Estimation
Least Squares Estimation SARA A VAN DE GEER Volume 2, pp 1041 1045 in Encyclopedia of Statistics in Behavioral Science ISBN-13: 978-0-470-86080-9 ISBN-10: 0-470-86080-4 Editors Brian S Everitt & David
State of Stress at Point
State of Stress at Point Einstein Notation The basic idea of Einstein notation is that a covector and a vector can form a scalar: This is typically written as an explicit sum: According to this convention,
5.2 Customers Types for Grocery Shopping Scenario
------------------------------------------------------------------------------------------------------- CHAPTER 5: RESULTS AND ANALYSIS -------------------------------------------------------------------------------------------------------
Chapter 15. Mixed Models. 15.1 Overview. A flexible approach to correlated data.
Chapter 15 Mixed Models A flexible approach to correlated data. 15.1 Overview Correlated data arise frequently in statistical analyses. This may be due to grouping of subjects, e.g., students within classrooms,
