T-test & factor analysis



Similar documents
Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011

5.2 Customers Types for Grocery Shopping Scenario

Chapter 7 Factor Analysis SPSS

APPRAISAL OF FINANCIAL AND ADMINISTRATIVE FUNCTIONING OF PUNJAB TECHNICAL UNIVERSITY

Common factor analysis

Introduction to Principal Components and FactorAnalysis

Research Methods & Experimental Design

Factor Analysis. Chapter 420. Introduction

Data analysis process

Additional sources Compilation of sources:

Overview of Factor Analysis

DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION

A Brief Introduction to SPSS Factor Analysis

Factor Analysis. Advanced Financial Accounting II Åbo Akademi School of Business

FACTOR ANALYSIS NASC

CHAPTER 4 KEY PERFORMANCE INDICATORS

2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) F 2 X 4 U 4

SCHOOL OF HEALTH AND HUMAN SCIENCES DON T FORGET TO RECODE YOUR MISSING VALUES

Projects Involving Statistics (& SPSS)

SPSS Tests for Versions 9 to 13

EFFECT OF ENVIRONMENTAL CONCERN & SOCIAL NORMS ON ENVIRONMENTAL FRIENDLY BEHAVIORAL INTENTIONS

Overview of Non-Parametric Statistics PRESENTER: ELAINE EISENBEISZ OWNER AND PRINCIPAL, OMEGA STATISTICS

Principal Component Analysis

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

Does organizational culture cheer organizational profitability? A case study on a Bangalore based Software Company

4. There are no dependent variables specified... Instead, the model is: VAR 1. Or, in terms of basic measurement theory, we could model it as:

Factors affecting teaching and learning of computer disciplines at. Rajamangala University of Technology

Rank-Based Non-Parametric Tests

Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red)

An Empirical Examination of the Relationship between Financial Trader s Decision-making and Financial Software Applications

FACTOR ANALYSIS. Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables.

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Simple Linear Regression Inference

Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA

A STUDY ON ONBOARDING PROCESS IN SIFY TECHNOLOGIES, CHENNAI

A Brief Introduction to Factor Analysis

Post-hoc comparisons & two-way analysis of variance. Two-way ANOVA, II. Post-hoc testing for main effects. Post-hoc testing 9.

Research Methodology: Tools

A STUDY ON FACTORS AFFECTING INDIVIDUALS INVESTMENT TOWARDS LIFE INSURANCE POLICIES

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

(and sex and drugs and rock 'n' roll) ANDY FIELD

An introduction to. Principal Component Analysis & Factor Analysis. Using SPSS 19 and R (psych package) Robin Beaumont robin@organplayers.co.

Factor Analysis Using SPSS

Multivariate Analysis of Variance. The general purpose of multivariate analysis of variance (MANOVA) is to determine

Descriptive Statistics

Statistical tests for SPSS

WHAT IS A JOURNAL CLUB?

THE KRUSKAL WALLLIS TEST

U.P.B. Sci. Bull., Series C, Vol. 77, Iss. 1, 2015 ISSN

A Study to Improve the Response in Campaigning by Comparing Data Mining Segmentation Approaches in Aditi Technologies

CHAPTER VI ON PRIORITY SECTOR LENDING

Testing for differences I exercises with SPSS

Instructions for SPSS 21

Exploratory Factor Analysis

UNDERSTANDING THE TWO-WAY ANOVA

A STUDY OF CONSUMER ATTITUDE TOWARDS ADVERTISING THROUGH MOBILE PHONES

Factor Analysis and Structural equation modelling

UNIVERSITY OF NAIROBI

Study Guide for the Final Exam

Factor Analysis - SPSS

ISSN: (Online) Volume 2, Issue 3, March 2014 International Journal of Advance Research in Computer Science and Management Studies

Analysis of Data. Organizing Data Files in SPSS. Descriptive Statistics

Exploratory Factor Analysis and Principal Components. Pekka Malo & Anton Frantsev 30E00500 Quantitative Empirical Research Spring 2016

Introduction to Statistics and Quantitative Research Methods

Independent t- Test (Comparing Two Means)

II. DISTRIBUTIONS distribution normal distribution. standard scores

Chapter VIII Customers Perception Regarding Health Insurance

The Statistics Tutor s Quick Guide to

Multivariate Analysis of Variance (MANOVA)

Principal Component Analysis

The Human Resource Information System Productiveness in Organization Culture and Its Importance

JOB SATISFACTION DURING RECESSION PERIOD: A STUDY ON PUBLIC & PRIVATE INSURANCE IN PUNJAB

SPSS Guide How-to, Tips, Tricks & Statistical Techniques

Assumptions. Assumptions of linear models. Boxplot. Data exploration. Apply to response variable. Apply to error terms from linear model

Multivariate Analysis of Variance (MANOVA)

Factorial Invariance in Student Ratings of Instruction

Parametric and non-parametric statistical methods for the life sciences - Session I

How To Understand Multivariate Models

SPSS 3: COMPARING MEANS

Chapter 5 Analysis of variance SPSS Analysis of variance

Research on the Custom Pricing Model of Online Retail Clothing

Fairfield Public Schools

SPSS Explore procedure

CORRELATES OF EMPLOYEE SATISFACTION WITH PERFORMANCE APPRAISAL SYSTEM IN FOREIGN MNC BPOs OPERATING IN INDIA

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

THE USING FACTOR ANALYSIS METHOD IN PREDICTION OF BUSINESS FAILURE

An introduction to IBM SPSS Statistics

Mobile Marketing: Examining the impact of Interest, Individual attention, Problem faced and consumer s attitude on intention to purchase

THE INTERNATIONAL JOURNAL OF BUSINESS & MANAGEMENT

To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method.

An empirical study of factor analysis on M & A performance of listed companies of Chinese pharmaceutical industry

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Multivariate Analysis (Slides 13)

Transcription:

Parametric tests T-test & factor analysis Better than non parametric tests Stringent assumptions More strings attached Assumes population distribution of sample is normal Major problem Alternatives Continue using parametric test if the sample is large or enough evidence available to prove the usage Transforming and manipulating variables Use non parametric test Dr. Paurav Shukla t-test Tests used for significant differences between groups Independent sample t-test Paired sample t-test One-way ANOVA (between groups) One-way ANOVA (repeated groups) Two-way ANOVA (between groups) MANOVA Used when they are two groups only Compares the mean score on some continuous variable Largely used in measuring before after effect Also called intervention effect Option 1: Same sample over a period of time Option 2: Two different groups at a single point of time Assumptions associated with t-test Population distribution of sample is normal Probability (random) sampling Level of measurement Dependent variable is continuous Observed or measured data must be independent If interaction is unavoidable use a stringent alpha value (p<.01) Homogeneity of variance Assumes that samples are obtained from populations of equal variance ANOVA is reasonably robust against this Why t-test is important? Highly used technique for hypothesis testing Can lead to wrong conclusions Type 1 error Reject null hypothesis instead of accepting When we assume there is a difference between groups, when it is not Type 2 error Accept null hypothesis instead of rejecting When we assume there is no difference between groups, when there is Solution Interdependency between both errors Selection of alpha level Dr. Paurav Shukla 1

Factors influencing power of t-test Procedure for independent sample t-test Sample size Strength of interdependency between dependent and independent variable (Strength of Association or Effect Size) Alpha level Compare means Independent sample t-test Interpreting independent samples t-test Calculating the effect size Group statistics Look for N (missing values) Independent samples test Levene s test If sig value is higher than 0.05 use equal variance assumed If sig value is lower than 0.05 use equal variances not assumed Assessing difference between groups If the sig (2-tailed) is equal or less than 0.05 There is a significant difference in the mean scores If the sig (2-tailed) is great than 0.05 There is no significant difference between the two groups SPSS does not calculate Eta squared to measure effect size for t-test Calculation Eta squared = t 2 + (N1 + N2 2) Interpretation values 0.01 = Small effect 0.06 = Moderate effect 0.14 = Large effect t 2 Paired sample t-test Interpreting paired sample t-test Compare means Paired samples t-test Determining overall significance If the sig (2-tailed) is less than 0.05 Significant difference between two scores If the sig (2-tailed) is higher than 0.05 No significant difference between two scores Comparing mean values Mean values Effect size t 2 Eta squared = t 2 + N - 1 Dr. Paurav Shukla 2

Non-parametric alternatives Independent sample t-test Mann-Whitney U test Instead of comparing mean it compares medians Procedure Non-parametric tests 2 Independent samples Paired sample t-test Wilcoxon signed rank test Procedure Non-parametric tests 2 Related samples Factor Analysis Session overview Basic concept Basic Concept Factor Analysis Model Types of factor analysis Statistics Associated with Factor Analysis Conducting factor analysis Applications of factor analysis A data reduction technique designed to represent a wide range of attributes on a smaller number of dimensions. Factor analysis is an interdependence technique in that an entire set of interdependent relationships is examined without making the distinction between dependent and independent variables. Factor analysis is used in the following circumstances: To identify underlying dimensions, or factors, that explain the correlations among a set of variables. To identify a new, smaller, set of uncorrelated variables to replace the original set of correlated variables in subsequent multivariate analysis (regression or discriminant analysis). To identify a smaller set of salient variables from a larger set for use in subsequent multivariate analysis. Standard Model Types of factor analysis Mathematically, each variable is expressed as a linear combination of underlying factors. The covariation among the variables is described in terms of a small number of common factors plus a unique factor for each variable. If the variables are standardized, the factor model may be represented as: Xi = Ai 1F1 + Ai 2F2 + Ai 3F3 +... + AimFm + ViUi where Xi = i th standardized variable Aij = standardized multiple regression coefficient of variable i on common factor j F = common factor Vi = standardized regression coefficient of variable i on unique factor i Ui = the unique factor for variable i m = number of common factors Exploratory factor analysis Largely known as Principal Components Analysis (PCA) Confirmatory factor analysis Structural Equation Modelling Dr. Paurav Shukla 3

Exploratory Factor Analysis A company producing single malt scotch whisky has asked you to study the status consumption behaviour within a specific group of people. Through literature you have developed three constructs which influence consumer choice of such status consumption brand. The three constructs are: Psychological association scale Brand association scale Situations/events scale Exploratory Factor Analysis Psychological associations scale I buy status alcoholic beverages: for gaining respect from others to be a popular person in groups. to show who I am. to be a symbol of success. to be a symbol of prestige. to indicate my wealth. to be noticed by others. to indicate my achievement. to be appreciated by others. because status interests me. because status is important to me. because status enhances my image. because I am interested in new alcoholic beverages with status. because I would pay more for an alcoholic beverage if it had status. because the status of an alcoholic beverage is irrelevant to me. to increase my own value from others point of view. to be more attractive than others. The question How to know what factors are important to consumers? What variables are grouped together in consumer mind? There being no dependent variable how can I measure the impact? Conducting Factor Analysis Problem formulation Construction of the Correlation Matrix Method of Factor Analysis Determination of Number of Factors Rotation of Factors Interpretation of Factors Calculation of Factor Scores Selection of Surrogate Variables Determination of Model Fit Statistics Associated with Factor Analysis Statistics Associated with Factor Analysis Bartlett's test of sphericity Bartlett's test of sphericity is a test statistic used to examine the hypothesis that the variables are uncorrelated in the population. In other words, the population correlation matrix is an identity matrix; each variable correlates perfectly with itself (r = 1) but has no correlation with the other variables (r = 0). Correlation matrix A correlation matrix is a lower triangle matrix showing the simple correlations, r, between all possible pairs of variables included in the analysis. The diagonal elements, which are all 1, are usually omitted. Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy The Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy is an index used to examine the appropriateness of factor analysis. High values (between 0.5 and 1.0) indicate factor analysis is appropriate. Values below 0.5 imply that factor analysis may not be appropriate. Percentage of variance The percentage of the total variance attributed to each factor. Residuals are the differences between the observed correlations, as given in the input correlation matrix, and the reproduced correlations, as estimated from the factor matrix. Communality Communality is the amount of variance a variable shares with all the other variables being considered. This is also the proportion of variance explained by the common factors. Eigenvalue The eigenvalue represents the total variance explained by each factor. Factor loadings Factor loadings are simple correlations between the variables and the factors. Factor loading plot A factor loading plot is a plot of the original variables using the factor loadings as coordinates. Factor matrix A factor matrix contains the factor loadings of all the variables on all the factors extracted. Scree plot A scree plot is a plot of the Eigenvalues against the number of factors in order of extraction. Factor scores Factor scores are composite scores estimated for each respondent on the derived factors. Dr. Paurav Shukla 4

Assumptions for factor analysis Procedure Sample size 1 to 10 ratio Variable correlation higher than 0.3 in most cases Linearity Outliers Data reduction Factor Descriptives Correlation matrix Coefficients KMO and Bartlett s test of sphericity Statistics Initial solution Options Missing values Exclude cases pairwise Coefficients display format Sorted by size Suppress absolute value less than 0.3 Extraction Method Principal components Analyse Correlations matrix Display Screeplot Unrotated factor solution Extract Eigenvalues over 1 Rotation Method Varimax Correlation KMO and Bartlett's Test Kaiser-Meyer-Olkin Measure of Sampling Adequacy. Bartlett's Test of Sphericity Status enhance my image Willing to try new drink Status is important to me Approx. Chi-Square df Sig..888 2750.261 136.000 Show Symbol of Symbol of who I am success prestige 1.000.379.464.524.465.379 1.000.598.102.009.464.598 1.000.255.102.524.102.255 1.000.749.465.009.102.749 1.000.421.543.647.247.209.598.294.411.582.503.440.538.650.165.143.371.206.228.442.499.394.310.340.376.368.200.338.369.173.124.374.357.410.293.240.456.604.594.149.044.325.631.506.105.098 345 187 173 481 523 KMO value 0.6 and above Sig. Value less than 0.05 No o Most correlations of 0.3 and above Total Variance Explained Initial Eigenvalues Extraction Sums of Squared Loadings Rotation Sums of Squared Loadings Component Total % of Variance Cumulative % Total % of Variance Cumulative % Total % of Variance Cumulative % 1 6.815 40.087 40.087 6.815 40.087 40.087 4.741 27.887 27.887 2 2.915 17.145 57.231 2.915 17.145 57.231 3.440 20.237 48.124 3 1.328 7.812 65.043 1.328 7.812 65.043 2.876 16.920 65.043 4.965 5.679 70.722 5.732 4.308 75.030 6.568 3.343 78.372 7.555 3.264 81.637 8.485 2.851 84.487 9.421 2.476 86.964 10.391 2.298 89.261 11.373 2.196 91.457 12.345 2.029 93.487 13.296 1.742 95.229 14.239 1.407 96.636 15.209 1.230 97.866 16.192 1.130 98.996 17.171 1.004 100.000 Extraction Method: Principal Component Analysis. Cumulative percent of variance explained. We are looking for an eigenvalue above 1.0. 7 6 5 e lu 4 a v n e ig 3 E 2 1 0 Scree Plot 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 Component Number Clear bend in Scree plot Component Matrix a Component 1 2 3.733 Status enhance my.723 -.428 image.715 -.326.701 -.379.699 -.473.688.399.688.677 -.425 Status is important to me.646.450 -.412.643.393 -.376.630 -.442.609 -.493 Willing to try new drink.587 -.467 Status not related to me -.407 Indicate wealth.388.718.518.695.573.577 Extraction Method: Principal Component Analysis. a. 3 components extracted. Most components loading on factor 1 Less components loading on factor 2 Least components loading on the last factor Dr. Paurav Shukla 5

What shall the component be called? Rotated Component Matrix a Component 1 2 3.820.795.794.779.774.724 Status not related to me -.353.871.843 Indicate wealth.792.457.643.515.618 Status is important to me.441.771 Status enhance my.764 image When a variable is loaded on two or more factors include it with the factor where it has highest value Status not related to me Indicate wealth Status is important to me Status enhance my image Willing to try new drink Willing to try new drink.415.727.664.372.462.538 Extraction Method: Principal Component Analysis. Rotation Method: Varimax with Kaiser Normalization. a. Rotation converged in 6 iterations. Calculating component scores Component Plot in Rotated Space Helps in brand level comparisons 1.0 0.5 2 n t e 0.0 o n p m o C-0.5-1.0-1.0 Indicate_wealth Symbol_of_success Gain_respect Interest_in_status S_import_me S_enhan_ima Show_who_I_am Presence_o_others Try_new_drink -0.5 0.0 Component 1 0.5 Symbol_of_prestige 1.0 Value -0.5 0.5 0.0 1.0 3 Component -1.0 Because it s a 3 factor solution we are getting a 3 dimensional plot. What if we get a 2 factor solution? How to get this represented in 2 dimension plot so it is easier to understand? Tricks of the trade Applications of factor analysis Self-determining number of factors Combining reliability analysis (Cronbach s Alpha) A more concise representation of the marketing situation and hence communication may be enhanced Fewer questions may be required on future surveys Market segmentation and identifying the underlying variables on which to group customers Identifying brand attributes that influence customer choice Identifying media consumption habits of consumers Perceptual maps become feasible Dr. Paurav Shukla 6

Further questions Confirmatory factor analysis What if we want to measure a hypothesis or theories concerning the structure underlying a set of variables? Such as, in our case do all these variables affect the psychological association of a brand and which in turn would affect status consumption? to be a popular person in groups. to show who I am. to be a symbol of success. to be a symbol of prestige. to indicate my wealth. to be noticed by others. to indicate my achievement. to be appreciated by others. because status interests me. because status is important to me. because status enhances my image. because I am interested in new alcoholic beverages with status. because I would pay more for an alcoholic beverage if it had status. because the status of an alcoholic beverage is irrelevant to me. to increase my own value from others point of view. to be more attractive than others. Psychological association Status consumption Robust but complex Used by a lot of advance level researchers The in phenomenon in research Software: Lisrel or Amos (SPSS) and several others Confirmatory factor analysis (Structural Equation Modelling) Dr. Paurav Shukla 7