Research Methodology: Tools

Size: px
Start display at page:

Download "Research Methodology: Tools"

Transcription

1 MSc Business Administration Research Methodology: Tools Applied Data Analysis (with SPSS) Lecture 02: Item Analysis / Scale Analysis / Factor Analysis February 2014 Prof. Dr. Jürg Schwarz Lic. phil. Heidi Bruderer Enzler Contents Slide 2 Aims of the Lecture 3 Typical Syntax 4 Introduction 5 An Example of a Construct... 5 What are Constructs?... 7 Evaluating the Quality of an Instrument... 8 Part I: Item Analysis and Scale Analysis 9 Descriptive Analysis... 9 Item Difficulty (Also Called p-value) Item Discrimination Reliability Analysis Part I: Item Analysis and Scale Analysis with SPSS 16 Part II: Factor Analysis 19 Main Steps for Performing a Factor Analysis Problematic Aspects Part II: Factor Analysis with SPSS 21 First Step: Select the Variables Third Step: Determining the Number of Factors Fifth Step (Optional): Calculate Sum Scales or Factor Scores References Appendix 43

2 Aims of the Lecture Slide 3 You will understand the term "construct". You will understand item difficulty and how this can be calculated with SPSS. You will understand item discrimination and how this can be calculated with SPSS. You will understand Cronbach's Alpha as a measure for reliability, and how it can be calculated with SPSS. You will understand the concept of factor analysis and know how to perform it with SPSS (Principal Component Analysis). Specifically, you will understand how C a correlation matrix is interpreted. determine the "correct" number of factors (Scree Plot, Kaiser criterion). Varimax rotation is used to be better able to interpret a factor solution. to interpret factors in relationship to their meaning. how factor scores and sum scales are calculated. Typical Syntax Slide 4 Reliability analysis RELIABILITY /VARIABLES=ghqconc ghqsleep ghquse ghqdecis /SCALE('ALL VARIABLES') ALL /MODEL=ALPHA /STATISTICS=DESCRIPTIVE /SUMMARY=TOTAL. Items (list incomplete) Cronbachs Alpha Factor analysis FACTOR /VARIABLES ghqconc ghqconfi ghqdecis ghqenjoy Items (list incomplete) /MISSING LISTWISE /ANALYSIS ghqconc ghqconfi ghqdecis ghqenjoy /PRINT INITIAL CORRELATION SIG KMO INV EXTRACTION ROTATION /PLOT EIGEN ROTATION /CRITERIA MINEIGEN(1) ITERATE(25) Extractions method /EXTRACTION PC /CRITERIA ITERATE(25) Rotation /ROTATION VARIMAX /SAVE REG(ALL) /METHOD=CORRELATION. Saving factor scores

3 Introduction An Example of a Construct Measuring psychosocial well-being with the "General Health Questionnaire"? In the 2003 Health Survey for England the "General Health Questionnaire" (GHQ-12) was used to measure "psychosocial well-being". The GHQ-12 is a battery of 12 items. Slide 5 An extract from the questionnaire: Question I: Are these 12 items collectively suitable for measuring a construct? => Perform an item analysis and a scale analysis Underlying dimensions of the General Health Questionnaire? There could be groups of items on the GHQ-12 that measure different aspects (dimension, factor) of "psychosocial well-being" Slide 6 ghqdecis Felt capable of making decisions ghquse Felt playing useful part in things ghqface Been able to face problems ghqenjoy Able to enjoy day-to-day activities Positive health Construct "psychosocial well-being" ghqconc Able to concentrate ghqhappy Been feeling reasonably happy ghqunhap Been feeling unhappy and depressed ghqstrai Felt constantly under strain Dimensions (factors) of "psychosocial well-being" ghqconfi Been losing confidence in self ghqsleep Lost sleep over worry Psychological distress ghqover Felt couldn t overcome difficulties ghqworth Been thinking of self as worthless Question II: Does the GHQ-12 have a structure? Are there dimensions that must be considered as separate? => Perform a factor analysis

4 What are Constructs? A construct is an abstract idea that cannot be directly observed and measured. Factor technical term Slide 7 Dimension theoretical term Item in the questionnaire Indicator theoretical term Examples: Psychosocial well-being, motivation, anxiety, employee satisfaction Complex constructs often contain various aspects (dimensions, factors). Examples: Psychosocial well-being = positive health + psychological distress Stress reactivity = Play-dead reflex + heart phobia + negativism + disturbance Intelligence = Verbal intelligence + mathematical-logical intelligence + C In order to measure constructs, indicators that are themselves measurable must be found. Example: Social status = Income + years of education + job category In order to measure constructs, multiple items with rating scales are used often. It is assumed that these items are an indicator for the construct. Technically, a construct is therefore often regarded as an item battery. In SPSS, articles and this script, an item battery is also designated as a scale. Example: General Health Questionnaire (GHQ-12) There are different possibilities for compressing these items into a score. As a sum scale: The item values of an item battery are summed up. An example is the Apgar Score (determining the vital signs of a newborn). Evaluating the Quality of an Instrument Slide 8 How can the quality of an instrument be assessed? Common sense: Content considerations Item analysis und scale analysis Descriptive analysis Item difficulty Item discrimination Reliability analysis Examination of dimensionality Factor analysis "Part I" of the lecture "Part II" of the lecture When do we want to assess the quality of an instrument for which ends? Pretests: Finding out where and how a questionnaire can be improved. Existing datasets: Choosing appropriate items for further analysis.

5 Part I: Item Analysis and Scale Analysis Descriptive Analysis Example GHQ-12 in the Health Survey for England 2003 (n = 8,833) Slide 9 Missing values: Around 6% missing values appears unproblematic. In addition, all items have a similar proportion of missing values. ghquse ("felt playing useful part in things") may be somewhat vague, and so less frequently answered. Mean / Median / Skewness / Kurtosis: Difference between mean and median distributions are not symmetric. All distributions are more or less right skewed (skewness > 0) and more peaked than a normal distribution (kurtosis > 0). Standard deviation: Variability is assessed with the help of the histograms (see below). Minimum / Maximum: Values between 1 and 4 The entire range of the scaled was used. Example GHQ-12: Histograms Slide 10 ghqconc ghqsleep ghquse ghqdecis ghqstrai ghqover ghqenjoy ghqface ghqunhap ghqconfi ghqworth ghqhappy Variables ghqconc, ghquse, ghqdecis, ghqenjoy, ghqface and ghqhappy: Items display little variability, could be problematic. The most frequently chosen category 2 represents "same as always". Variables qhqsleep, ghqstrai, ghqover, qhqunhap, ghqconfi and ghqworth: These items exhibit more variance, but are strongly skewed to the right. Floor effects, especially with qhqunhap, ghqconfi and ghqworth. The most commonly chosen value of 1 represents an especially good condition of health.

6 Item Difficulty (Also Called p-value) The difficulty of an item indicates the proportion of respondents who respond to the item in a manner that indicates the characteristic in question is present to a greater extent. Slide 11 Computing item difficulty Continuous items (Rating scales): Item difficulty = arithmetic mean of the item Binary items (for example, Yes/No): Item difficulty is the proportion of respondents who answered a question affirmatively or correctly, when the coding is: affirmative or correct = 1, negative or false = 0 Item difficulty Item i Number of correct responsesitem i = p= Number of valid responses Item i Interpretation of item difficulty High value Item is "easy" Low value Item is "difficult" As a rule, a good mix in an item battery is desirable. Items with extreme values should be revised. With binary items the p-value lies between 0 and 1 (.5 = middle difficulty): Items with p <.1 (very difficult) or p >.9 (very easy) should be revised. Example item difficulty with binary items: A fictitious questionnaire about taxes For example item 5: "In your opinion, should taxes be lowered? Yes or No?" Slide 12 Respondent Item n n n n ny na nt p Legend 0 = no, 1 = yes, n = no answer ny = number of those who answered "Yes" na = number of those who answered nt = total number of those asked p = item difficulty = ny/na Item 1 is too "easy", no distinction is visible. Item 2 appears to be too "difficult" or is not understood. Both items should be eliminated. Item 3 has an average difficulty. Item 4 is relatively difficult, item 5 relatively simple. An example of items with a rating scale follows below.

7 Item Discrimination Item discrimination is the correlation between an item and the item battery without this item. Item discrimination is computed for every item. Slide 13 The discrimination value of an item indicates how well this single item predicts the value of the item battery. Values range from -1 to 1. The higher the value, the better the item measures what the item battery measures. Positive values near 1 are desirable. If the value is negative, this may be due to the rotation of the item (direction/polarity). If the rotation is correct, however, the item should be discarded or revised. Rule of thumb: Items with a discrimination value under.30 are discarded or revised. Changing item rotation in SPSS Example: item v01 with for possible values 1, 2, 3, and 4. RECODE v01 (1 = 4)(2 = 3)(3 = 2)(4 = 1) INTO v01_r. FREQUENCIES v01 v01_r. Example GHQ-12 AnalyzeScaleReliability AnalysisC Statistics: "Scale if item deleted" Slide 14 Column "Corrected Item-Total Correlation": Item discrimination ranges from.487 to.738. Thus all values are above the threshold of.30. This means that each of the items reflects sufficiently well what the scale as a whole measures.

8 Reliability Analysis Reliability analysis is used to quantify how well all items in a battery collectively measure the same theoretical construct, for example, psychosocial well-being. A high average correlation between the items indicates that they all measure the same construct. Slide 15 Usually, reliability is measured through the Cronbach's alpha coefficient. Cronbach's alpha is a measurement of the "internal consistency" of a scale. Cronbach's alpha is a positive function of the average correlation among items in the battery, and is positively correlated with the number of items and the size of the sample. Preconditions for calculating Cronbach's alpha All items measure the same theoretical construct Metric variables All items are scaled in the same direction (for example, low values for the items represent high psychosocial well-being) All items have the same distribution (ideally: normal distribution) Evaluation of Cronbach's alpha Alpha should be >.80. In practice, however, values of.60 or.70 are acceptable. Part I: Item Analysis and Scale Analysis with SPSS General Health Questionnaire (GHQ-12) in the Health Survey for England 2003 (n = 8,833) Slide 16 AnalyzeScaleReliability AnalysisC

9 Slide 17 Results Case Processing Summary 7.3% of the cases are excluded because of missing values. OK Reliability Statistics The reliability of the scale is high (Cronbach's alpha =.891). OK Item Statistics Since the item difficulty (p-value) is the arithmetic mean, it is found in the "Mean" column. Scale from 1 to 4 average difficulty of 2.5. However, in this case the wording suggests that 2 is a type of middle category ("same as usual"). The item difficulties vary between 1.40 and That means that the mixture is not very high; some items are very "difficult" (low p-values). Item-Total Statistics Slide 18 Since item discrimination is the correlation between a single item and the total score without this item, it appears in the column "Corrected Item- Total Correlation". Item discrimination varies between.487 and.738 and therefore lies above.30 OK In the column "Cronbach's Alpha if Item Deleted", we see for each item how high Cronbach's alpha would be if this item were omitted. (Including all items: Alpha =.891) In the example, Cronbach's alpha would not be higher if any of items were omitted. This indicates that all items should remain. OK Summary Question I Each of the items represents the entire battery. However, the total battery displays little variation. From the standpoint of reliability, the GHQ-12 items can be used as a scale.

10 Part II: Factor Analysis Main Steps for Performing a Factor Analysis 1. Select the variables Include only theoretically relevant variables Sufficient number of variables (4 or more per factor) Not too small a sample Perform descriptive analysis (as in Part I: item analysis and reliability analysis) Consider the correlation matrix Slide Test suitability and extract the factors Determination of suitability: Evaluate inverse of correlation matrix, Bartlett's test and KMO Select extraction method (principal component analysis, principal axis analysis) 3. Specify the number of factors Criteria: Eigenvalue, scree plot, rules of thumb 4. Interpret the factors Rotation of the factor matrix, mapping variables to factors, interpretation 5. Optional: Compute sum scales or factor scores Computing sum scales or factor scores Problematic Aspects Slide 20 Many decisions regarding extraction and interpretation of factors are subjective. The same dataset can produce different results, depending on the "decision path". Although variables must be at least interval-scaled, in practice, variables with lower scale levels are often included, which can lead to false conclusions. Sample size There is no scientifically exact rule how large the sample should be. One of the possible rules of thumb suggests that there should be at least 10 subjects per item ("Rule of 10"). Problematic missing values In item batteries, there are often many missing values. Results are different, depending on "missing treatment" in relation to Number of factors Interpretation of factors There is no single solution for handling missing data. Depending on the available data and context, another approach may be needed.

11 Part II: Factor Analysis with SPSS Slide 21 Item battery "General Health Questionnaire" (GHQ-12) in the Health Survey for England 2003 First Step: Select the Variables Theoretically relevant variables Sufficient number of variables Not too small a sample Descriptive analysis Correlation matrix Scale GHQ-12: All plausible items OK Assumption: 2 factors with 6 variables each Psychological distress Positive health OK Very large sample (n = 8,833) OK See histograms (see Part I). In an ideal situation, normally distributed variables. See below How can dimensions (factors) be discovered? From slide 6: Slide 22 A factor analysis deals with how different items relate to each other, and how they can be grouped into factors. The goal is to group items together as a factor and to replace them with general terms. The general term reflects the underlying content. Each factor represents several items. It is more efficient to represent something through fewer factors than through many single items. Attention: Theoretical and empirical facts are incorporated into a factor analysis. Basic idea of factor analysis Assumption: Some variables tend to be related. Example GHQ-12: ghqstrai ("felt constantly under strain") ghqconfi ("been losing confidence in self") Three possible causes for the ghqstrai ghqconfi correlation: Variable ghqstrai influences variable ghqconfi. Variable ghqconfi influences variable ghqstrai. Both variables are influenced by a factor.

12 Correlations matrix of the variables AnalyzeCorrelateBivariate C Slide 23 Correlations b ghqsleep ghqstrai ghqover ghqworth ghqunhap ghqconfi ghqhappy ghqface ghqconc ghquse ghqdecis ghqenjoy ghqsleep Pearson correlation coefficient **.483 **.387 **.547 **.459 **.343 **.312 **.330 **.228 **.226 **.297 ** Significance (2-sided) Pearson correlation coefficient.545 ** **.418 **.592 **.492 **.376 **.365 **.352 **.217 **.268 **.351 ** ghqstrai Attention: For teaching purposes, Significance (2-sided) Pearson correlation coefficient.483 **.598 ** **.593 **.582 the variables **.397 **.443 are **.369 grouped **.344 together **.345 **.379 ** ghqover Significance (2-sided) Pearson correlation coefficient.387 **.418 **.517 ** **.689 so as **.449 to better **.395 observe **.319 **.362 the structure. This is not normally the case! **.330 **.323 ** ghqworth Significance (2-sided) ghqunhap Pearson correlation coefficient.547 **.592 **.593 **.574 ** **.502 **.428 **.362 **.321 **.308 **.377 ** Significance (2-sided) ghqconfi Pearson correlation coefficient.459 **.492 **.582 **.689 **.671 ** **.427 **.379 **.368 **.355 **.363 ** Significance (2-sided) ghqhappy Pearson correlation coefficient.343 **.376 **.397 **.449 **.502 **.460 ** **.370 **.368 **.380 **.416 ** Significance (2-sided) ghqface Pearson correlation coefficient.312 **.365 **.443 **.395 **.428 **.427 **.487 ** **.389 **.485 **.469 ** Significance (2-sided) ghqconc Pearson correlation coefficient.330 **.352 **.369 **.319 **.362 **.379 **.370 **.417** **.438 **.440 ** Significance (2-sided) ghquse Pearson correlation coefficient.228 **.217 **.344 **.362 **.321 **.368 **.368 **.389 **.389** **.439 ** Significance (2-sided) ghqdecis Pearson correlation coefficient.226 **.268 **.345 **.330 **.308 **.355 **.380 **.485 **.438**.488 ** ** Significance (2-sided) ghqenjoy Pearson correlation coefficient.297 **.351 **.379 **.323 **.377 **.363 **.416 **.469 **.440**.439 **.367 ** 1 Significance (2-sided) **. The correlation is significant at the 0.01 level (2-sided). b. Listwise N=8188 There could be two factors (blue, green), where ghqhappy and ghqface cannot be clearly assigned. But: We cannot decide on the number of factors on the basis of a correlation matrix. => Perform a factor analysis! What is important about evaluating the correlation matrix? Slide 24 Significance levels of the correlations The significance level shows whether the correlation coefficient is merely different from zero by chance, or whether there is a high probability that it is truly different from zero. The significance level should be chosen at the beginning of the study (1% or 5%). This depends on the sample size and the goals of the analysis. On rare occasions, especially with large sample sizes, the significance level 0.1% is chosen. Values of the correlation coefficients A factor analysis is problematic when there are many low, yet no high correlation coefficients present. In this case, the data structure is too heterogeneous. Best would be if there are clusters of highly correlated variables that are separated. These clusters are indication of an underlying factor structure.

13 Second Step: Test of Suitability and Extraction of Factors AnalyzeDimension ReductionFactor F Slide 25 Optional Inverse of correlation matrix: Evaluating suitability of data I Slide 26 The correlation structure is suitable for a factor analysis if its inverse approaches a diagonal matrix. Essentially, it acts as a visual aid. Evaluation: There are no universal rules! A matrix is diagonal if the non-diagonal values are as close as possible to zero. That means that the non-diagonal values should be much smaller than those on the diagonal. Example GHQ-12 The non-diagonal values are significantly smaller than the values on the diagonal. The correlation structure is well-suited for a factor analysis. Diagonal

14 Bartlett's test: Evaluating suitability of data II Null hypothesis H 0 : The sample is drawn from a population in which all variables are completely uncorrelated. Slide 27 Assumption: The data are normally distributed. Example GHQ-12 In the case of the GHQ-data, Bartlett's test is significant (Sig. =.000) and correspondingly the null hypothesis can be rejected. The variables are not completely uncorrelated. So the factor analysis can be continued. Attention: The statement "The variables are correlated." is false. The alternative hypothesis cannot be postulated. Kaiser-Meyer-Olkin (KMO): Evaluating suitability of data III Kaiser, Meyer and Olkin developed a "Measure of Sampling Adequacy" (MSA), which is the standard test for assessing whether the data are suitable for factor analysis. MSA values refer to single variables. The KMO index is a generalization for the entire dataset. The KMO index shows the extent to which the variables belong together and so helps to determine whether or not a factor analysis is appropriate. It tests whether the partial correlations between the variables are small. If these are small, then the KMO is high. Rule of thumb: The KMO should be.60 or higher in order to continue with the factor analysis. Kaiser (1970) suggests a lower limit of.50, though a value of.80 or higher would be desirable. Slide 28 Example GHQ-12 KMO value.00 to.49 unacceptable.50 bis.59 miserable.60 bis.69 mediocre.70 bis.79 middling.80 bis.89 meritorious.90 bis 1.00 marvellous

15 Example GHQ-12: Comparing the extraction results Slide 29 Factor loading Factor loadings The factor loading of a variable is the correlation between the variable and the factor. Typical statement: "The variable ghqconc loads with.619 on factor 1." Values between -1 and +1 are theoretically possible. The magnitude of the factor loading shows how closely a variable correlates with a factor: Magnitudes close to 0 indicate that a relationship barely exists. The higher the magnitude, the closer the relationship. Graphical interpretation of the factor loadings Each variable can be described as a vector in a coordinate system. The vector is formed by the factor loadings of the variable. The factor loadings can be interpreted as coordinates. Slide Factor ghquse ghqdecis ghqenjoy ghqconc ghqface ghqhappy Factor ghqworth ghqconfi ghqover ghqunhap ghqsleep ghqstrai

16 Communality The variables cannot usually be completely explained by the factors. Slide 31 Communality is the total amount of a variable's variance that can be explained by all factors. Communality indicates the extent to which a variable is explained through the factors. Example GHQ-12 Example variable ghqconq: Communality after extraction % of the variance of ghqconq is explained through factors 1 and 2. Association with Component Matrix = % = 38.3% + 9.8% Third Step: Determining the Number of Factors Slide 32 There is no unambiguous method for determining the number of factors. Common sense: Limit the number of factors to those which are understood. The number of factors is usually significantly less than the number of variables. Kaiser Criterion The Kaiser criterion removes all components whose eigenvalue is under 1.0. It is the default in SPSS and most other statistical programs. The Kaiser criterion, however, is not recommended as the only basis for a decision. Scree Plot The scree plot shows the components along the x-axis and the corresponding eigenvalues along the y-axis. All components past the elbow are omitted. Explained variance as the criterion Some researchers apply the rule that there be a sufficient number of factors to describe 90% (sometimes 80%) of the variance.

17 Slide 33 Kaiser criterion (Eigenvalues > 1.0) An eigenvalue shows how much of the total variance is explained by the factor. The Kaiser criterion requires that all components with an eigenvalue < 1.0 be rejected. The Kaiser criterion is not recommended as the only basis for a decision, since it commonly leads to choosing too many factors. Example GHQ-12 The eigenvalue corresponds to the proportion of the total variance that is explained by the factor. The variables are z-transformed for this purpose (standard deviation 1 and mean 0). Thus the total variance of the GHQ-12 to be explained (12 variables) = 12. The first factor explains of this amount, and so 5.588/12 (46.568%) of the variance. The second factor explains 1.330/12 (11.082%) of the variance. 2 factors at most Kaiser criterion (Eigenvalue > 1.0) Scree Plot: Example GHQ-12 Slide 34 The elbow occurs at 3 2 factors Typical statement: elbow flat slope I have decided on two factors, because this is theoretically plausible and consistent with the Kaiser criterion (eigenvalue > 1) as well as the scree plot. Scree plot: Example with random data No elbow Elbow criterion: If the factors arise randomly, then the slope is flat. Only factors above the elbow are counted. Scree

18 Fourth Step: Interpretation of the Factors Rotation Rotation leads to a better readability of the results. Varimax rotation preserves the independence of the factors. The factors are rotated, so that the variance of the squared loadings per factor is maximized. Mid-level loadings tend to become "smaller" or "larger", so that the factor structure is easier to interpret. Slide 35 Symbolic representation: Before rotation: After rotation: Factor 1 Factor 1 Factor 2 Factor 2 Example GHQ-12: Rotation and loading plots in SPSS Loading plots essentially constitute a visualization. Slide 36 Before rotation: After rotation:

19 Evaluating the loadings and assigning variables to factors Slide 37 In principle, each variable is assigned to the factor on which it loads highest. How high should a factor loading minimally be in order to be interpreted? There are different possible rules of thumb: Factor loadings lower than.20 should not be considered. If an item does not load any higher on any of the factors: Discard the item and redo the analysis. Factor loadings of ±.30 to ±.40 are minimally acceptable but higher values are desirable (particularly if the sample is small and only few variables are considered) Irrespective of sample size: A factor can be interpreted ifc at least 4 variables show loadings of.60. at least 10 variables have a loading of.40. If n < 300, factors with low loadings only should not be interpreted. Example GHQ-12 The factors show 6 and 5 variables with loadings.6, respectively. Cross-loadings Slide 38 A cross-loading occurs if a variable has high loadings on more than one factor. (two or more factor loadings >.3 or >.4) The item is related with more than one factor. It correlates with other items that load on the affected factors. If the goal is to find sharply-defined factors, and from them, for example, form sum scores, the difference between loadings is considered. Large difference (>.2): The variable can be assigned to the factor with the highest loading. Small difference (<.2): The variable cannot be assigned to a factor. Exclude and perform the analysis again. (Unless theoretical considerations contradict this.) If the goal is to demonstrate commonality of concepts, cross-loadings are of theoretical interest and are retained. They must be theoretically plausible.

20 Content interpretation of the factors I Slide 39 The items are sorted within the factor according to the decreasing magnitude of the loading. SPSS can facilitate doing so. Not sorted: Sorted by size: Slide 40 Content interpretation of the factors II The sign of the loading (or the loading itself) is noted. What is the "theme" of the factors? What do the factors stand for? Inspect item text. The highest-loading variables are especially helpful (so-called "marker variables"). The context of the study can also provide information. Example GHQ-12 ghqunhap Been feeling unhappy and depressed ghqstrai Felt constantly under strain ghqconfi Been losing confidence in self ghqover Felt couldn t overcome difficulties ghqsleep Lost sleep over worry ghqworth Been thinking of self as worthless ghqdecis Felt capable of making decisions ghquse Felt playing useful part in things ghqface Been able to face problems ghqenjoy Able to enjoy day-to-day activities ghqconc Able to concentrate ghqhappy Been feeling reasonably happy Factor 1 Factor 2 Possible general term: "psychological distress" "positive health"

21 Fifth Step (Optional): Calculate Sum Scales or Factor Scores To use factors in further analyses, we need to generate one variable per factor to represent it. Usually, sum scales are calculated on the basis of the factor solution. Alternatively, factor scores can be calculated. Slide 41 Calculating sum scales in SPSS TransformCompute variablesc Test each factor for reliability beforehand! (Cronbach's alpha) Calculating factor scores in SPSS It is implemented as part of SPSS factor analysis (Option "Scores"). The "regression method" is most often used. This results in one variable per factor, e.g. FAC1_1, FAC1_2. References Hankins, M. (2008) The factor structure of the twelve item General Health Questionnaire (GHQ-12): the result of negative phrasing? Clinical Practice and Epidemiology in Mental Health, 4:10, (Date of access: February 2014) Slide 42 Jackson, C. (2007) The General Health Questionnaire. Occupational Medicine, 57:79, (Date of access: February 2014) Kaiser, H. (1970). A second generation little jiffy. Psychometrika, 35(4), (Date of access: February 2014)

22 Appendix Slide 43 Types of Factorizations: Principal Components Analysis vs. Principal Axis Factoring Principal Component Analysis (PCA, default in SPSS) Goal: Reproduce the data structure No causal relationship between factors and variables Factors are "general terms" and are often called components. Process: The first factor is chosen so that it describes the greatest possible proportion of variance in the variables. Each additional factor describes a maximum amount of the remaining variance. Factors are extracted until the total variance among the variables is explained. If variables are added, the factor loadings change. Principal Axis Factoring (PAF) Goal: Determine the cause of the correlation structure Causal interpretation: Factors cause the correlations among variables. Process: The first factor is chosen so that it explains the greatest possible portion of the common variance of the variables. Each additional factor explains a maximum portion of the remaining common variance of the variables. Factors are extracted until the factors can explain all the common variance in a set of variables. In principle it is possible to add variables without affecting the factor loadings. Notes: Slide 44

Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models

Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models Factor Analysis Principal components factor analysis Use of extracted factors in multivariate dependency models 2 KEY CONCEPTS ***** Factor Analysis Interdependency technique Assumptions of factor analysis

More information

T-test & factor analysis

T-test & factor analysis Parametric tests T-test & factor analysis Better than non parametric tests Stringent assumptions More strings attached Assumes population distribution of sample is normal Major problem Alternatives Continue

More information

A Brief Introduction to SPSS Factor Analysis

A Brief Introduction to SPSS Factor Analysis A Brief Introduction to SPSS Factor Analysis SPSS has a procedure that conducts exploratory factor analysis. Before launching into a step by step example of how to use this procedure, it is recommended

More information

4. There are no dependent variables specified... Instead, the model is: VAR 1. Or, in terms of basic measurement theory, we could model it as:

4. There are no dependent variables specified... Instead, the model is: VAR 1. Or, in terms of basic measurement theory, we could model it as: 1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data 2. Linearity (in the relationships among the variables--factors are linear constructions of the set of variables; the critical source

More information

5.2 Customers Types for Grocery Shopping Scenario

5.2 Customers Types for Grocery Shopping Scenario ------------------------------------------------------------------------------------------------------- CHAPTER 5: RESULTS AND ANALYSIS -------------------------------------------------------------------------------------------------------

More information

FACTOR ANALYSIS NASC

FACTOR ANALYSIS NASC FACTOR ANALYSIS NASC Factor Analysis A data reduction technique designed to represent a wide range of attributes on a smaller number of dimensions. Aim is to identify groups of variables which are relatively

More information

Data analysis process

Data analysis process Data analysis process Data collection and preparation Collect data Prepare codebook Set up structure of data Enter data Screen data for errors Exploration of data Descriptive Statistics Graphs Analysis

More information

Overview of Factor Analysis

Overview of Factor Analysis Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,

More information

To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method.

To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method. Factor Analysis in SPSS To conduct a Factor Analysis, start from the Analyze menu. This procedure is intended to reduce the complexity in a set of data, so we choose Data Reduction from the menu. And the

More information

Common factor analysis

Common factor analysis Common factor analysis This is what people generally mean when they say "factor analysis" This family of techniques uses an estimate of common variance among the original variables to generate the factor

More information

How to report the percentage of explained common variance in exploratory factor analysis

How to report the percentage of explained common variance in exploratory factor analysis UNIVERSITAT ROVIRA I VIRGILI How to report the percentage of explained common variance in exploratory factor analysis Tarragona 2013 Please reference this document as: Lorenzo-Seva, U. (2013). How to report

More information

Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA

Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA PROC FACTOR: How to Interpret the Output of a Real-World Example Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA ABSTRACT THE METHOD This paper summarizes a real-world example of a factor

More information

Research Methodology: Tools

Research Methodology: Tools MSc Business Administration Research Methodology: Tools Applied Data Analysis (with SPSS) Lecture 11: Nonparametric Methods May 2014 Prof. Dr. Jürg Schwarz Lic. phil. Heidi Bruderer Enzler Contents Slide

More information

APPRAISAL OF FINANCIAL AND ADMINISTRATIVE FUNCTIONING OF PUNJAB TECHNICAL UNIVERSITY

APPRAISAL OF FINANCIAL AND ADMINISTRATIVE FUNCTIONING OF PUNJAB TECHNICAL UNIVERSITY APPRAISAL OF FINANCIAL AND ADMINISTRATIVE FUNCTIONING OF PUNJAB TECHNICAL UNIVERSITY In the previous chapters the budgets of the university have been analyzed using various techniques to understand the

More information

2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) F 2 X 4 U 4

2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) F 2 X 4 U 4 1 Neuendorf Factor Analysis Assumptions: 1. Metric (interval/ratio) data. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) 3. Univariate and multivariate

More information

Exploratory Factor Analysis of Demographic Characteristics of Antenatal Clinic Attendees and their Association with HIV Risk

Exploratory Factor Analysis of Demographic Characteristics of Antenatal Clinic Attendees and their Association with HIV Risk Doi:10.5901/mjss.2014.v5n20p303 Abstract Exploratory Factor Analysis of Demographic Characteristics of Antenatal Clinic Attendees and their Association with HIV Risk Wilbert Sibanda Philip D. Pretorius

More information

Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red)

Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red) Factor Analysis Example: SAS program (in blue) and output (in black) interleaved with comments (in red) The following DATA procedure is to read input data. This will create a SAS dataset named CORRMATR

More information

Simple Linear Regression Inference

Simple Linear Regression Inference Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation

More information

DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION

DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION Analysis is the key element of any research as it is the reliable way to test the hypotheses framed by the investigator. This

More information

An introduction to. Principal Component Analysis & Factor Analysis. Using SPSS 19 and R (psych package) Robin Beaumont robin@organplayers.co.

An introduction to. Principal Component Analysis & Factor Analysis. Using SPSS 19 and R (psych package) Robin Beaumont robin@organplayers.co. An introduction to Principal Component Analysis & Factor Analysis Using SPSS 19 and R (psych package) Robin Beaumont robin@organplayers.co.uk Monday, 23 April 2012 Acknowledgment: The original version

More information

Exploratory Factor Analysis Brian Habing - University of South Carolina - October 15, 2003

Exploratory Factor Analysis Brian Habing - University of South Carolina - October 15, 2003 Exploratory Factor Analysis Brian Habing - University of South Carolina - October 15, 2003 FA is not worth the time necessary to understand it and carry it out. -Hills, 1977 Factor analysis should not

More information

1-3 id id no. of respondents 101-300 4 respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank

1-3 id id no. of respondents 101-300 4 respon 1 responsible for maintenance? 1 = no, 2 = yes, 9 = blank Basic Data Analysis Graziadio School of Business and Management Data Preparation & Entry Editing: Inspection & Correction Field Edit: Immediate follow-up (complete? legible? comprehensible? consistent?

More information

(303C8) Social Research Methods on Psychology (Masters) Convenor: John Drury

(303C8) Social Research Methods on Psychology (Masters) Convenor: John Drury (303C8) Social Research Methods on Psychology (Masters) Convenor: John Drury Exercise EXC (60%) The re-sit for coursework is an exercise in which you must attempt one of two tasks in analysis and data

More information

PRINCIPAL COMPONENT ANALYSIS

PRINCIPAL COMPONENT ANALYSIS 1 Chapter 1 PRINCIPAL COMPONENT ANALYSIS Introduction: The Basics of Principal Component Analysis........................... 2 A Variable Reduction Procedure.......................................... 2

More information

Principal Component Analysis

Principal Component Analysis Principal Component Analysis ERS70D George Fernandez INTRODUCTION Analysis of multivariate data plays a key role in data analysis. Multivariate data consists of many different attributes or variables recorded

More information

A Brief Introduction to Factor Analysis

A Brief Introduction to Factor Analysis 1. Introduction A Brief Introduction to Factor Analysis Factor analysis attempts to represent a set of observed variables X 1, X 2. X n in terms of a number of 'common' factors plus a factor which is unique

More information

Introduction to Principal Components and FactorAnalysis

Introduction to Principal Components and FactorAnalysis Introduction to Principal Components and FactorAnalysis Multivariate Analysis often starts out with data involving a substantial number of correlated variables. Principal Component Analysis (PCA) is a

More information

Factor Analysis Using SPSS

Factor Analysis Using SPSS Psychology 305 p. 1 Factor Analysis Using SPSS Overview For this computer assignment, you will conduct a series of principal factor analyses to examine the factor structure of a new instrument developed

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Chapter 7 Factor Analysis SPSS

Chapter 7 Factor Analysis SPSS Chapter 7 Factor Analysis SPSS Factor analysis attempts to identify underlying variables, or factors, that explain the pattern of correlations within a set of observed variables. Factor analysis is often

More information

STATISTICA Formula Guide: Logistic Regression. Table of Contents

STATISTICA Formula Guide: Logistic Regression. Table of Contents : Table of Contents... 1 Overview of Model... 1 Dispersion... 2 Parameterization... 3 Sigma-Restricted Model... 3 Overparameterized Model... 4 Reference Coding... 4 Model Summary (Summary Tab)... 5 Summary

More information

Factor Analysis and Structural equation modelling

Factor Analysis and Structural equation modelling Factor Analysis and Structural equation modelling Herman Adèr Previously: Department Clinical Epidemiology and Biostatistics, VU University medical center, Amsterdam Stavanger July 4 13, 2006 Herman Adèr

More information

Principal Component Analysis

Principal Component Analysis Principal Component Analysis Principle Component Analysis: A statistical technique used to examine the interrelations among a set of variables in order to identify the underlying structure of those variables.

More information

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not.

Example: Credit card default, we may be more interested in predicting the probabilty of a default than classifying individuals as default or not. Statistical Learning: Chapter 4 Classification 4.1 Introduction Supervised learning with a categorical (Qualitative) response Notation: - Feature vector X, - qualitative response Y, taking values in C

More information

Instructions for SPSS 21

Instructions for SPSS 21 1 Instructions for SPSS 21 1 Introduction... 2 1.1 Opening the SPSS program... 2 1.2 General... 2 2 Data inputting and processing... 2 2.1 Manual input and data processing... 2 2.2 Saving data... 3 2.3

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

FACTOR ANALYSIS. Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables.

FACTOR ANALYSIS. Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables. FACTOR ANALYSIS Introduction Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables Both methods differ from regression in that they don t have

More information

Multivariate Analysis of Ecological Data

Multivariate Analysis of Ecological Data Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology

More information

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.

CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In

More information

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree of PhD of Engineering in Informatics

Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree of PhD of Engineering in Informatics INTERNATIONAL BLACK SEA UNIVERSITY COMPUTER TECHNOLOGIES AND ENGINEERING FACULTY ELABORATION OF AN ALGORITHM OF DETECTING TESTS DIMENSIONALITY Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree

More information

How to Get More Value from Your Survey Data

How to Get More Value from Your Survey Data Technical report How to Get More Value from Your Survey Data Discover four advanced analysis techniques that make survey research more effective Table of contents Introduction..............................................................2

More information

Factor Analysis Using SPSS

Factor Analysis Using SPSS Factor Analysis Using SPSS The theory of factor analysis was described in your lecture, or read Field (2005) Chapter 15. Example Factor analysis is frequently used to develop questionnaires: after all

More information

What Has Been Happening to Job Satisfaction in Britain?

What Has Been Happening to Job Satisfaction in Britain? Updated March 2001 What Has Been Happening to Job Satisfaction in Britain? Andrew Oswald Professor of Economics Warwick University and Jonathan Gardner Research Fellow Warwick University Email: andrew.oswald@warwick.ac.uk

More information

IBM SPSS Direct Marketing 23

IBM SPSS Direct Marketing 23 IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release

More information

PATTERNS OF ENVIRONMENTAL MANAGEMENT IN THE CHILEAN MANUFACTURING INDUSTRY: AN EMPIRICAL APPROACH

PATTERNS OF ENVIRONMENTAL MANAGEMENT IN THE CHILEAN MANUFACTURING INDUSTRY: AN EMPIRICAL APPROACH PATTERS OF EVIROMETAL MAAGEMET I THE CHILEA MAUFACTURIG IDUSTRY: A EMPIRICAL APPROACH Dr. Maria Teresa Ruiz-Tagle Research Associate, University of Cambridge, UK Research Associate, Universidad de Chile,

More information

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Review Jeopardy. Blue vs. Orange. Review Jeopardy Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?

More information

Normality Testing in Excel

Normality Testing in Excel Normality Testing in Excel By Mark Harmon Copyright 2011 Mark Harmon No part of this publication may be reproduced or distributed without the express permission of the author. mark@excelmasterseries.com

More information

Chapter VIII Customers Perception Regarding Health Insurance

Chapter VIII Customers Perception Regarding Health Insurance Chapter VIII Customers Perception Regarding Health Insurance This chapter deals with the analysis of customers perception regarding health insurance and involves its examination at series of stages i.e.

More information

IBM SPSS Direct Marketing 22

IBM SPSS Direct Marketing 22 IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release

More information

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011

SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011 SPSS ADVANCED ANALYSIS WENDIANN SETHI SPRING 2011 Statistical techniques to be covered Explore relationships among variables Correlation Regression/Multiple regression Logistic regression Factor analysis

More information

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the

More information

Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases:

Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: Profile Analysis Introduction Profile analysis is the multivariate equivalent of repeated measures or mixed ANOVA. Profile analysis is most commonly used in two cases: ) Comparing the same dependent variables

More information

Statistics in Psychosocial Research Lecture 8 Factor Analysis I. Lecturer: Elizabeth Garrett-Mayer

Statistics in Psychosocial Research Lecture 8 Factor Analysis I. Lecturer: Elizabeth Garrett-Mayer This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

This section of the memo will review the questions included in the UW-BHS survey that will be used to create a social class index.

This section of the memo will review the questions included in the UW-BHS survey that will be used to create a social class index. Social Class Measure This memo discusses the creation of a social class measure. Social class of the family of origin is one of the central concepts in social stratification research: students from more

More information

Introduction to Regression and Data Analysis

Introduction to Regression and Data Analysis Statlab Workshop Introduction to Regression and Data Analysis with Dan Campbell and Sherlock Campbell October 28, 2008 I. The basics A. Types of variables Your variables may take several forms, and it

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Factor Analysis. Chapter 420. Introduction

Factor Analysis. Chapter 420. Introduction Chapter 420 Introduction (FA) is an exploratory technique applied to a set of observed variables that seeks to find underlying factors (subsets of variables) from which the observed variables were generated.

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

THE USING FACTOR ANALYSIS METHOD IN PREDICTION OF BUSINESS FAILURE

THE USING FACTOR ANALYSIS METHOD IN PREDICTION OF BUSINESS FAILURE THE USING FACTOR ANALYSIS METHOD IN PREDICTION OF BUSINESS FAILURE Mary Violeta Petrescu Ph. D University of Craiova Faculty of Economics and Business Administration Craiova, Romania Abstract: : After

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics

Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics Statistical Testing of Randomness Masaryk University in Brno Faculty of Informatics Jan Krhovják Basic Idea Behind the Statistical Tests Generated random sequences properties as sample drawn from uniform/rectangular

More information

Factor Analysis. Advanced Financial Accounting II Åbo Akademi School of Business

Factor Analysis. Advanced Financial Accounting II Åbo Akademi School of Business Factor Analysis Advanced Financial Accounting II Åbo Akademi School of Business Factor analysis A statistical method used to describe variability among observed variables in terms of fewer unobserved variables

More information

Practical Considerations for Using Exploratory Factor Analysis in Educational Research

Practical Considerations for Using Exploratory Factor Analysis in Educational Research A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

A STUDY ON ONBOARDING PROCESS IN SIFY TECHNOLOGIES, CHENNAI

A STUDY ON ONBOARDING PROCESS IN SIFY TECHNOLOGIES, CHENNAI A STUDY ON ONBOARDING PROCESS IN SIFY TECHNOLOGIES, CHENNAI ABSTRACT S. BALAJI*; G. RAMYA** *Assistant Professor, School of Management Studies, Surya Group of Institutions, Vikravandi 605652, Villupuram

More information

Lecture Notes Module 1

Lecture Notes Module 1 Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

More information

Component Ordering in Independent Component Analysis Based on Data Power

Component Ordering in Independent Component Analysis Based on Data Power Component Ordering in Independent Component Analysis Based on Data Power Anne Hendrikse Raymond Veldhuis University of Twente University of Twente Fac. EEMCS, Signals and Systems Group Fac. EEMCS, Signals

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Factor Analysis. Sample StatFolio: factor analysis.sgp

Factor Analysis. Sample StatFolio: factor analysis.sgp STATGRAPHICS Rev. 1/10/005 Factor Analysis Summary The Factor Analysis procedure is designed to extract m common factors from a set of p quantitative variables X. In many situations, a small number of

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST

UNDERSTANDING THE INDEPENDENT-SAMPLES t TEST UNDERSTANDING The independent-samples t test evaluates the difference between the means of two independent or unrelated groups. That is, we evaluate whether the means for two independent groups are significantly

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

Performance Metrics for Graph Mining Tasks

Performance Metrics for Graph Mining Tasks Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics Supervised Learning Performance Metrics Unsupervised Learning Performance Metrics Optimizing Metrics Statistical

More information

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis Chapter 3 Student Lecture Notes 3- Chapter 3 Introduction to Linear Regression and Correlation Analsis Fall 2006 Fundamentals of Business Statistics Chapter Goals To understand the methods for displaing

More information

Does organizational culture cheer organizational profitability? A case study on a Bangalore based Software Company

Does organizational culture cheer organizational profitability? A case study on a Bangalore based Software Company Does organizational culture cheer organizational profitability? A case study on a Bangalore based Software Company S Deepalakshmi Assistant Professor Department of Commerce School of Business, Alliance

More information

Moderation. Moderation

Moderation. Moderation Stats - Moderation Moderation A moderator is a variable that specifies conditions under which a given predictor is related to an outcome. The moderator explains when a DV and IV are related. Moderation

More information

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA)

INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) INTERPRETING THE ONE-WAY ANALYSIS OF VARIANCE (ANOVA) As with other parametric statistics, we begin the one-way ANOVA with a test of the underlying assumptions. Our first assumption is the assumption of

More information

Data exploration with Microsoft Excel: analysing more than one variable

Data exploration with Microsoft Excel: analysing more than one variable Data exploration with Microsoft Excel: analysing more than one variable Contents 1 Introduction... 1 2 Comparing different groups or different variables... 2 3 Exploring the association between categorical

More information

Univariate Regression

Univariate Regression Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

More information

Independent t- Test (Comparing Two Means)

Independent t- Test (Comparing Two Means) Independent t- Test (Comparing Two Means) The objectives of this lesson are to learn: the definition/purpose of independent t-test when to use the independent t-test the use of SPSS to complete an independent

More information

Projects Involving Statistics (& SPSS)

Projects Involving Statistics (& SPSS) Projects Involving Statistics (& SPSS) Academic Skills Advice Starting a project which involves using statistics can feel confusing as there seems to be many different things you can do (charts, graphs,

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

The Human Resource Information System Productiveness in Organization Culture and Its Importance

The Human Resource Information System Productiveness in Organization Culture and Its Importance The Human Resource Information System Productiveness in Organization Culture and Its Importance Dr. Shine David Assistant professor Institute Of Management Studies Devi Ahilya Vishwa Vidyalaya Indore shinelavi77@gmail.com

More information

Gamma Distribution Fitting

Gamma Distribution Fitting Chapter 552 Gamma Distribution Fitting Introduction This module fits the gamma probability distributions to a complete or censored set of individual or grouped data values. It outputs various statistics

More information

The Effect of Physical Exercise on Depression; Case study: Professional and Nonprofessional

The Effect of Physical Exercise on Depression; Case study: Professional and Nonprofessional Available online athttp://www.ijashss.com International Journal of Advanced Studies in Humanities and Social Science Volume 1, Issue 8, 2013: 1161-1165 The Effect of Physical Exercise on Depression; Case

More information

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln Log-Rank Test for More Than Two Groups Prepared by Harlan Sayles (SRAM) Revised by Julia Soulakova (Statistics)

More information

Roadmap to Data Analysis. Introduction to the Series, and I. Introduction to Statistical Thinking-A (Very) Short Introductory Course for Agencies

Roadmap to Data Analysis. Introduction to the Series, and I. Introduction to Statistical Thinking-A (Very) Short Introductory Course for Agencies Roadmap to Data Analysis Introduction to the Series, and I. Introduction to Statistical Thinking-A (Very) Short Introductory Course for Agencies Objectives of the Series Roadmap to Data Analysis Provide

More information

An introduction to using Microsoft Excel for quantitative data analysis

An introduction to using Microsoft Excel for quantitative data analysis Contents An introduction to using Microsoft Excel for quantitative data analysis 1 Introduction... 1 2 Why use Excel?... 2 3 Quantitative data analysis tools in Excel... 3 4 Entering your data... 6 5 Preparing

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST

EPS 625 INTERMEDIATE STATISTICS FRIEDMAN TEST EPS 625 INTERMEDIATE STATISTICS The Friedman test is an extension of the Wilcoxon test. The Wilcoxon test can be applied to repeated-measures data if participants are assessed on two occasions or conditions

More information

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION HOD 2990 10 November 2010 Lecture Background This is a lightning speed summary of introductory statistical methods for senior undergraduate

More information

WHAT IS A JOURNAL CLUB?

WHAT IS A JOURNAL CLUB? WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club

More information