Does item homogeneity indicate internal consistency or item redundancy in psychometric scales?
|
|
- Cynthia Skinner
- 7 years ago
- Views:
Transcription
1 Does item homogeneity indicate internal consistency or item redundancy in psychometric scales? By Gregory J. Boyle Department of Psychology, University of Queensland, St Lucia 4067, Queensland, Australia Abstract The term internal consistency has been used extensively in classical psychometrics to refer to the reliability of a scale based on the degree of withinscale item intercorrelation, as measured by say the split-half method, or more adequately by Cronbach's (1951) (Psychometrika, 16, ) alpha, as well as the KR 20 and KR 21 coefficients. This term is a misnomer, as a high estimate of internal item consistency/item homogeneity may also suggest a high level of item redundancy, wherein essentially the same item is rephrased in several different ways. Internal consistency or item homogeneity is often used for estimating intra-scale reliability, in terms of the item variances and covariances derived from a single occasion of measurement. While it is desirable that items in a psychometric scale measure something in common (i.e. exhibit uni-dimensionality), Hattie (1985) has indicated that there is still no satisfactory index. As Hattie (pp ) pointed out, a uni-dimensional scale (having an underlying latent trait), is not necessarily reliable, internally consistent or homogeneous. Hattie concluded that the frequent use of Cronbach s alpha coefficient as a measure of uni-dimensionality is not justified. Hattie further stated that, alpha can be high even if there is no general factor, since (1) it is influenced by the number of items and parallel repetitions of items, (2) it increases as the number of factors pertaining to each item increases, and (3) it decreases moderately as the item communalities increase. The subsequent assertion by Ray (1988) that internal consistency of a psychometric scale should be maximised, represents a further restatement of classical itemetric theory, and ignores the previous work of Hattie (1985), and many others, as outlined below. There is an optimal range of internal consistency/item homogeneity, if significant item redundancy is to be avoided (Boyle, 1983, ). According to Kline (1979, p. 3), with item inter-correlations which are lower than about 0.3, each part of the test must be measuring something different A higher correlation than (0.7), on the other hand suggests that the test is too narrow and too specific if one constructs items that are virtually paraphrases of each other, the results would be high internal consistency and very low validity. Furthermore, according to Kline (1986, p. 3), maximum validity is obtained where test items do not all correlate with each other, but where each correlates positively with the criterion. Such a test would have only low internal-consistency reliability.
2 As Cattell (1978) pointed out, a scale comprised of many items which are essentially repetitions of each other can appear in factor analysis as a bloated specific (as in Guilford s S-O-I model of intellectual structure, cf. Brody & Brody. 1976). Kline (1986, pp ) further remarked that high internal consistency can be antithetical to high validity the importance of internal-consistency reliability has been exaggerated in psychometry (i.e. I agree with Cattell) According to Hayes. Nelson and Jarrett (1987, p. 972). a measure could readily have treatment utility without internal consistency high internal consistency should not necessarily be expected. Likewise, as Allen and Potkay (1983. p. 1088). Lachar and Wirt (1981. p. 616) and McDonald (1981) have all shown, either high or low item homogeneity can be associated with either high or low reliability, despite classical itemetric opinion. According to McDonald (p. 113). Coefficient alpha cannot be used as a reliability coefficient McDonald (p. 100) has refuted on mathematical grounds, the commonly held belief that the alpha coefficient measures internal consistency or item homogeneity of a scale. McDonald (p. 110) stated that, it has never been made clear what is meant by internal consistency or why KR-20 or coefficient alpha can be deemed to measure it confusion pertaining to coefficient alpha has a long history reviewed by Green, Lissitz & Mulaik (1977). Furthermore, McDonald (p. 111) concluded that, alpha has not been shown to be a quantitative measure of any intelligible and useful psychometric concept, except when computed from items with equal covariances. This conclusion was based on the original use of item homogeneity as an estimate of scale reliability by Gulliksen (1950). which was shown by Lord and Novick (1968) to be valid only when items are tau equivalent. Accordingly, it may often be more appropriate to regard estimates such as the alpha coefficient as indicators of item redundancy and narrowness of a scale (cf. Boyle. 1985). Items should be selected which are loaded maximally by the factor representing that scale, but which exhibit moderate to low item inter-correlations in order to maximise the breadth of measurement of the given factor. Merely adding additional items to a scale as classical itemetrics has advocated in accord with the Spearman Brown formula, ignores the error variance associated with each item, and must be regarded by any contemporary and objective assessment (such as demonstrated with LISREL congeneric factor analysis-joreskog & Sorbom, 1989), as being a rather unsophisticated method of increasing scale reliability. Ray (1988) uncritically cited Nunnally (1967) not (1978) -as well as Cronbach (1951) in restating classical reliability theory. However, Pedhazur (1982, p. 636) has indicated that Nunnally s classical approach to reliability failed to acknowledge that measurement errors are
3 often systematic and non-random. Ray s comments arc therefore founded on psychometric views published over 20 years ago! Ray (1988) claimed that broad validity of a scale is facilitated by the use of subscales. However, this ignores the fact that in many multidimensional psychometric instruments (such as the EPI. EPQ, JEPI, 16PF. CAQ, 8SQ, POMS, DES-IV, MAACL, etc.) each subscale actually measures a discrete factor analytic dimension. Despite Ray s dogmatic assertions, semantic overlap of items is only one possible influence on observed item inter-correlations, as indicated above in relation to Hattie s (1985) work. As well, Ray made no distinction between state vs trait scales (cf. Boyle, 1983, , 1987). While a reliable trait scale should exhibit high test-retest correlations for both immediate retest (dependability) and for longer term retest (stability), a reliable state scale should exhibit only a high dependability coefficient, if the scale is truly sensitive to situational variability in mood. Ray and Pedersen (1986) asserted on the basis of a highly biased, unrepresentative and very restricted sample of the U.S.A. population, that Eysenck s Psychoticism scale in the EPQ was a failed experiment, not on the grounds of inadequate validity, but again merely on the basis of dated classical itemetric references. Ray objected to Eysenck s Psychoticism scale because he found that the mean item inter-correlations were only moderate. Yet, Ray s results with the EPQ were clearly biased due to severe restriction of variance in his data. Ray (1988) subsequently criticised Smedslund (1987) for not appreciating the virtues of the EPQ, despite denigrating it in the Ray and Pedersen note (cf. Smedslund. 1988). This amounts to little more than the pot calling the kettle black. Ray (1988) recommended Comrey s FHID approach to scale construction with the aim of increasing scale reliability. However, he was mistaken as to the actual composition of the item parcels in the CPS (four items counterbalanced for direction of scoring, not three as stated). While it is undoubtedly true that such item-parcel variables are more reliable than items as such, nevertheless, for a specified number of items in a scale, less of the pertinent construct is actually measured. Moreover, Cattell (1973. p. 360) has indicated that, The high homogeneity in the FHIDs is carried over with the second-order factor scales, leaving them excessively homogeneous. Hence, Ray s assertions concerning scale construction with itemparcel variables would seem quite inadequate. Cattell (1973. pp ; pp ; 1982) has argued that generally there is an optimally low level of item homogeneity. Cattell provided a conceptual demonstration of high item validity in the context of zero item homogeneity. Since a scale which is valid must also be reliable, it follows that it is theoretically possible for a scale to be reliable even though the internal consistency is zero. On the other hand, it is well known that even a highly reliable scale is not necessarily valid. Any number of invalid scales can be made more reliable simply by adding further invalid items in accord with the Spearman-Brown prophecy formula, and/or by adding further items which are essentially mere repetitions of the items already included in the scale. Ray s (1988) recommendations, if followed, can only result in significant item redundancy and likely contamination of the factor purity of psychometric scales.
4 The advantage of moderate to low item homogeneity is seen in multiple regression analysis, wherein a higher multiple R is produced from predictor variables (items) with only moderate item inter-correlations. Cattell s behavioural dispersion principal suggests that only when there is considerable item diversity, enabling sampling of behaviours across a wide spectrum of life expressions, can individuals be advantaged equally in responding to the items in a particular psychometric scale. As well, reduced item homogeneity facilitates the maintenance of validity across different cultures. A given item may elicit discrepant responses in different cultural settings. If there is high item homogeneity and most of the items are similar (i.e. there is significant item redundancy cf. Boyle, 1985), measurement error due to cultural distortions probably will be evident. This problem can be minimised by including a wide diversity of items (i.e. maximising breadth of measurement) in psychometric scale construction. Cattell indicated that a scale which has high internal consistency is probably contaminated on the one hand, by a bloated specific factor (such as in Guilford s S-O- I model), wherein over-inclusion of particular items pertaining to a specific dimension, gives the impression of a substantive factor, despite its lack of practical significance and evident triviality. On the other hand, psychometric scale contamination occurs by inclusion of several items predictive of an unwanted common factor. Cattell (1978. p. 289) demonstrated that, a very narrow specific can be blown up to the apparent status of a common factor in any given matrix by entering the experiment with several items that arc close variants on the specific variable. In this instance, item homogeneity (internal consistency) is increased by confounding the true factor with a bloated specific. Selection of items with high homogeneity/internal consistency, undoubtedly often results in a scale with a contaminated factor structure. To minimise these distorting influences, it is desirable to invoke suppressor action by including items that arc loaded positively and negatively on the unwanted dimensions, which also are loaded significantly on the relevant common factor. In contrast to Ray s (1988) restatement of classical itemetric opinion, Cattell (1973. p. 359) asserted that. In practice the random tendency to opposite loadings on these other factors will reduce the item homogeneity virtually to zero. Item diversity therefore, results in reduced item homogeneity and concomitantly, reduced item inter-correlations, but maximises breadth of measurement of a given construct. However, Cattell cautioned that, since low homogeneity means different specific factors and suppressor action by opposite loadings on unwanted common factors a test which (misguidedly) advertises high homogeneity is contaminated either with a bloated specific or by items sharing a common unwanted factor. In summary, high internal consistency/item homogeneity results spuriously from the inadvertent inclusion of essentially similar items in a psychometric scale.
5 Determination of what should be considered appropriate item homogeneity for a scale is, according to Cattell (1973. pp ) far more complex than is commonly considered in classical itemetrics The complexity is generated on the one side by the natural history of the domain and on the other by the unusual complexity of the purely statistical psychometric laws involved. According to Cattell, to obtain a broad but valid, behaviourally based rather than semantically based scale, test constructors will need to sift by factor analysis hundreds of items to get those having validity despite high diversity. In this regard, the newer congeneric factor analytic methods using programs such as LISREL (Joreskog & Sorbom, 1989) will undoubtedly minimise the amount of noise which is so prevalent among the items of many existing psychometric scales, designed along classical psychometric lines, wherein internal consistency has been maximised. This traditional itemetric view of intra-class correlation still persists in the contemporary psychometric literature [e.g. Crocker & Algina, 1986, pp : Cronbach, 1990, pp ; Ferguson, 1981, pp : also see Boyle, 1987, for a discussion of the limitations of the (1985) AERA/APA/NCME Standards in this regard]. However, especially in the non-ability areas of motivation, personality and mood states, moderate to low item homogeneity is actually preferred if one is to ensure a broad coverage of the particular constructs being measured. References Allen and Potkay, B.P. Allen and C.R. Potkay, Just as arbitrary as ever: comments on Zuckerman's rejoinder. Journal of Personality and Social Psychology 44 (1983), pp Boyle, G.J. Boyle, Critical review of state-trait curiosity test development. Motivation and Emotion 7 (1983), pp Boyle, G.J. Boyle, Self-report measures of depression: some psychometric considerations. British Journal of Clinical Psychology 24 (1985), pp Boyle, G.J. Boyle, Higher-order factors in the Differential Emotions Scale (DES-III). Personality and Individual Differences 7 (1986), pp Boyle, G.J. Boyle, Review of the (1985) Standards for educational and psychological testing: AERA, APA and NCME.. Australian Journal of Psychology 39 (1987), pp Brody and Brody, E.B. Brody and N. Brody, Intelligence: Nature, determinants, and consequences., Academic Press, New York (1976). Cattell, R.B. Cattell, Personality and mood by questionnaire., Jossey-Bass, San Francisco, CA (1973). Cattell, R.B. Cattell, Scientific use of factor analysis in behavioral and life sciences., Plenum Press, New York (1978).
6 Cattell, R.B. Cattell, The psychometry of objective motivation measurement: a response to the critique of Cooper and Kline. British Journal of Educational Psychology 52 (1982), pp Crocker and Algina, L. Crocker and J. Algina, Introduction to classical and modern test theory., Holt, Rinehart & Winston, New York (1986). Cronbach, L.J. Cronbach, Coefficient alpha and the internal consistency of tests. Psychometrika 16 (1951), pp Cronbach, L.J. Cronbach, Essentials of psychological testing. (5th edn. ed.), Harper & Row, New York (1990). Ferguson, G.A. Ferguson, Statistical analysis in psychology and education. (5th edn. ed.),, McGraw-Hill, Auckland (1981). Green et al., S.B. Green, R.W. Lissitz and S.A. Mulaik, Limitations of coefficient alpha as an index of test unidimensionality. Educational and Psychological Measurement 37 (1977), pp Gulliksen, H. Gulliksen, Theory of mental tests., Wiley, New York (1950). Hattie, J. Hattie, Methodology review: assessing unidimensionality of tests and items. Applied Psychological Measurement 9 (1985), pp Hayes et al., S.C. Hayes, R.O. Nelson and J.B. Jarrett, The treatment utility of assessment: a functional approach to evaluating assessment quality. American Psychologist 42 (1987), pp Jöreskog and Sörbom, K.G. Jöreskog and D. Sörbom, LISREL 7: A guide to the program and applications., SPSS Inc., Chicago, IL (1989). Kline, P. Kline, Psychometrics and psychology. Academic Press, London (1979). Kline, P. Kline, A handbook of test construction: Introduction to psychometric design., Methuen, New York (1986). Lachar and Wirt, D. Lachar and R.D. Wirt, A data-based analysis of the psychometric performance of the Personality Inventory for Children (PIC): an alternative to the Achenbach review. Journal of Personality Assessment 45 (1981), pp Lord and Novick, F.M. Lord and M.R. Novick, Statistical theories of mental test scores., Addison-Wesley, Reading, MA (1968). McDonald, R.P. McDonald, The dimensionality of tests and items. British Journal of Mathematical and Statistical Psychology 34 (1981), pp
7 Nunnally, 1967/1978. J.C. Nunnally, Psychometric theory., McGraw-Hill, New York (1967/1978). Pedhazur, E.J. Pedhazur, Multiple regression in behavioral research., Holt, Rinehart & Winston, New York (1982). Ray, J.J. Ray, Semantic overlap between scale items may be a good thing: reply to Smedslund. Scandinavian Journal of Psychology 29 (1988), pp Ray and Pedersen, J.J. Ray and R. Pedersen, Internal consistency in the Eysenck Psychoticism scale. Journal of Psychology 120 (1986), pp Smedslund, J. Smedslund, The epistemic status of inter-item correlations in Eysenck's Personality Questionnaire: the a priori versus the empirical in psychological data. Scandinavian Journal of Psychology 28 (1987), pp Smedslund, J. Smedslund, What is measured by a psychological measure?. Scandinavian Journal of Psychology 29 (1988), pp Standards for educational and psychological testing: AERA/APA/NCME, American Psychological Association, Washington, DC (1985).
Richard E. Zinbarg northwestern university, the family institute at northwestern university. William Revelle northwestern university
psychometrika vol. 70, no., 23 33 march 2005 DOI: 0.007/s336-003-0974-7 CRONBACH S α, REVELLE S β, AND MCDONALD S ω H : THEIR RELATIONS WITH EACH OTHER AND TWO ALTERNATIVE CONCEPTUALIZATIONS OF RELIABILITY
More informationInternal Consistency: Do We Really Know What It Is and How to Assess It?
Journal of Psychology and Behavioral Science June 2014, Vol. 2, No. 2, pp. 205-220 ISSN: 2374-2380 (Print) 2374-2399 (Online) Copyright The Author(s). 2014. All Rights Reserved. Published by American Research
More informationX = T + E. Reliability. Reliability. Classical Test Theory 7/18/2012. Refers to the consistency or stability of scores
Reliability It is the user who must take responsibility for determining whether or not scores are sufficiently trustworthy to justify anticipated uses and interpretations. (AERA et al., 1999) Reliability
More informationConstructing a TpB Questionnaire: Conceptual and Methodological Considerations
Constructing a TpB Questionnaire: Conceptual and Methodological Considerations September, 2002 (Revised January, 2006) Icek Ajzen Brief Description of the Theory of Planned Behavior According to the theory
More informationQ FACTOR ANALYSIS (Q-METHODOLOGY) AS DATA ANALYSIS TECHNIQUE
Q FACTOR ANALYSIS (Q-METHODOLOGY) AS DATA ANALYSIS TECHNIQUE Gabor Manuela Rozalia Petru Maior Univerity of Tg. Mure, Faculty of Economic, Legal and Administrative Sciences, Rozalia_gabor@yahoo.com, 0742
More informationEstimate a WAIS Full Scale IQ with a score on the International Contest 2009
Estimate a WAIS Full Scale IQ with a score on the International Contest 2009 Xavier Jouve The Cerebrals Society CognIQBlog Although the 2009 Edition of the Cerebrals Society Contest was not intended to
More informationBasic Concepts in Classical Test Theory: Relating Variance Partitioning in Substantive Analyses. to the Same Process in Measurement Analyses ABSTRACT
Basic Concepts in Classical Test Theory: Relating Variance Partitioning in Substantive Analyses to the Same Process in Measurement Analyses Thomas E. Dawson Texas A&M University ABSTRACT The basic processes
More informationChapter 3 Psychometrics: Reliability & Validity
Chapter 3 Psychometrics: Reliability & Validity 45 Chapter 3 Psychometrics: Reliability & Validity The purpose of classroom assessment in a physical, virtual, or blended classroom is to measure (i.e.,
More informationExploring Graduates Perceptions of the Quality of Higher Education
Exploring Graduates Perceptions of the Quality of Higher Education Adee Athiyainan and Bernie O Donnell Abstract Over the last decade, higher education institutions in Australia have become increasingly
More informationRESEARCH METHODS IN I/O PSYCHOLOGY
RESEARCH METHODS IN I/O PSYCHOLOGY Objectives Understand Empirical Research Cycle Knowledge of Research Methods Conceptual Understanding of Basic Statistics PSYC 353 11A rsch methods 01/17/11 [Arthur]
More informationOriginal Article. Charles Spearman: British Behavioral Scientist
The Human Nature Review ISSN 1476-1084 URL of this document http://human-nature.com/nibbs/03/spearman.html Human Nature Review 3 (2003) 114-118 Original Article Charles Spearman: British Behavioral Scientist
More informationReporting and Interpreting Scores Derived from Likert-type Scales
Journal of Agricultural Education, 55(5), 30-47. doi: 10.5032/jae.2014.05030 Derived from Likert-type Scales J. Robert Warmbrod 1 Abstract Forty-nine percent of the 706 articles published in the Journal
More informationExploratory Factor Analysis
Exploratory Factor Analysis ( 探 索 的 因 子 分 析 ) Yasuyo Sawaki Waseda University JLTA2011 Workshop Momoyama Gakuin University October 28, 2011 1 Today s schedule Part 1: EFA basics Introduction to factor
More informationGuided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.
Guided Reading Educational Research: Competencies for Analysis and Applications 9th Edition EDFS 635: Educational Research Chapter 1: Introduction to Educational Research 1. List and briefly describe the
More informationA PARADIGM FOR DEVELOPING BETTER MEASURES OF MARKETING CONSTRUCTS
A PARADIGM FOR DEVELOPING BETTER MEASURES OF MARKETING CONSTRUCTS Gilber A. Churchill (1979) Introduced by Azra Dedic in the course of Measurement in Business Research Introduction 2 Measurements are rules
More informationReliability Overview
Calculating Reliability of Quantitative Measures Reliability Overview Reliability is defined as the consistency of results from a test. Theoretically, each test contains some error the portion of the score
More informationValidity and Reliability in Social Science Research
Education Research and Perspectives, Vol.38, No.1 Validity and Reliability in Social Science Research Ellen A. Drost California State University, Los Angeles Concepts of reliability and validity in social
More informationThis chapter discusses some of the basic concepts in inferential statistics.
Research Skills for Psychology Majors: Everything You Need to Know to Get Started Inferential Statistics: Basic Concepts This chapter discusses some of the basic concepts in inferential statistics. Details
More informationInstrument Validation Study. Regarding Leadership Circle Profile. By Industrial Psychology Department. Bowling Green State University
Instrument ValidationStudy RegardingLeadershipCircleProfile ByIndustrialPsychologyDepartment BowlingGreenStateUniversity InstrumentValidationStudy ExecutiveSummaryandResponsetotheRecommendations ThefollowingvaliditystudyonTheLeadershipCircleProfile(TLCP)isanindependentstudy.It
More informationRESEARCH METHODS IN I/O PSYCHOLOGY
RESEARCH METHODS IN I/O PSYCHOLOGY Objectives Understand Empirical Research Cycle Knowledge of Research Methods Conceptual Understanding of Basic Statistics PSYC 353 11A rsch methods 09/01/11 [Arthur]
More informationCHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES. From Exploratory Factor Analysis Ledyard R Tucker and Robert C.
CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES From Exploratory Factor Analysis Ledyard R Tucker and Robert C MacCallum 1997 180 CHAPTER 8 FACTOR EXTRACTION BY MATRIX FACTORING TECHNIQUES In
More informationMATHEMATICS AS THE CRITICAL FILTER: CURRICULAR EFFECTS ON GENDERED CAREER CHOICES
MATHEMATICS AS THE CRITICAL FILTER: CURRICULAR EFFECTS ON GENDERED CAREER CHOICES Xin Ma University of Kentucky, Lexington, USA Using longitudinal data from the Longitudinal Study of American Youth (LSAY),
More informationExploring Epistemological Beliefs and Conceptual Change in Undergraduate Psychology Students
Exploring Epistemological Beliefs and Conceptual Change in Undergraduate Psychology Students Armin Günther, Günter Krampen, Gabriel Schui, Anne- Kathrin Mayer, Johannes Peter, Nikolas Leichner 8th International
More informationDesigning a Questionnaire
Designing a Questionnaire What Makes a Good Questionnaire? As a rule of thumb, never to attempt to design a questionnaire! A questionnaire is very easy to design, but a good questionnaire is virtually
More informationExtending the debate between Spearman and Wilson 1929: When do single variables optimally reproduce the common part of the observed covariances?
1 Extending the debate between Spearman and Wilson 1929: When do single variables optimally reproduce the common part of the observed covariances? André Beauducel 1 & Norbert Hilger University of Bonn,
More informationTest Reliability Indicates More than Just Consistency
Assessment Brief 015.03 Test Indicates More than Just Consistency by Dr. Timothy Vansickle April 015 Introduction is the extent to which an experiment, test, or measuring procedure yields the same results
More informationPsychological measurements: their uses and misuses
Psychological measurements: their uses and misuses Measure all that can be measured and render measurable all that defies measurement. Galileo Galilei Not everything that counts can be counted, and not
More informationGlossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Battery Bias
Glossary of Terms Ability A defined domain of cognitive, perceptual, psychomotor, or physical functioning. Accommodation A change in the content, format, and/or administration of a selection procedure
More informationPARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA
PARTIAL LEAST SQUARES IS TO LISREL AS PRINCIPAL COMPONENTS ANALYSIS IS TO COMMON FACTOR ANALYSIS. Wynne W. Chin University of Calgary, CANADA ABSTRACT The decision of whether to use PLS instead of a covariance
More informationTest Bias. As we have seen, psychological tests can be well-conceived and well-constructed, but
Test Bias As we have seen, psychological tests can be well-conceived and well-constructed, but none are perfect. The reliability of test scores can be compromised by random measurement error (unsystematic
More informationThe relationship between emotional intelligence and school management
Available Online at http://iassr.org/journal 2013 (c) EJRE published by International Association of Social Science Research - IASSR ISSN: 2147-6284 European Journal of Research on Education, 2013, 1(1),
More informationFactorial Invariance in Student Ratings of Instruction
Factorial Invariance in Student Ratings of Instruction Isaac I. Bejar Educational Testing Service Kenneth O. Doyle University of Minnesota The factorial invariance of student ratings of instruction across
More information[This document contains corrections to a few typos that were found on the version available through the journal s web page]
Online supplement to Hayes, A. F., & Preacher, K. J. (2014). Statistical mediation analysis with a multicategorical independent variable. British Journal of Mathematical and Statistical Psychology, 67,
More informationSyllabus for Psychology 492 Psychological Measurement Winter, 2006
Instructor: Jonathan Oakman 888-4567 x3659 jmoakman@uwaterloo.ca Syllabus for Psychology 492 Psychological Measurement Winter, 2006 Office Hours: Jonathan Oakman: PAS 3015 TBA Teaching Assistants: Office
More informationBRIEF REPORT: Short Form of the VIA Inventory of Strengths: Construction and Initial Tests of Reliability and Validity
International Journal of Humanities Social Sciences and Education (IJHSSE) BRIEF REPORT: Short Form of the VIA Inventory of Strengths: Construction and Initial Tests of Reliability and Validity Hadassah
More informationLevels of Measurement. 1. Purely by the numbers numerical criteria 2. Theoretical considerations conceptual criteria
Levels of Measurement 1. Purely by the numbers numerical criteria 2. Theoretical considerations conceptual criteria Numerical Criteria 1. Nominal = different categories based on some kind of typology 2.
More informationValidation of the Core Self-Evaluations Scale research instrument in the conditions of Slovak Republic
Validation of the Core Self-Evaluations Scale research instrument in the conditions of Slovak Republic Lenka Selecká, Jana Holienková Faculty of Arts, Department of psychology University of SS. Cyril and
More informationOverview of Factor Analysis
Overview of Factor Analysis Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August 1,
More informationReliability Analysis
Measures of Reliability Reliability Analysis Reliability: the fact that a scale should consistently reflect the construct it is measuring. One way to think of reliability is that other things being equal,
More information2011 Validity and Reliability Results Regarding the SIS
2011 Validity and Reliability Results Regarding the SIS Jon Fortune, Ed.D., John, Agosta, Ph.D., March 14, 2011 and Julie Bershadsky, Ph.D. Face Validity. Developed to measure the construct of supports,
More informationCorrelational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots
Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship
More informationHistory and Purpose of the Principles for the Validation and Use of Personnel Selection Procedures
The following summary of the Principles for the Validation and Use of Personnel Selection Procedures has been prepared by the State Personnel Board s Test Validation and Construction Unit. Introduction
More informationThe Revised Dutch Rating System for Test Quality. Arne Evers. Work and Organizational Psychology. University of Amsterdam. and
Dutch Rating System 1 Running Head: DUTCH RATING SYSTEM The Revised Dutch Rating System for Test Quality Arne Evers Work and Organizational Psychology University of Amsterdam and Committee on Testing of
More informationANALYZING TWO ASSUMPTIONS UNDERLYING THE SCORING OF CLASSROOM ASSESSMENTS
ANALYZING TWO ASSUMPTIONS UNDERLYING THE SCORING OF CLASSROOM ASSESSMENTS by Robert J. Marzano Aurora, Colorado January, 2000 Assessment is one of the most fundamental of classroom activities. Research
More informationThe Relationship between Social Intelligence and Job Satisfaction among MA and BA Teachers
Kamla-Raj 2012 Int J Edu Sci, 4(3): 209-213 (2012) The Relationship between Social Intelligence and Job Satisfaction among MA and BA Teachers Soleiman Yahyazadeh-Jeloudar 1 and Fatemeh Lotfi-Goodarzi 2
More informationThis chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
More informationWHAT IS A JOURNAL CLUB?
WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club
More informationFactor Rotations in Factor Analyses.
Factor Rotations in Factor Analyses. Hervé Abdi 1 The University of Texas at Dallas Introduction The different methods of factor analysis first extract a set a factors from a data set. These factors are
More informationStatistics, Research, & SPSS: The Basics
Statistics, Research, & SPSS: The Basics SPSS (Statistical Package for the Social Sciences) is a software program that makes the calculation and presentation of statistics relatively easy. It is an incredibly
More informationApplication of a Psychometric Rating Model to
Application of a Psychometric Rating Model to Ordered Categories Which Are Scored with Successive Integers David Andrich The University of Western Australia A latent trait measurement model in which ordered
More informationPart III. Item-Level Analysis
Part III Item-Level Analysis 6241-029-P3-006-2pass-r02.indd 169 1/16/2013 9:14:56 PM 6241-029-P3-006-2pass-r02.indd 170 1/16/2013 9:14:57 PM 6 Exploratory and Confirmatory Factor Analysis Rex Kline 6.1
More informationPilot Testing and Sampling. An important component in the data collection process is that of the pilot study, which
Pilot Testing and Sampling An important component in the data collection process is that of the pilot study, which is... a small-scale trial run of all the procedures planned for use in the main study
More informationHow to report the percentage of explained common variance in exploratory factor analysis
UNIVERSITAT ROVIRA I VIRGILI How to report the percentage of explained common variance in exploratory factor analysis Tarragona 2013 Please reference this document as: Lorenzo-Seva, U. (2013). How to report
More informationApplications of Structural Equation Modeling in Social Sciences Research
American International Journal of Contemporary Research Vol. 4 No. 1; January 2014 Applications of Structural Equation Modeling in Social Sciences Research Jackson de Carvalho, PhD Assistant Professor
More informationEmotionally unstable? It spells trouble for work, relationships and life
Emotionally unstable? It spells trouble for work, relationships and life Rob Bailey and Tatiana Gulko, OPP Ltd Summary This presentation explores a range of studies of resilience using the 16PF questionnaire,
More informationGeneral Symptom Measures
General Symptom Measures SCL-90-R, BSI, MMSE, CBCL, & BASC-2 Symptom Checklist 90 - Revised SCL-90-R 90 item, single page, self-administered questionnaire. Can usually be completed in 10-15 minutes Intended
More informationCanonical Correlation Analysis
Canonical Correlation Analysis LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the similarities and differences between multiple regression, factor analysis,
More informationAssessment, Case Conceptualization, Diagnosis, and Treatment Planning Overview
Assessment, Case Conceptualization, Diagnosis, and Treatment Planning Overview The abilities to gather and interpret information, apply counseling and developmental theories, understand diagnostic frameworks,
More informationChoosing the Right Type of Rotation in PCA and EFA James Dean Brown (University of Hawai i at Manoa)
Shiken: JALT Testing & Evaluation SIG Newsletter. 13 (3) November 2009 (p. 20-25) Statistics Corner Questions and answers about language testing statistics: Choosing the Right Type of Rotation in PCA and
More informationScale Construction and Psychometrics for Social and Personality Psychology. R. Michael Furr
Scale Construction and Psychometrics for Social and Personality Psychology R. Michael Furr 00-Furr-4149-Prelims.indd 3 11/10/2010 5:18:13 PM 2 Core Principles, Best Practices, and an Overview of Scale
More informationARE OBSESSIVE BELIEFS AND INTERPRETATIVE BIAS OF INTRUSIONS PREDICTORS OF OBSESSIVE COMPULSIVE SYMPTOMATOLOGY? A study WITH A TURKISH SAMPLE
SOCIAL BEHAVIOR AND PERSONALITY, 2009, 37(3), 355-364 Society for Personality Research (Inc.) DOI 10.2224/sbp.2009.37.3.355 ARE OBSESSIVE BELIEFS AND INTERPRETATIVE BIAS OF INTRUSIONS PREDICTORS OF OBSESSIVE
More informationMultivariate Analysis of Variance (MANOVA)
Multivariate Analysis of Variance (MANOVA) Aaron French, Marcelo Macedo, John Poulsen, Tyler Waterson and Angela Yu Keywords: MANCOVA, special cases, assumptions, further reading, computations Introduction
More informationSEM Analysis of the Impact of Knowledge Management, Total Quality Management and Innovation on Organizational Performance
2015, TextRoad Publication ISSN: 2090-4274 Journal of Applied Environmental and Biological Sciences www.textroad.com SEM Analysis of the Impact of Knowledge Management, Total Quality Management and Innovation
More informationScores, 7: Immediate Recall, Delayed Recall, Yield 1, Yield 2, Shift, Total Suggestibility, Confabulation.
Gudjonsson Suggestibility Scales. Purpose: "Developed in order to measure objectively the vulnerability or proneness of people [to suggestive influence and/or] to give erroneous accounts when interviewed,"
More informationPsyD Psychology (2014 2015)
PsyD Psychology (2014 2015) Program Information Point of Contact Marianna Linz (linz@marshall.edu) Support for University and College Missions Marshall University is a multi campus public university providing
More informationASSESSMENT: Coaching Efficacy As Indicators Of Coach Education Program Needs
March, 2003 Volume 5, Issue 1 ASSESSMENT: Coaching Efficacy As Indicators Of Coach Education Program Needs Lena Fung, Ph.D. Department of Physical Education Hong Kong Baptist University Hong Kong, SAR
More informationThe Personal Learning Insights Profile Research Report
The Personal Learning Insights Profile Research Report The Personal Learning Insights Profile Research Report Item Number: O-22 995 by Inscape Publishing, Inc. All rights reserved. Copyright secured in
More informationTHE ACT INTEREST INVENTORY AND THE WORLD-OF-WORK MAP
THE ACT INTEREST INVENTORY AND THE WORLD-OF-WORK MAP Contents The ACT Interest Inventory........................................ 3 The World-of-Work Map......................................... 8 Summary.....................................................
More informationLearner Self-efficacy Beliefs in a Computer-intensive Asynchronous College Algebra Course
Learner Self-efficacy Beliefs in a Computer-intensive Asynchronous College Algebra Course Charles B. Hodges Georgia Southern University Department of Leadership, Technology, & Human Development P.O. Box
More informationMultiple Regression: What Is It?
Multiple Regression Multiple Regression: What Is It? Multiple regression is a collection of techniques in which there are multiple predictors of varying kinds and a single outcome We are interested in
More informationAbstract. Introduction
Predicting Talent Management Indices Using the 16 Primary Personality Factors John W. Jones, Ph.D.; Catherine C. Maraist, Ph.D.; Noelle K. Newhouse, M.S. Abstract This study investigates whether or not
More informationEvaluating a Fatigue Management Training Program For Coaches
Evaluating a fatigue management training program for coach drivers. M. Anthony Machin University of Southern Queensland Abstract A nonprescriptive fatigue management training program was developed that
More informationINVESTIGATING BUSINESS SCHOOLS INTENTIONS TO OFFER E-COMMERCE DEGREE-PROGRAMS
INVESTIGATING BUSINESS SCHOOLS INTENTIONS TO OFFER E-COMMERCE DEGREE-PROGRAMS Jean Baptiste K. Dodor College of Business Jackson State University HTUjeandodor@yahoo.comUTH 601-354-1964 Darham S. Rana College
More informationChapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
More informationChapter 1 Introduction. 1.1 Introduction
Chapter 1 Introduction 1.1 Introduction 1 1.2 What Is a Monte Carlo Study? 2 1.2.1 Simulating the Rolling of Two Dice 2 1.3 Why Is Monte Carlo Simulation Often Necessary? 4 1.4 What Are Some Typical Situations
More informationEnhancing Customer Relationships in the Foodservice Industry
DOI: 10.7763/IPEDR. 2013. V67. 9 Enhancing Customer Relationships in the Foodservice Industry Firdaus Abdullah and Agnes Kanyan Faculty of Business Management, Universiti Teknologi MARA Abstract. Intensification
More informationUndergraduate Psychology Major Learning Goals and Outcomes i
Undergraduate Psychology Major Learning Goals and Outcomes i Goal 1: Knowledge Base of Psychology Demonstrate familiarity with the major concepts, theoretical perspectives, empirical findings, and historical
More informationWhat is Effect Size? effect size in Information Technology, Learning, and Performance Journal manuscripts.
The Incorporation of Effect Size in Information Technology, Learning, and Performance Research Joe W. Kotrlik Heather A. Williams Research manuscripts published in the Information Technology, Learning,
More informationThe Self-Regulation Questionnaire (SRQ)
The Self-egulation Questionnaire (SQ) Self-regulation is the ability to develop, implement, and flexibly maintain planned behavior in order to achieve one's goals. Building on the foundational work of
More informationPublishing multiple journal articles from a single data set: Issues and recommendations
Publishing multiple journal articles from a single data set: Issues and recommendations By: Mark A. Fine and Lawrence A. Kurdek Fine, M. A., & Kurdek, L. A. (1994). Publishing multiple journal articles
More informationHarrison, P.L., & Oakland, T. (2003), Adaptive Behavior Assessment System Second Edition, San Antonio, TX: The Psychological Corporation.
Journal of Psychoeducational Assessment 2004, 22, 367-373 TEST REVIEW Harrison, P.L., & Oakland, T. (2003), Adaptive Behavior Assessment System Second Edition, San Antonio, TX: The Psychological Corporation.
More informationA Reasoned Action Explanation for Survey Nonresponse 1
Pp. 101-110 in: Seppo Laaksonen (Ed.). (1996). International Perspectives on Nonresponse. Helsinki: Statistics Finland. A Reasoned Action Explanation for Survey Nonresponse 1 Joop Hox Department of Education,
More informationinterpretation and implication of Keogh, Barnes, Joiner, and Littleton s paper Gender,
This essay critiques the theoretical perspectives, research design and analysis, and interpretation and implication of Keogh, Barnes, Joiner, and Littleton s paper Gender, Pair Composition and Computer
More informationMultidimensional Constructs in Organizational Behavior Research: An Integrative Analytical Framework
ORGANIZATIONAL Edwards / MULTIDIMENSIONAL RESEARCH CONSTRUCTS METHODS Multidimensional Constructs in Organizational Behavior Research: An Integrative Analytical Framework JEFFREY R. EDWARDS University
More informationLEARNING OUTCOMES FOR THE PSYCHOLOGY MAJOR
LEARNING OUTCOMES FOR THE PSYCHOLOGY MAJOR Goal 1. Knowledge Base of Psychology Demonstrate familiarity with the major concepts, theoretical perspectives, empirical findings, and historical trends in psychology.
More informationUsing a Mental Measurements Yearbook Review to Evaluate a Test
Using a Mental Measurements Yearbook Review to Evaluate a Test Anthony J. Nitko Professor Emeritus of Psychology in Education, University of Pittsburgh Adjunct Professor of Educational Psychology, University
More informationCognitive Behavior Group Therapy in Mathematics Anxiety
299 Journal of the Indian Academy of Applied Psychology July 2009, Vol. 35, No. 2, 299-303. Cognitive Behavior Group Therapy in Mathematics Anxiety Ayatollah Karimi and S Venkatesan All Indian Institute
More informationA REVIEW OF SCALE DEVELOPMENT PRACTICES IN NONPROFIT MANAGEMENT AND MARKETING
Walter Wymer, Helena Maria Baptista Alves 143 Walter Wymer, Helena Maria Baptista Alves, A Review of Scale Development Practices in Nonprofit Management and Marketing, Economics & Sociology, Vol. 5, No
More informationWhat Is a Case Study? series of related events) which the analyst believes exhibits (or exhibit) the operation of
What Is a Case Study? Mitchell (1983) defined a case study as a detailed examination of an event (or series of related events) which the analyst believes exhibits (or exhibit) the operation of some identified
More informationExpectancy Value Theory: Motivating Healthcare Workers
Expectancy Value Theory: Motivating Healthcare Workers Stefania De Simone Researcher in Organizational Behavior Institute for Research on Innovation and Services for Development National Research Council
More informationTest-Retest Reliability and The Birkman Method Frank R. Larkey & Jennifer L. Knight, 2002
Test-Retest Reliability and The Birkman Method Frank R. Larkey & Jennifer L. Knight, 2002 Consultants, HR professionals, and decision makers often are asked an important question by the client concerning
More informationStructural Equation Modelling (SEM)
(SEM) Aims and Objectives By the end of this seminar you should: Have a working knowledge of the principles behind causality. Understand the basic steps to building a Model of the phenomenon of interest.
More informationQuantitative Research: Reliability and Validity
Quantitative Research: Reliability and Validity Reliability Definition: Reliability is the consistency of your measurement, or the degree to which an instrument measures the same way each time it is used
More informationT-test & factor analysis
Parametric tests T-test & factor analysis Better than non parametric tests Stringent assumptions More strings attached Assumes population distribution of sample is normal Major problem Alternatives Continue
More informationStatistics. Measurement. Scales of Measurement 7/18/2012
Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does
More informationReport on the Ontario Principals Council Leadership Study
Report on the Ontario Principals Council Leadership Study (February 2005) Howard Stone 1, James D. A. Parker 2, and Laura M. Wood 2 1 Learning Ways Inc., Ontario 2 Department of Psychology, Trent University,
More informationMEASURING INFORMATION QUALITY OF WEB SITES: DEVELOPMENT OF AN INSTRUMENT
MEASURING INFORMATION QUALITY OF WEB SITES: DEVELOPMENT OF AN INSTRUMENT Pairin Katerattanakul Keng Siau College of Business Administration University of Nebraska, Lincoln U.S.A. Abstract Web sites have
More informationValidity, Fairness, and Testing
Validity, Fairness, and Testing Michael Kane Educational Testing Service Conference on Conversations on Validity Around the World Teachers College, New York March 2012 Unpublished Work Copyright 2010 by
More information