Psychometrics 101 Part 2: Essentials of Test Score Interpretation. Steve Saladin, Ph.D. University of Idaho

Size: px
Start display at page:

Download "Psychometrics 101 Part 2: Essentials of Test Score Interpretation. Steve Saladin, Ph.D. University of Idaho"

Transcription

1 Psychometrics 101 Part 2: Essentials of Test Score Interpretation Steve Saladin, Ph.D. University of Idaho

2 Standards for Educational and Psychological Testing Those responsible for testing programs should provide appropriate interpretations when test score information is released to students, parents, legal representatives, teachers, or the media. The interpretations should describe in simple language what the test covers, what scores mean, common misinterpretations of test scores, and how scores will be used.

3 Pay no attention to the man behind the curtain! L. Frank Baum, The Wonderful Wizard of Oz

4 Where are we going today? A score is only as good as the test Ø Reliability Ø Validity Not all scales are created equal Ø Nominal to Ratio A score by any other name Ø Norm or Criterion referenced Yeah, but what does my score mean?

5 Error, Error Everywhere No test is perfect, no measurement is perfect There is always error in any measurement Score = Truth + Error

6 Error, Error Everywhere Error can be lots of things including Ø The environment

7 Error, Error Everywhere Error can be lots of things including Ø The environment Ø The test-taker

8 Error, Error Everywhere Error can be lots of things including Ø The environment Ø The test-taker Ø Procedural variations*

9 Error, Error Everywhere Error can be lots of things including Ø The environment Ø The test-taker Ø Procedural variations Ø The test itself John has 13 stickers. Jill gives him 10 more. How many stickers does Jack have? Since error makes scores inconsistent or unreliable, a measure of reliability of scores is important

10 Reliability Test-Retest Ø Test group on two different occasions and correlate the results Ø Are results stable over time Ø Does not mean they are the same scores, only that they stay the same relative to each other

11 Reliability Test-Retest Inter-Rater Ø Two people score same test Ø Are results the same Ø Important when there is element of subjectivity

12 Reliability Test-Retest Inter-Rater Internal Consistency Ø Correlate score on each item to total Ø Are they all measuring the same thing All correlations so subject to same problems

13 Things that artificially effect reliability estimates Length of test Ø Longer test = higher reliability More questions, differing response on one item has less effect May be artificial, but shorter tests are inherently less reliable Variability in what is being measured Ø Restriction of range If everyone scores the same, you can t see that low scores tend to remain low and high scores tend to remain high Ø Size of sample Small sample makes it hard to find relationship

14 Some Facts About Correlation Small samples may miss relationship Heterogeneous samples may miss relationship

15 So what s good? SAT reports reliabilities of Ø Test Characteristics of the SAT on ACCUPLACER reports reliabilities ranging from.84 for Sentence Skills to.90 for Arithmetic Ø College Board webinar on ACCUPLACER cut scores CLEP College Algebra estimates reliability of 0.90 Ø Test Information Guide: College-Level Examination Program , College Algebra 0.90 is great 0.80 should be used with caution if at all

16 Reliability & Error in Individual Scores Can t totally get rid of Error, but can estimate how much is there Using reliability you can estimate how much a persons score would vary due to error. Standard Error of the Measurement Ø SEM =SD * 1 r Ø an index of the extent to which an individual s scores vary over multiple administrations Ø gives the range within which the true score is likely to exist

17 Theoretical distribution of scores % % % 14% 34% 34% 14% 2% % 16% 50% 84% 96% 1 SEM below to 1 SEM above = 68% confidence 2 SEM below to 2 SEM above = 95% confidence

18 SEM for some tests ACCUPLACER Diagnostic scores (about 1.3 average), so 68% confidence interval for score of 10 is 9-11, 95% confidence is 7-13 ACT Composite SEM.91, so 68% confidence interval for score of 20 is 19-21, 95% confidence is Ø ACT Technical Manual WAIS-IV FSIQ SEM is 2.16, so 68 % confidence interval for score of 100 is , 95% confidence is

19 Are scores on two tests really different? Standard Error of the Difference Ø SE diff = Sqr Root of SEM SEM 2 2 Ø must be on same scale Ø Diff >= 1 Se diff 68% confident Ø Diff >= 2 Se diff 95% confident This is the scientific approach Can also use a common sense approach

20 Are these score really different? ACT Composite scores of 18 and 20? Ø 68% confident 18 is really Ø 68% confident 20 is really Ø Confidence band overlaps, so difference could easily be due to error ACT Composite scores of 20 and 23 Ø 68% confident 20 is really Ø 68% confident 23 is really Ø Confidence bands do not overlap so at least 68% sure the difference is not due to error

21 Does Reliability = Validity? Getting a consistent result means reliability NO! Having that result be meaningful is validity Validity is based on inferences you make from results Ø Test has to be reliable to be valid Ø Test does not have to be valid to be reliable

22 Validity Any evidence that a test measures what it says it is measuring Any evidence that inferences made from the test are useful and meaningful

23 Validity 3 types of evidence Ø Content Ø Criterion-Related Ø Construct

24 Content Validity Think of a test as a sample of possible problems/items Ø 4 th grade spelling test should be a representative sample of 4 th grade spelling words Ø GRE Quantitative should be a representative sample of the math problems a grad school applicant might be expected to solve Should be part of design Ø Identifying # of algebra, trig, calculus, etc. should be on test (table of specifications) Frequently evaluated by item analysis or expert opinions

25 Criterion-Related Validity How does test score correlate with some external measure (criterion) Ø Placement test score and performance in class Ø Admission test score and GPA for first semester Sometimes called Predictive or Concurrent Validity Correlation that is effected by error in the test and error in the criterion Ø Only top students take GRE Ø Graduate School grade restriction

26 To use or not to use. Depends on the question. Ø What is impact of decision? Ø What is cost of using? Of not using? Decision Theory can be a guide to determining incremental validity Ø Net gain in using scores

27 Construct Validity Most important for psychological test where what you are measuring is abstract or theoretical Ø Intelligence Ø Personality characteristics Ø Attitudes and beliefs Usually involves multiple pieces of evidence

28 Not all scales are created equal Nominal Scale Ø Simply places in a specific category Ø Often dichotomous like sex/gender Ø Can be multiple categories like race, religion Ø Does not provide quantitative information even if you use numbers for categories Catholic=1, Baptist=2, Buddhist=3, Atheist=4 Does not mean Catholic is worth less than Atheist

29 Not all scales are created equal Ordinal Scale Ø Places in a position that is ordered in some way Typically from low to high Ø Percentile Rank Ø Letter Grades Ø One score clearly better or higher, but only within that group And provides no information about the group No information about how closely 2 scores are related

30 100 students, 100 questions If scores evenly distributed (1 person for each possible score from 1-100) then Ø 10th percentile=11 Ø 25th percentle=26 Ø 50th percentile=51 Ø 60th percentle=60 Ø 75th percentile=75 Ø 90th percentile=

31 100 students, 100 questions Now suppose they still range from 1-100, but, 20 scored 1, 20 scored 10, 20 scored 25, 10 scored 50, 20 scored 75 and 10 scored 100 Ø 10th percentile=1 Ø 25th percentle=10 Ø 50th percentile=25 Ø 60th percentle=35 Ø 75th percentile=75 Ø 90th percentile=

32 100 students, 100 questions Now suppose they still range from 1-100, but, 1 person scored 1, 1 person scored 100 and the rest were a typical class room distribution Ø 10th percentile=71 Ø 25th percentle=78 Ø 50th percentile=84 Ø 60th percentle=86 Ø 75th percentile=89 Ø 90th percentile=

33 100 students, 100 questions So depending on the distribution of our 100 scores, always ranging from we get Ø 10th percentile= Ø 25th percentle= Ø 50th percentile= Ø 60th percentle= Ø 75th percentile= Ø 90th percentile= Ø To be able to say something about differences in magnitude, you need to go to the next level

34 Not all scales are created equal Interval Scales Ø Scores are arranged from low to high but with equal intervals Ø 5 pt difference in scores is the same whether it is between scores at the bottom, in the middle or at the top Ø Can be a direct measurement Temperature Ø Usually a conversion of raw to standard score of some kind Ø IQ, SAT/ACT Ø Allows statistical manipulation Mean, standard deviation, etc.

35 Not all scales are created equal SAT Math scores are interval scales Ø Brenda 800 Ø Matt 600 Ø Jim 400 Ø Matt is as much better at math than Jim, as Brenda is than Matt. Ø Brenda is twice as good at math as Jim.

36 Not all scales are created equal Ratio Scales Ø Not only do you have equal intervals, but you also have an absolute (meaningful) zero point Ø speed, weight Ø Generally meaningless in educational testing Does a 0 on a history test mean you have no knowledge of history? Ø Allows you to actually say one score is twice as good as another

37

38 Not all scales are created equal For educational testing, scales will almost always be either ordinal or interval Ø Often they provide both ACT Math=19, 47 th percentile

39 A score by any other name. Raw scores are generally meaningless Ø Becky got 87 questions right probably good if there are 100 questions Probably not so good if there are 200 questions Two major ways to make score meaningful Ø Norm-referenced scores Ø Criterion-reference scores

40 Norm-referenced If I want to know how well some one can do something relative to her/his peers, I want to use norm-referenced reporting Many standardized tests are norm-referenced Ø SAT, ACT, GRE Grading on the curve means grade based on comparison with rest of class (norm-referenced) Ø 80% might be a B, an A, a C or something else. Most psychological tests are norm-referenced Ø MMPI-2, WAIS-IV, WJ-III

41 Norm-referenced Scales are typically interval but also report percentile rank To evaluate norm-referenced scores you need to exam the group it was normed on Ø Size bigger is better Ø Characteristics Does the norm group match the group you want to use it with (Age, gender, ethnicity, SES, etc.) If not, is there research to support the use with your group Ø Are they up-to-date? Older they get, the more you need to question them

42 Criterion-referenced If you want to know if someone has a minimal level of skill or knowledge, then you need to use criterionreferenced scores Class room grading is generally criterion based Ø 90% right=a, 80%=B, 70%=C, etc. Certification exams are often criterion-referenced Ø Proctor certification, licensing exams Ø Typically reported as percentage correct or P/F For Criterion-referenced the type of scale is often not important (you either reach a cut point or you don t)

43 Criterion-referenced To evaluate criterion-referenced score you need to examine how the cut-off was determined Ø Who, what, when, where And how it was validated Ø Was it? Ø Who, what, when, where

44 Establishing cut scores Steps in setting cut-scores for Placement test Ø Have faculty agree upon a list of skills necessary for success in the course Ø Have faculty review test content/questions and decide for each one if a student who will be successful in the course should be able to answer the question Ø Administer test to enrolled students at beginning of semester and compare against their final grades Ø Decide on the cut-score to use Ø Use it for placement and then revaluate the appropriateness of the placement Excerpted from College Board webinar on setting cut scores for ACCUPLACER

45 Cut Scores Maximize success Pre Calc A B C D F False negative True negative True positive False Positive ACCUPLACER Elem Alg

46 Cut Scores PreCalc A B C D F Maximize opportunity False negative True negative True positive False Positive ACCUPLACER Elem Alg

47 Cut Scores Effectiveness = True Positive + True Negative True Pos+False Pos+True Neg+False Neg Have to weigh effectiveness against cost

48 Yeah, but what does my score mean?

49 Yeah, but what does my score mean? Start with the test Ø What is it supposed to measure? ACT Test scores reflect what students have learned throughout high school SAT designed to assess your academic readiness for college.what you know and how well you can apply that knowledge ACCUPLACER allows educational institutions to address the challenges of accurate placement and remediation CLEP tests mastery of college-level material in specific areas

50 Yeah, but what does my score mean? Start with the test Ø Is it reliable? Valid? CLEP Intro Psychology r=0.89, SEM=2.89 Content developed by college faculty to reflect content of Intro Psych classes Ø What are characteristics of norm group/how are criterion points determined CLEP Standard setting panel of faculty establish performance needed to reflect a grade of C and B

51 Yeah, but what does my score mean? Consider characteristics of test taker Ø Do they match norm group/group test designed to be used with local vs. national, cultural considerations Ø Are the results consistent with other information GPA, other tests Ø What is her/his goal? Does it make sense without considering test result

52 Yeah, but what does my score mean? Consider the score Ø Generally speaking, Average is about the 25 th to the 75 th percentile If norms are general population, lower average may be pretty weak compared to a college student Local ACT/SAT norms where all HS juniors take test National norms for all college bound, lower average might be typical for the local tech school Ø Cut score should be considered barely enough to meet criteria 50 on CLEP Spanish test may give credit for 101 but should probably do some review before tackle 102

53 Yeah, but what does my score mean? Remember a test result is just a sample of the students performance Ø Poor performance could be due to lots of things Ø Generally, can be sure it tells you the minimum of what they are capable of, but maybe not the maximum

54 Examples of Uses of Test Scores and Related Data that Should Be Avoided: Using test scores as the sole basis for important decisions affecting the lives of individuals, when other information of equal or greater relevance and the resources for using such information are available. Using minimum test scores without proper validation. Making decisions about otherwise qualified students based only on small differences in test scores. Using scores without appropriate consideration to their validity. Providing inadequate or misleading information about the importance of test scores in making judgments or decisions. Requiring or recommending that certain tests be taken when the scores are not used or are used to a negligible extent. Failing to recognize differences in admission standards and requirements that may exist among different schools or departments within many institutions when providing information to fprospective applicants. --From Guidelines on Uses of College Board Test Scores and Data end

55

56

57

58

59

60

61

62 Steve Saladin, Ph.D. University of Idaho Moscow, ID

DATA COLLECTION AND ANALYSIS

DATA COLLECTION AND ANALYSIS DATA COLLECTION AND ANALYSIS Quality Education for Minorities (QEM) Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. August 23, 2013 Objectives of the Discussion 2 Discuss

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

Standardized Tests, Intelligence & IQ, and Standardized Scores

Standardized Tests, Intelligence & IQ, and Standardized Scores Standardized Tests, Intelligence & IQ, and Standardized Scores Alphabet Soup!! ACT s SAT s ITBS GRE s WISC-IV WAIS-IV WRAT MCAT LSAT IMA RAT Uses/Functions of Standardized Tests Selection and Placement

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Assessment and Placement: October 21, 2015 Edward A. Morante, Ed.D.

Assessment and Placement: October 21, 2015 Edward A. Morante, Ed.D. Assessment and Placement: October 21, 2015 Edward A. Morante, Ed.D. Assessment : Different Meanings +1. Assessment for basic skills - advising and placement into entering courses 2. Classroom Assessment

More information

Interpreting and Using SAT Scores

Interpreting and Using SAT Scores Interpreting and Using SAT Scores Evaluating Student Performance Use the tables in this section to compare a student s performance on SAT Program tests with the performance of groups of students. These

More information

The test uses age norms (national) and grade norms (national) to calculate scores and compare students of the same age or grade.

The test uses age norms (national) and grade norms (national) to calculate scores and compare students of the same age or grade. Reading the CogAT Report for Parents The CogAT Test measures the level and pattern of cognitive development of a student compared to age mates and grade mates. These general reasoning abilities, which

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

A BRIEF GUIDE TO SELECTING AND USING PRE-POST ASSESSMENTS

A BRIEF GUIDE TO SELECTING AND USING PRE-POST ASSESSMENTS A BRIEF GUIDE TO SELECTING AND USING PRE-POST ASSESSMENTS Prepared by the National Evaluation and Technical Assistance Center for the Education of Children and Youth who are Neglected, Delinquent, and

More information

Creating Pathways to Math Success at a Liberal Arts College. Maria Belk and Lauren Rose Bard College

Creating Pathways to Math Success at a Liberal Arts College. Maria Belk and Lauren Rose Bard College Creating Pathways to Math Success at a Liberal Arts College Maria Belk and Lauren Rose Bard College Bard College Small Liberal Arts college About 2000 students All students required to take a math or computer

More information

Assessment, Case Conceptualization, Diagnosis, and Treatment Planning Overview

Assessment, Case Conceptualization, Diagnosis, and Treatment Planning Overview Assessment, Case Conceptualization, Diagnosis, and Treatment Planning Overview The abilities to gather and interpret information, apply counseling and developmental theories, understand diagnostic frameworks,

More information

Glossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Battery Bias

Glossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Battery Bias Glossary of Terms Ability A defined domain of cognitive, perceptual, psychomotor, or physical functioning. Accommodation A change in the content, format, and/or administration of a selection procedure

More information

A s h o r t g u i d e t o s ta n d A r d i s e d t e s t s

A s h o r t g u i d e t o s ta n d A r d i s e d t e s t s A short guide to standardised tests Copyright 2013 GL Assessment Limited Published by GL Assessment Limited 389 Chiswick High Road, 9th Floor East, London W4 4AL www.gl-assessment.co.uk GL Assessment is

More information

X = T + E. Reliability. Reliability. Classical Test Theory 7/18/2012. Refers to the consistency or stability of scores

X = T + E. Reliability. Reliability. Classical Test Theory 7/18/2012. Refers to the consistency or stability of scores Reliability It is the user who must take responsibility for determining whether or not scores are sufficiently trustworthy to justify anticipated uses and interpretations. (AERA et al., 1999) Reliability

More information

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.

Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity. Guided Reading Educational Research: Competencies for Analysis and Applications 9th Edition EDFS 635: Educational Research Chapter 1: Introduction to Educational Research 1. List and briefly describe the

More information

Important Steps to a Career in Higher Education (MA/MS or or PhD)

Important Steps to a Career in Higher Education (MA/MS or or PhD) Important Steps to a Career in Higher Education (MA/MS or or PhD) 1. Take the right courses Courses to take as an undergraduate, besides typical PSY courses (Intro, Research Methods, Stats): PSY courses

More information

Wide Range Achievement Test 4 (WRAT4)

Wide Range Achievement Test 4 (WRAT4) Wide Range Achievement Test 4 (WRAT4) Description of the WRAT4 The WRAT4 is a norm-referenced test that measures the basic academic skills of word reading, sentence comprehension, spelling, and math computation.

More information

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots

Correlational Research. Correlational Research. Stephen E. Brock, Ph.D., NCSP EDS 250. Descriptive Research 1. Correlational Research: Scatter Plots Correlational Research Stephen E. Brock, Ph.D., NCSP California State University, Sacramento 1 Correlational Research A quantitative methodology used to determine whether, and to what degree, a relationship

More information

College Entrance Tests SRJC, PSAT, SAT, ACT, CLEP, AP

College Entrance Tests SRJC, PSAT, SAT, ACT, CLEP, AP College Entrance Tests SRJC, PSAT, SAT, ACT, CLEP, AP College Entrance Tests SRJC : Math and English Placement tests which can be taken more than once. These indicate which level Math/English students

More information

RESEARCH METHODS IN I/O PSYCHOLOGY

RESEARCH METHODS IN I/O PSYCHOLOGY RESEARCH METHODS IN I/O PSYCHOLOGY Objectives Understand Empirical Research Cycle Knowledge of Research Methods Conceptual Understanding of Basic Statistics PSYC 353 11A rsch methods 01/17/11 [Arthur]

More information

Use of Placement Tests in College Classes

Use of Placement Tests in College Classes By Andrew Morgan This paper was completed and submitted in partial fulfillment of the Master Teacher Program, a 2-year faculty professional development program conducted by the Center for Teaching Excellence,

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

MATH PLACEMENT & PREREQUISITE GUIDE

MATH PLACEMENT & PREREQUISITE GUIDE MATH PLACEMENT & PREREQUISITE GUIDE WHAT YOU SHOULD KNOW! As part of the University of Wyoming s University Studies program (USP), all undergraduate students seeking a degree from the University of Wyoming

More information

Z - Scores. Why is this Important?

Z - Scores. Why is this Important? Z - Scores Why is this Important? How do you compare apples and oranges? Are you as good a student of French as you are in Physics? How many people did better than you on a test? How many did worse? Are

More information

How NCTQ scores the Selection Criteria Standard

How NCTQ scores the Selection Criteria Standard How NCTQ scores the Selection Criteria Standard Standard and indicators Data used to score this standard Evaluation of elementary, secondary and special education preparation programs on Standard 1: Selection

More information

Important Steps to a Masters or PhD in Public Health

Important Steps to a Masters or PhD in Public Health Important Steps to a Masters or PhD in Public Health 1. Take the right courses Courses to take as an undergraduate, besides typical PSY courses (Intro, Research Methods, Stats): Courses related to public

More information

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade

Answer: C. The strength of a correlation does not change if units change by a linear transformation such as: Fahrenheit = 32 + (5/9) * Centigrade Statistics Quiz Correlation and Regression -- ANSWERS 1. Temperature and air pollution are known to be correlated. We collect data from two laboratories, in Boston and Montreal. Boston makes their measurements

More information

Interpreting GRE Scores: Reliability and Validity

Interpreting GRE Scores: Reliability and Validity Interpreting GRE Scores: Reliability and Validity Dr. Lesa Hoffman Department of Psychology University of Nebraska Lincoln All data from: GRE Guide to the Use of Scores 2010 2011 http://www.ets.org/s/gre/pdf/gre_guide.pdf

More information

Relating the ACT Indicator Understanding Complex Texts to College Course Grades

Relating the ACT Indicator Understanding Complex Texts to College Course Grades ACT Research & Policy Technical Brief 2016 Relating the ACT Indicator Understanding Complex Texts to College Course Grades Jeff Allen, PhD; Brad Bolender; Yu Fang, PhD; Dongmei Li, PhD; and Tony Thompson,

More information

Understanding Types of Assessment Within an RTI Framework

Understanding Types of Assessment Within an RTI Framework Understanding Types of Assessment Within an RTI Framework Slide 1: Welcome to the webinar Understanding Types of Assessment Within an RTI Framework. This is one of 11 webinars that we have developed at

More information

How To Get A College Degree For Less Money

How To Get A College Degree For Less Money Saving Money on College Costs Top 5 Tips for Saving Money on College Costs The high cost of college is front page news, along with the question about whether or not a college degree is worth it. The overwhelming

More information

RESEARCH METHODS IN I/O PSYCHOLOGY

RESEARCH METHODS IN I/O PSYCHOLOGY RESEARCH METHODS IN I/O PSYCHOLOGY Objectives Understand Empirical Research Cycle Knowledge of Research Methods Conceptual Understanding of Basic Statistics PSYC 353 11A rsch methods 09/01/11 [Arthur]

More information

Important Steps to Becoming an MA/MS or PhD in Educational Psychology. 2. It is critical to do well in your courses

Important Steps to Becoming an MA/MS or PhD in Educational Psychology. 2. It is critical to do well in your courses Important Steps to Becoming an MA/MS or PhD in Educational Psychology 1. Take the right courses Courses to take as an undergraduate, besides typical PSY courses (Intro, Research Methods, Stats): Any Education

More information

Chapter 3 Psychometrics: Reliability & Validity

Chapter 3 Psychometrics: Reliability & Validity Chapter 3 Psychometrics: Reliability & Validity 45 Chapter 3 Psychometrics: Reliability & Validity The purpose of classroom assessment in a physical, virtual, or blended classroom is to measure (i.e.,

More information

Important Steps to Becoming an MA/MS or PhD in Engineering Psychology and Human Factors

Important Steps to Becoming an MA/MS or PhD in Engineering Psychology and Human Factors Important Steps to Becoming an MA/MS or PhD in Engineering Psychology and Human Factors 1. Take the right courses Courses to take as an undergraduate, besides typical PSY courses (Intro, Research Methods,

More information

WHAT IS PSEO? PSEO is an opportunity to promote rigorous academic pursuits and to provide a wider variety of options to high school students.

WHAT IS PSEO? PSEO is an opportunity to promote rigorous academic pursuits and to provide a wider variety of options to high school students. WHAT IS PSEO? PSEO is an opportunity to promote rigorous academic pursuits and to provide a wider variety of options to high school students. 9 th 12 th grade students earn college credit AND/OR high school

More information

NOT AN OFFICIAL SCORE REPORT. Summary of Results

NOT AN OFFICIAL SCORE REPORT. Summary of Results From SAT TEST MARCH 8, 214 Summary of Results Page 1 of 1 Congratulations on taking the SAT Reasoning Test! You re showing colleges that you are serious about getting an education. The SAT is one indicator

More information

Reliability and validity, the topics of this and the next chapter, are twins and

Reliability and validity, the topics of this and the next chapter, are twins and Research Skills for Psychology Majors: Everything You Need to Know to Get Started Reliability Reliability and validity, the topics of this and the next chapter, are twins and cannot be completely separated.

More information

Levels of measurement in psychological research:

Levels of measurement in psychological research: Research Skills: Levels of Measurement. Graham Hole, February 2011 Page 1 Levels of measurement in psychological research: Psychology is a science. As such it generally involves objective measurement of

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

SOST 201 September 18-20, 2006. Measurement of Variables 2

SOST 201 September 18-20, 2006. Measurement of Variables 2 1 Social Studies 201 September 18-20, 2006 Measurement of variables See text, chapter 3, pp. 61-86. These notes and Chapter 3 of the text examine ways of measuring variables in order to describe members

More information

160-4-2-.38 EDUCATION PROGRAM FOR GIFTED STUDENTS.

160-4-2-.38 EDUCATION PROGRAM FOR GIFTED STUDENTS. Code: IDDD 160-4-2-.38 EDUCATION PROGRAM FOR GIFTED STUDENTS. (1) DEFINITIONS. (a) Differentiated Curriculum - courses of study in which the content, teaching strategies, and expectations of student mastery

More information

Office of Institutional Research & Planning

Office of Institutional Research & Planning NECC Northern Essex Community College NECC College Math Tutoring Center Results Spring 2011 The College Math Tutoring Center at Northern Essex Community College opened its doors to students in the Spring

More information

Basic Concepts in Research and Data Analysis

Basic Concepts in Research and Data Analysis Basic Concepts in Research and Data Analysis Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...3 The Research Question... 3 The Hypothesis... 4 Defining the

More information

Studying for Kaplan Nursing School Entrance Exam. Kaplan Nursing School Entrance Exam study guide may be purchased online from these stores:

Studying for Kaplan Nursing School Entrance Exam. Kaplan Nursing School Entrance Exam study guide may be purchased online from these stores: Studying for Kaplan Nursing School Entrance Exam Kaplan Nursing School Entrance Exam study guide may be purchased online from these stores: See next page for an ADDITION to the study guide Kaplan and Lippincott,

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

What High School Curricular Experience Tells Us About College Success *****

What High School Curricular Experience Tells Us About College Success ***** What High School Curricular Experience Tells Us About College Success ***** Serge Herzog, PhD Director, Institutional Analysis Consultant, CRDA Statlab University of Nevada, Reno Reno, NV 89557 Serge@unr.edu

More information

South Dakota. Opportunity Scholarship. Frequently Asked Questions

South Dakota. Opportunity Scholarship. Frequently Asked Questions South Dakota Opportunity Scholarship Frequently Asked Questions Question: What is the difference between a Regents Scholar Diploma and the South Dakota Opportunity Scholarship? Answer: The Regents Scholar

More information

Diagnostics. Revealing Potential. Expanding Opportunity.

Diagnostics. Revealing Potential. Expanding Opportunity. ACCUPLACER Diagnostics Revealing Potential. Expanding Opportunity. Identifying students academic strengths and weaknesses in preparation for college is more critical than ever. Revealing Potential Identifying

More information

RADIOGRAPHY TECHNOLOGY PROGRAM. Application Packet Fall 2014

RADIOGRAPHY TECHNOLOGY PROGRAM. Application Packet Fall 2014 RADIOGRAPHY TECHNOLOGY PROGRAM Application Packet Fall 2014 Application deadline is, 2014 North Idaho College Radiography Technology Program Application Checklist This sheet is provided for you to ensure

More information

Test of Mathematical Abilities Third Edition (TOMA-3) Virginia Brown, Mary Cronin, and Diane Bryant. Technical Characteristics

Test of Mathematical Abilities Third Edition (TOMA-3) Virginia Brown, Mary Cronin, and Diane Bryant. Technical Characteristics Test of Mathematical Abilities Third Edition (TOMA-3) Virginia Brown, Mary Cronin, and Diane Bryant Technical Characteristics The Test of Mathematical Abilities, Third Edition (TOMA-3; Brown, Cronin, &

More information

Measurement. How are variables measured?

Measurement. How are variables measured? Measurement Y520 Strategies for Educational Inquiry Robert S Michael Measurement-1 How are variables measured? First, variables are defined by conceptual definitions (constructs) that explain the concept

More information

EFRT 305 Human Development and Learning

EFRT 305 Human Development and Learning EFRT 305 Human Development and Learning Required Textbooks: Slavin, Robert E., (2010). Educational psychology: Theory and practice. 9 th Ed. ISBN: 9780137034352 EMSP Registration: This course has a field

More information

If your schedule allows you might consider joining ROTC.

If your schedule allows you might consider joining ROTC. Important Steps to Becoming an MA/MS or PhD in Military Psychology 1. Take the right courses Courses to take as an undergraduate, besides typical PSY courses (Intro, Research Methods, Stats): Personality

More information

STUDENT PERFORMANCE DATA DOCUMENT

STUDENT PERFORMANCE DATA DOCUMENT STUDENT PERFORMANCE DATA DOCUMENT Assessment Quality: To measure to progress towards Intended Learning Outcomes, DaVinci Academy has used an array of summative assessments that are aligned with the Utah

More information

Graduate Admissions for a More Diverse Student Body Fall 2011

Graduate Admissions for a More Diverse Student Body Fall 2011 Graduate Admissions for a More Diverse Student Body Fall 2011 Joseph L. Brown, PhD Stanford University School of Humanities & Sciences Acceptable Practices for Graduate Admissions Consider all applicants

More information

Parents Guide Cognitive Abilities Test (CogAT)

Parents Guide Cognitive Abilities Test (CogAT) Grades 3 and 5 Parents Guide Cognitive Abilities Test (CogAT) The CogAT is a measure of a student s potential to succeed in school-related tasks. It is NOT a tool for measuring a student s intelligence

More information

Benchmark Assessment in Standards-Based Education:

Benchmark Assessment in Standards-Based Education: Research Paper Benchmark Assessment in : The Galileo K-12 Online Educational Management System by John Richard Bergan, Ph.D. John Robert Bergan, Ph.D. and Christine Guerrera Burnham, Ph.D. Submitted by:

More information

Test Bias. As we have seen, psychological tests can be well-conceived and well-constructed, but

Test Bias. As we have seen, psychological tests can be well-conceived and well-constructed, but Test Bias As we have seen, psychological tests can be well-conceived and well-constructed, but none are perfect. The reliability of test scores can be compromised by random measurement error (unsystematic

More information

Advanced Placement and College Credit Plus Options

Advanced Placement and College Credit Plus Options Advanced Placement and College Credit Plus Options Important information about CC+ changes CC+ Options through our partner schools Stark State Kent State University The University of Akron AP course offerings

More information

Welcome. Dual Credit 2015-2016

Welcome. Dual Credit 2015-2016 Welcome Dual Credit 2015-2016 1 Table of Contents Office of Dual Credit Contact Information... p. 3 What are the Cooperative Online Course Offerings for Spring of 2016? p. 4-7 What are the Traditional

More information

Basic Assessment Concepts for Teachers and School Administrators.

Basic Assessment Concepts for Teachers and School Administrators. ERIC Identifier: ED447201 Publication Date: 2000-11-00 Basic Assessment Concepts for Teachers and School Administrators. Author: McMillan, James H. ERIC/AE Digest. Source: ERIC Clearinghouse on Assessment

More information

Predicting the Performance of a First Year Graduate Student

Predicting the Performance of a First Year Graduate Student Predicting the Performance of a First Year Graduate Student Luís Francisco Aguiar Universidade do Minho - NIPE Abstract In this paper, I analyse, statistically, if GRE scores are a good predictor of the

More information

Technical Report. Overview. Revisions in this Edition. Four-Level Assessment Process

Technical Report. Overview. Revisions in this Edition. Four-Level Assessment Process Technical Report Overview The Clinical Evaluation of Language Fundamentals Fourth Edition (CELF 4) is an individually administered test for determining if a student (ages 5 through 21 years) has a language

More information

WATSON SCHOOL OF EDUCATION UNIVERSITY OF NORTH CAROLINA WILMINGTON

WATSON SCHOOL OF EDUCATION UNIVERSITY OF NORTH CAROLINA WILMINGTON WATSON SCHOOL OF EDUCATION UNIVERSITY OF NORTH CAROLINA WILMINGTON BACHELOR OF ARTS DEGREE IN ELEMENTARY EDUCATION Program Goals and Objectives The goal of the elementary education program at UNCW is to

More information

Important Steps to Becoming a Masters of Social Work (MSW)

Important Steps to Becoming a Masters of Social Work (MSW) Important Steps to Becoming a Masters of Social Work (MSW) 1. Take the right courses Courses to take as an undergraduate, besides typical PSY courses (Intro, Research Methods, Stats): Personality Clinical

More information

DATA COACHING WORKSHOP DAY 1 PRESENTED BY SARITA SIQUEIROS THORNBURG, JILL PATNODE & KRISSY SOLTMAN

DATA COACHING WORKSHOP DAY 1 PRESENTED BY SARITA SIQUEIROS THORNBURG, JILL PATNODE & KRISSY SOLTMAN DATA COACHING WORKSHOP DAY 1 PRESENTED BY SARITA SIQUEIROS THORNBURG, JILL PATNODE & KRISSY SOLTMAN January 28, 2014 Success for Each Child and Eliminate the Opportunity Gap Welcome and Introductions Overview

More information

APEX program evaluation study

APEX program evaluation study APEX program evaluation study Are online courses less rigorous than in the regular classroom? Chung Pham Senior Research Fellow Aaron Diel Research Analyst Department of Accountability Research and Evaluation,

More information

The new Verbal Reasoning and Quantitative Reasoning score scales:

The new Verbal Reasoning and Quantitative Reasoning score scales: Better by Design The new Verbal Reasoning and Quantitative Reasoning score scales: A helpful overview of what you need to know Inside: An in-depth look at the new 130 170 score scales Guidance on making

More information

The Influence of a Summer Bridge Program on College Adjustment and Success: The Importance of Early Intervention and Creating a Sense of Community

The Influence of a Summer Bridge Program on College Adjustment and Success: The Importance of Early Intervention and Creating a Sense of Community The Influence of a Summer Bridge Program on College Adjustment and Success: The Importance of Early Intervention and Creating a Sense of Community Michele J. Hansen, Ph.D., Director of Assessment, University

More information

The Official Study Guide

The Official Study Guide The Praxis Series ebooks The Official Study Guide Middle School Mathematics Test Code: 5169 Study Topics Practice Questions Directly from the Test Makers Test-Taking Strategies www.ets.org/praxis Study

More information

Welcome to Basic Math Skills!

Welcome to Basic Math Skills! Basic Math Skills Welcome to Basic Math Skills! Most students find the math sections to be the most difficult. Basic Math Skills was designed to give you a refresher on the basics of math. There are lots

More information

Test Reliability Indicates More than Just Consistency

Test Reliability Indicates More than Just Consistency Assessment Brief 015.03 Test Indicates More than Just Consistency by Dr. Timothy Vansickle April 015 Introduction is the extent to which an experiment, test, or measuring procedure yields the same results

More information

Defining and Measuring College Success

Defining and Measuring College Success 1 Defining and Measuring College Success Wayne Camara Krista Mattern Andrew Wiley The College Board Research and Development February 24, 2009 2 Remediation and Post Secondary Success 41.4% of post secondary

More information

GLOSSARY of Assessment Terms

GLOSSARY of Assessment Terms GLOSSARY of Assessment Terms STATE COLLABORATIVE ON ASSESSMENT AND STUDENT STANDARDS ARTS EDUCATION ASSESSMENT CONSORTIUM (SCASS/ARTS) COUNCIL OF CHIEF STATE SCHOOL OFFICERS This glossary was developed

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,

More information

Napa Valley College Fall 2015 Math 106-67528: College Algebra (Prerequisite: Math 94/Intermediate Alg.)

Napa Valley College Fall 2015 Math 106-67528: College Algebra (Prerequisite: Math 94/Intermediate Alg.) 1 Napa Valley College Fall 2015 Math 106-67528: College Algebra (Prerequisite: Math 94/Intermediate Alg.) Room 1204 Instructor: Yolanda Woods Office: Bldg. 1000 Rm. 1031R Phone: 707-256-7757 M-Th 9:30-10:35

More information

Midterm Review Problems

Midterm Review Problems Midterm Review Problems October 19, 2013 1. Consider the following research title: Cooperation among nursery school children under two types of instruction. In this study, what is the independent variable?

More information

GRADUATE SCHOOL APPLICATION PROCESS:

GRADUATE SCHOOL APPLICATION PROCESS: GRADUATE SCHOOL APPLICATION PROCESS: Inside this document: Statement of Purpose or Personal Statement Letters of Recommendation Transcripts and Supplementary Materials Graduate Admission Exams Interviews

More information

Mathematics Placement

Mathematics Placement Mathematics Placement Students will be permitted to enroll in mathematics classes based on a prerequisite course, their VSU Math Index (VMI), or a mathematics placement exam as follows: Course Prerequisites

More information

Student Success Challenges in Three Areas: Developmental Education, Online Learning, and the Structure of the Student Experience

Student Success Challenges in Three Areas: Developmental Education, Online Learning, and the Structure of the Student Experience Student Success Challenges in Three Areas: Developmental Education, Online Learning, and the Structure of the Student Experience Shanna Smith Jaggars Community College Research Center Teachers College/Columbia

More information

Administrative Decision Making in Early Childhood Education:

Administrative Decision Making in Early Childhood Education: Administrative Decision Making in Early Childhood Education: HOW DO I USE THE REPORTS AVAILABLE IN ONLINE REPORTER? Assessment Technology, Incorporated Written by: Jason K. Feld, Ph.D. Assessment Technology,

More information

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple

More information

SAMPLE EDUCATIONAL EVALUATION

SAMPLE EDUCATIONAL EVALUATION SAMPLE EDUCATIONAL EVALUATION Name: School: Home School Date of Birth: 08/26/2002 Teacher: Mother Doe Age: 7 years, 8 months Grade: 1.8 Sex: Male ID: 123-456-7890 Date of Testing: 04/20/2010 Examiner:

More information

SCHOOL PSYCHOLOGY PROGRAM DESCRIPTION

SCHOOL PSYCHOLOGY PROGRAM DESCRIPTION Wichita State University COUNSELING, EDUCATIONAL LEADERSHIP, EDUCATIONAL AND SCHOOL PSYCHOLOGY SCHOOL PSYCHOLOGY PROGRAM DESCRIPTION DESCRIPTION The Department of Counseling, Educational Leadership, Educational

More information

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there

CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there CORRELATIONAL ANALYSIS: PEARSON S r Purpose of correlational analysis The purpose of performing a correlational analysis: To discover whether there is a relationship between variables, To find out the

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Statistics Review PSY379

Statistics Review PSY379 Statistics Review PSY379 Basic concepts Measurement scales Populations vs. samples Continuous vs. discrete variable Independent vs. dependent variable Descriptive vs. inferential stats Common analyses

More information

Career and College Promise

Career and College Promise Career College Promise Career College Promise offers North Carolina high school students a clear path to success in college or in a career. Eligible high school students can begin earning college credit

More information

Melbourne Central Catholic High School HONORS DIPLOMA APPLICATION Completed application due to Guidance by: March 13, 2015 *This application includes

Melbourne Central Catholic High School HONORS DIPLOMA APPLICATION Completed application due to Guidance by: March 13, 2015 *This application includes Melbourne Central Catholic High School HONORS DIPLOMA APPLICATION Completed application due to Guidance by: March 13, 2015 *This application includes Honors Diploma and FIT Partshership. Honors Diploma

More information

Early Childhood Education Application

Early Childhood Education Application Fall Application Due by March 30 th Spring Application Due by October 30 th Early Childhood Education Application Teacher Candidate Applicant Information Full Name: Address: Last First Middle Name Street

More information

Welcome. Dual Credit 2016-2017

Welcome. Dual Credit 2016-2017 Welcome Dual Credit 2016-2017 1 Table of Contents Office of Dual Credit Contact Information... p. 3 What are the Course Offerings for Summer of 2016?. p. 4-5 What are the Dual Enrollment (cooperative online)

More information

The correlation coefficient

The correlation coefficient The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

More information

TITLE 135 PROCEDURAL RULE WEST VIRGINIA COUNCIL FOR COMMUNITY AND TECHNICAL COLLEGE EDUCATION SERIES 21 FRESHMAN ASSESSMENT AND PLACEMENT STANDARDS

TITLE 135 PROCEDURAL RULE WEST VIRGINIA COUNCIL FOR COMMUNITY AND TECHNICAL COLLEGE EDUCATION SERIES 21 FRESHMAN ASSESSMENT AND PLACEMENT STANDARDS TITLE 135 PROCEDURAL RULE WEST VIRGINIA COUNCIL FOR COMMUNITY AND TECHNICAL COLLEGE EDUCATION SERIES 21 FRESHMAN ASSESSMENT AND PLACEMENT STANDARDS 135-21-1. General. 1.1. Scope. This policy establishes

More information

Evaluating a Retest Policy at a North American University

Evaluating a Retest Policy at a North American University Journal of the National College Testing Association 2015/Volume 1/Issue 1 Evaluating a Retest Policy at a North American University CINDY L. JAMES, PhD Thompson Rivers University Retesting for admission

More information

Secondly, this study was peer reviewed, as I have mentioned, by other top experts in the testing and measurement community before it was released.

Secondly, this study was peer reviewed, as I have mentioned, by other top experts in the testing and measurement community before it was released. HOME SCHOOLING WORKS Pass it on! Online Press Conference March 23, 1999, 12:00pm EST A transcript of the opening remarks by Michael Farris, Esq. & Lawrence M. Rudner, Ph.D. Michael Farris: Good morning.

More information

College Readiness Initiatives

College Readiness Initiatives California State University, Fullerton College Readiness Initiatives EAP, ERWC/SMI, EPT/ELM, & Early Start Presented By: The Early Assessment Program Larry Rivera Chapa, MBA Early Assessment Program Coordinator

More information

Study Guide for the Middle School Science Test

Study Guide for the Middle School Science Test Study Guide for the Middle School Science Test A PUBLICATION OF ETS Copyright 2008 by Educational Testing Service EDUCATIONAL TESTING SERVICE, ETS, the ETS logo, LISTENING., LEARNING., LEADING., and GRE,

More information

Woodcock Reading Mastery Tests Revised, Normative Update (WRMT-Rnu) The normative update of the Woodcock Reading Mastery Tests Revised (Woodcock,

Woodcock Reading Mastery Tests Revised, Normative Update (WRMT-Rnu) The normative update of the Woodcock Reading Mastery Tests Revised (Woodcock, Woodcock Reading Mastery Tests Revised, Normative Update (WRMT-Rnu) The normative update of the Woodcock Reading Mastery Tests Revised (Woodcock, 1998) is a battery of six individually administered tests

More information

Using a Mental Measurements Yearbook Review to Evaluate a Test

Using a Mental Measurements Yearbook Review to Evaluate a Test Using a Mental Measurements Yearbook Review to Evaluate a Test Anthony J. Nitko Professor Emeritus of Psychology in Education, University of Pittsburgh Adjunct Professor of Educational Psychology, University

More information