Chapter. Understanding Measurement. Chapter. Outline. Key Terms
|
|
|
- Sharon Andrews
- 9 years ago
- Views:
Transcription
1 Chapter Chapter 10 Understanding Measurement Outline 10-1 Introduction 10-2 The Theory of Measurement 10-3 Levels of Measurement 10-3a Nominal Level of Measurement 10-3b Ordinal Level of Measurement 10-3c Metric Level of Measurement 10-4 Measurement Validity 10-4a Types of Validity 10-4b Some Concluding Comments about Measurement Validity 10-5 Measurement Reliability 10-5a Threats to Measurement Reliability 10-5b Enhancing Measurement Validity and Reliability Chapter Summary Chapter Quiz Suggested Readings Key Terms composite index content validity continuous variable discrete variable discriminate validity Guttman scale index interval measurement levels of measurement Likert scale measurement metric measurement nominal measurement ordinal measurement predictive validity ratio measurement reliability reproducibility scales split-half method test-retest method threats to measurement reliability triangulation validity `181
2 182 Chapter Introduction In this chapter we expand our discussion in Chapter 9 about variables. Remember, a variable is a characteristic or property that differs in value from one unit of analysis to another. Variables are concepts that you operationalize, or measure, in a sample of data. In short, a variable is a measured concept. Consider the following: Concept Measurement Variable Your goal is to determine how much of a characteristic each unit of analysis possesses and why. We use measurement to develop and use some instrument to assign numbers to the possible characteristics so we can attain our goal. This process will allow you to apply the empirical methods of scientific research that we discuss in subsequent chapters. An understanding of this chapter will enable you to 1. Understand the theory of measurement. 2. Define measurement. 3. Differentiate between measurement validity and measurement reliability. 4. Identify ways to establish measurement validity and measurement reliability. 5. Identify threats to measurement reliability. 6. Identify the levels of measurement. measurement: The use of a tool to assign numbers to some phenomenon that we want to analyze and compare with existing criteria The Theory of Measurement Paul D. Leedy writes that measurement is the quantifying of any phenomenon, substantial or insubstantial, and involves a comparison with a standard (Leedy 2001, 24). His definition presents several problems, however. For example, what does quantify mean? What is a substantial or insubstantial phenomenon? How do we determine standards for comparison? Leedy s definition requires us to assign numbers to a concrete observation or a concept, so that we can compare it with some evaluation tool such as a ruler or standardized test. Simply put, then, measurement is the use of a tool to assign numbers to some phenomenon that we want to analyze and compare with existing criteria. To aid in the assignment of numbers to our chosen research topic, we will need to use some type of measurement instrument. For our purposes, examples might include standardized tests, surveys, secondary data, and scales and indices. Before we discuss measurement theory and measurement assumptions, consider the following quote: [W]e moderns, it seems, attempt to measure everything...we evaluate performance by measurement... What is not measurable we strive to render measurable, and what we cannot; we dismiss it from our thoughts and justify our neglect by assigning it the status of the less important.... A moment s reflection, however, is all that is needed to realize that measurement cannot possibly do everything we expect it to do...omitting from our considerations what cannot be measured, or what we do not know how to measure, often leads to irrelevance and even error. (Katzner 1991, 18.) In short, assigning numbers to political phenomena is not the sole way to analyze political phenomena. Nor does it ensure that you can make reasonable inferences about the phenomena. In other words, measurement is simply an additional way to find answers to our questions.
3 Understanding Measurement 183 Measurement theory assumes that a concept representing the phenomenon of concern exists but cannot be directly measured. Educational attainment, level of political participation, and extent of political equality throughout the world, for example, are theoretical concepts that we cannot measure directly. You can, however, measure them indirectly through variables specified by operational definitions. Consider the following examples: 1. Educational attainment: You might measure educational attainment by examining students grade point averages, their scores on standardized achievement tests, or their years in school. 2. Political participation: You might measure political participation by determining a citizen s voting record, involvement in political campaigns, and attendance at city council meetings. 3. Political equality: You might measure a nation s level of political equality by examining the number of regularly scheduled elections, nominating devices, registration requirements, and the proportion of voting turnout to qualified voters. Note the similarities in these examples. First, each uses several variables to measure a single concept. Educational achievement, for example, consists of grade point averages and standardized test scores. Second, you can assign numbers to each indicator. To measure political participation, you can count the number of times an individual votes, the number of hours he or she spends campaigning, and the number of times he or she attends a city council meeting Levels of Measurement In some cases measurement is simpler because the researcher is working with real numbers; for example, the percentage of voter turnout, the gross national product of nations, and the number of revolutions throughout the world. In these cases you can calculate averages, percentages, and various measures of deviation. You can also rank, or order, units according to their values. If we are ranking American states according to their voting turnout, for example, there is no question about which states have the largest and the smallest. We simply compare the values. Some measures, however, may not be as simple as other measures because they lack numerical precision. Consider, for example, race. When you assign a 1 to African Americans, a 2 to Latinos, and a 3 to Anglos, you do not imply any particular ordering among those classes. The numbers are only convenient labels for each category. How do you measure or calculate the average of these groups? You cannot say that one citizen has more race than another citizen. Anglos do not have more race than African Americans. In measurement theory, you can use numbers several ways. At times you use them as labels for categories of variables. On other occasions you use them to rank or order categories of variables. Last, you can use them to specify the interval, or distance, between categories of variables. Thus, there are different levels of measurement. It is important that you understand each level because the various analytic methods and statistics we discuss in subsequent chapters apply to specific levels of measurement. 10-3a Nominal Level of Measurement Nominal measurement merely involves the assignment of numeric labels to the categories of a variable. While computerized statistical programs are amazing, they require us to assign numbers to variables to facilitate statistical analyses. Some levels of measurement: The extent to which typical numbers describe characteristics of a variable. We distinguish nominal, ordinal, interval, and ratio levels. You can use a greater number of statistics and statistical methods with the higher levels of measurement. nominal measurement: A measure for which different scores represent different, but not ordered, categories.
4 184 Chapter 10 packages can do limited mathematical machinations with symbols other than numbers, but they are more effective and have more meaning for you when they deal with numbers. Race, as discussed, is an example of a nominal-level measure. Each time the program counts a 1, it is also counting an African American. There are many other examples of nominal measures applicable to political research. Gender, political party affiliation, nationality, and college major are common examples that you will probably use in your studies. While nominal measures are quite simple to use, they also have some shortcomings. Nominal measures are the weakest or least precise level of measurement. You do not have the ability to state how much of a trait or characteristic is possessed by an object or event. In addition, you cannot even determine whether the variable has more or less of the characteristic. Nominal measures lack any sense of relative size or magnitude; they only allow you to say that the classes of a variable are different. There is no mathematical relationship between the classes. ordinal measurement: A measure for which the scores represent ordered categories (e.g., from the strongest to the weakest ) that are not necessarily equally distant from each other. 10-3b Ordinal Level of Measurement Ordinal measurement derives from the ordinal numbers such as first, second, third, and so on. Similar to nominal measures, ordinal measures allow you to classify categories of variables. They also, however, enable you to rank the characteristics of variables based on the values you assign. You may, for example, assign categories such as strongly support to strongly oppose. In a hierarchal sense individuals included in the strongly support category will favor something more than individuals in the strongly oppose category. It is not possible, however, to specify how much more or less support. Thus, while the numbers indicate a rank ordering of cases, they do not indicate the exact distances, or intervals, between the units. In an election, for example, the order of finish of the candidates does not tell us anything about the number of votes each person received. The order of finish tells you only that the winner received more votes than the other contenders, but not how many more votes. As with nominal variables, ordinal measures merely assign numeric labels to the categories of a variable. There are, however, certain rules you should follow when measuring or coding, ordinal variables. Frankfort-Nachmias and Nachmias offer the following when assigning numbers for variables that can be ranked (Frankfort-Nachmias and Nachmias 2000, ). First, assigned numbers should make intuitive sense. Higher scores, for example, should be assigned higher code numbers. Second, the coding categories must be mutually exclusive. That is, each unit of analysis should fit into one and only one category. Consider the following measurement scheme for a respondent s level of income: 1) 0 $20,000 2) $20,000 $40,000 3) $40,000 $60,000 4) $60,000 $80,000 The example violates the mutually exclusive rule. If the respondent earns $20,000, which category does he or she identify as their income category? Do they belong to category 1 or do they belong to category 2? Third, the coding scheme must be exhaustive. This means that every response must fit into a category. Looking at the income example just presented, you can readily see that anyone earning more than $80,000 does not have a category representing their income level. Thus, perhaps an additional category could be coded as Greater than $80,000. Last, categories must be specific enough to capture differences using the smallest possible number of categories. Frankfort-Nachmias and Nachmias call
5 Understanding Measurement 185 this requirement the criterion of detail. In other words, while you want to ensure you meet the criteria, you do not want to have too many categories for a particular variable. For the income example in this section, you would not want to code income as under $1,000; $1,000 to $2,000; $2,001 to $3,000; and so on. 10-3c Metric Level of Measurement Metric measurement is more precise than either nominal or ordinal measurement. Numbers do not just stand for categories, as in nominal and ordinal measurements. There are two types of metric measurement: interval measurement and ratio measurement. Interval and ratio measurements are very similar. With each level, the values assigned to the classes of a variable have meaning. Thus, we can rank the classes of a variable so that the distance between those classes is exact and constant, and we can calculate differences between observations. In Michael Corbett s words,... each level assigns real numbers to observations and each level has equal intervals of measurement (Corbett 2001, 48 49). There is a major difference, however, between the two types of metric measurement. Interval measurement does not have an absolute zero point. The zero point (if there is one) is an arbitrary point. The zero point does not imply a complete absence of the variable being measured. Political scientists do not use interval data to a great extent. A good example of an interval variable is the Fahrenheit thermometer. The thermometer is divided into different intervals or degrees of heat. We can calculate the difference in temperature. But the zero point does not indicate an absence of temperature. Someone living in Nome, Alaska, will quickly tell you that thirty degrees below zero is colder than zero degrees. With ratio measures, zero is the lowest possible value. You cannot earn less than zero dollars. No one is less than zero years of age. No one gets less than zero votes. It makes sense to say that someone who received 10,000 votes got twice as many as someone who garnered 5,000 votes. Ratio-level data is reported in the ten-year census. More specific examples include a state s population, the number of senior citizens living in cities, and the number of African Americans living in a state. All percentages and proportions are ratio because we start out with ratio measurements to derive them. When collecting your data, you should try to measure concepts at the highest level possible. This will permit enhanced mathematical manipulation and more sophisticated statistical analysis. In political science, however, many measures associated with surveys are of the nominal (race, gender, party affiliation) or ordinal variety (extent of agreement with policy statements). So what do you do? You can use less sophisticated statistical methods, or you can transform your data so that it takes on the qualities of a higher level of measurement. Suppose, for example, a researcher uses the questions (nominal measures) shown in Table 10-1 in a sample of adults in an effort to get an idea about the ideology of her subjects. The researcher constructs her questions so that the responses coded with a 1 are liberal responses and the responses coded with a 0 are conservative responses. As you can see, she arbitrarily coded the possible responses. She just as easily could have coded conservative responses with a 0 and liberal responses with a 1. The numbers are merely labels used to differentiate the responses and enhance computer input and subsequent analysis. Finally, our researcher will use the individual responses, which are nominal measures, into a higher level of measurement by creating an index that would measure the ideology of her subjects. She would compute the overall political metric measurement: The level of measurement that includes interval- and ratio-level variables. It allows for use of the most precise measuring instruments. interval measurement: A measure for which a one-unit difference in scores is the same throughout the range of the measure. Interval measures do not have an absolute zero point. In other words, zero does not indicate a complete absence of the concept that was measured (Fahrenheit thermometer). ratio measurement: A measure for which the scores possess the full mathematical properties of the assigned numbers. Ratio measures have an absolute zero point. Zero means a complete absence of the concept that was measured.
6 186 Chapter 10 Table 10-1 Illustration of Data Transformation 1. The United States Supreme Court has ruled that no state or local government may require the reading of the Lord s Prayer or Bible verses in public schools. What are your views on this do you approve or disapprove of the Court s ruling? 1) Approve 0) Disapprove 2. Are you for or against preferential hiring and promotion of blacks? 1) For 0) Against 3. Do you support a woman s right to an abortion if she wants it for any reason? 1) Support 0) Oppose 4. What are your feelings about government regulation of the economy? 1) Not enough 0) Too much 5. When a person has a disease that cannot be cured, do you think doctors should be allowed by law to end the patient s life by some painless means if the patient and his family request it? 1) Yes 0) No ideology score for each person by adding up the responses to each question. Thus, each person has a score between 0 and 5. The closer a respondent s score is to 5, the more liberal the respondent. You can also see that the new variable is a higher level of measurement (ratio) than any of the individual questions. continuous variable: A variable that, in principle, can take on any value within its range of possible values. For example, the actual time it takes to run a race (sixteen minutes, five seconds, and so on). discrete variable: A variable that can have only certain values within its range. The size of one s family is an example of a discrete variable. validity: The effectiveness of the measuring instrumnent and the extent that the instrument reflects the actual activity or behavior one wants to study. Continuous and Discrete Variables In addition to identifying levels of measurement for variables, it is useful to distinguish between continuous and discrete variables. In principle, a continuous variable can take on any value in the range of possible values (Fox 1998, 15). Ratio variables such as per capita incomes and percent of the population living below the poverty level are continuous variables. On the other hand, a discrete variable takes on only certain values within its range (Fox 1998, 16). Ratio variables such as number of college hours completed and number of points scored in a basketball game are discrete variables. Have you ever seen a final basketball score of 95.5 to 90.3? Nominal-level variables are always discrete. Ratio-level variables, as you can see with our examples, can be continuous or discrete. Some also make the argument that ordinal-level variables can be continuous or discrete (Fox 1998, 16). While this may be true, the majority of ordinal-level variables are best handled as discrete variables when analyzing their properties. We have spent this time differentiating between continuous and discrete variables because, just as with levels of measurement, the continuous versus discrete distinction matters. You will find that some statistical techniques are more appropriate for discrete variables while others are more appropriate for continuous variables Measurement Validity Assigning numbers to your research concepts may sound easy. Alas, measurement is not quite so simple. How do you know you are really measuring what you want to measure? Can you measure political participation solely by analyzing voting turnout? Measurement validity is concerned with the effectiveness of the measuring instrument and the extent that the instrument reflects the actual activity or behavior one wants to study. We say that a measurement tool or variable is a valid measure of a concept if it is an accurate representation of the concept it is intended to measure.
7 Understanding Measurement a Types of Validity There are several types of measurement validity. Content validity deals with the ability of a measuring tool to truly tap the information we seek. Suppose your research project seeks to examine the relationship between unemployment and voting in presidential elections. There are tools that, over time, have demonstrated content validity when determining voting choice in presidential elections. How do we measure unemployment, however? Does data from the unemployment rolls accurately measure unemployment? What about those unemployed individuals who are not on the rolls because they have exhausted their entitlements? Sole use of this data can lead to content validity problems. Thus, you would need to use other measurement tools, such as surveys, to complement the unemployment rolls. Discriminate validity answers the question Does the tool allow the concept to be distinguished from similar concepts? Using achievement scores on standardized tests, for example, may lack discriminate validity if the tests have some cultural bias. That is, are you measuring academic achievement or the consequences of inequity in educational funding? Predictive validity means you can use a tool to predict a specified outcome. If scores on a civil service exam accurately predict on-the-job performance, the exam has predictive validity. If income, education, and occupation measurements in a voting district consistently explain voting turnout, they have predictive validity. 10-4b Some Concluding Comments about Measurement Validity The types of validity we discussed in Section 10-4a are ways to address the need to establish measurement validity. They allow you to argue that a variable is a valid measure. While they may not guarantee that the variable is a valid measure of the concept in question, you do have measurement criteria that can withstand critique if you have considered them when selecting your variables. In Section 10-5b, Enhancing Measurement Validity and Reliability, we discuss some other ways to enhance the validity of your measurement instrument Measurement Reliability A variable has reliability if it consistently assigns the same numbers to a phenomenon. For example, if we measure a neighborhood s perception of police effectiveness twice and obtain the same results, then we consider the indicator to be reliable. Or if two or more people use an instrument and arrive at the same results, then we say that the instrument is reliable. In sum, an instrument is reliable if the same results are consistently obtained despite different settings, different persons applying the measurement, or any factors other than variation in the concept being measured. 10-5a Threats to Measurement Reliability There are several potential problems of measurement that threaten the reliability of measurement tools. You must consider each of these threats to measurement reliability in your research efforts. First, your measure should not rely on the judgment of the measurer or a respondent in a survey. If it does, we say that the measure is subjective. The following question is a good example of a subjective measure: What is your opinion about the quality of life in your hometown? This question requires a subjective response. What does quality of life mean? Several content validity: The ability to demonstrate that a measure of a concept can be used in an analysis by showing that it covers the full theoretical domain of the concept. discriminate validity: A measurement that allows one to distinguish a concept from similar concepts. predictive validity: The effectiveness of the measuring instrument to forecast a specified outcome. The effectiveness, for example, of a civil service exam to accurately forecast job performance. reliability: The degree to which measures yield the same results when applied by different researchers to the same units under the same circumstances (the consistency of a measurement tool). threats to measurement reliability: Possible occurrences that could detract from the reliability of a measure. History and regression artifacts are examples.
8 188 Chapter 10 respondents may have different perceptions about quality of life. As such, their responses will be based on their perceptions. To address this problem, you could gather data others have accepted as measurements of quality of life. For example, median education, median income level, number of city parks, the unemployment rate, and statistics that depict the level of crime in the area. Inexperienced interviewers and misleading questions also detract from the reliability of a measurement tool. One way to control these problems is to test the instrument before you use it in the study. Testing involves administering the survey to several persons and analyzing and correcting any deficiencies that might occur. Another way to prevent these problems is to train interviewers and ensure they understand the instrument and its purpose. The respondent can also contribute to the unreliability of the instrument. For example, the respondent may be careless when completing the questionnaire. In addition, the respondent may falsify responses to some questions. You can control these threats by testing the instrument and convincing respondents that you will ensure the privacy of their responses. You might also offer to give them a synopsis of your research effort upon its completion. Finally, data input errors can affect reliability. Despite computer sophistication, human error is a given in most research situations. We used to control for this possibility by reviewing the input or by having two data processors input the data. Fortunately, many modern data analysis packages have edit procedures that will notify you when you input erroneous data. Before we show you some ways to enhance the validity and reliability of your measurement instrument, we want you to know that while a reliable measure may not be valid, a valid measure will be reliable, because if it accurately measures the concept in question, then it stands to reason that it will do so consistently. Thus, it is more important to demonstrate validity than reliability (Johnson et al. 2001, 92). For example, consider a bathroom scale that always weighs someone ten pounds light. The scale is a reliable measurement tool. It will always weigh you ten pounds lighter than your true weight. As such, to determine your true weight you must add ten pounds to the weight displayed on the scale. The scale, however, is not valid. It is not measuring your true weight. It is erroneous by ten pounds. 10-5b Enhancing Measurement Validity and Reliability Unfortunately, in the real world of research we have no way to guarantee the validity and reliability of measurement. Proof that your instrument is valid is especially difficult to obtain. But we try to do the best we can. In establishing validity, your measurement tool should have one or more of the types of validity we discussed in Section 10-4a. For example, it should have content validity in that it measures what you want to measure. Or it should have predictive validity so that you can predict a specified outcome when using your instrument. Ultimately, however, you must use your judgment to determine the validity of the chosen instrument. A reliable instrument is stable, dependable, and consistent in measurement. We assume that a respondent s score on some measure is very close to the respondent s actual position on the measured concept. We say close because, as discussed, there are several obstacles to reliability that can lead to some error in the results. Unlike validity, you can objectively determine a measure s reliability. There are a number of methods you can use to estimate the reliability of a particular measure.
9 Understanding Measurement 189 Test-Retest Method With this method, you administer the measurement instrument to the same group more than once. Then you examine the two sets of measures. The higher the relationship is between the sets of measures, the more reliable the instrument. The test-retest method has two limitations. First, a second application of the measurement may influence the scores. There may not be a high correlation between scores because the respondents have become familiar with the tool and its purpose. You can partially compensate for this problem by changing the order of your questions and possible responses. Second, scores may change because the respondents actual attitudes have changed as a function of time and socialization. The first problem results from the unreliability of your instrument. The second problem, however, does not mean your instrument is unreliable. It only gives the appearance of an unreliable measurement. After all, you have measured a change in the respondents attitudes. Split-Half Method The split-half method of estimating reliability requires you to divide your original scale into two or more subscales (see our discussion about scales and indices below). You then administer each subscale to a group and determine the average difference among the scores. This average difference helps you determine the reliability of your instrument. If the scores on one subscale deviate from the scores on the other subscale by an average of 5 percent, the split-half score is.95 (1.0.05). A score of.90 or higher is considered acceptable evidence of a reliable scale (Cole 1996, 134). Triangulation Triangulation is an attempt to enhance the reliability and validity of measurement by using multiple and overlapping measurement strategies. There are several types of triangulation. Data triangulation involves the use of several data sources relative to the concept. For example, if you want to evaluate the effectiveness of public transit systems, you might use the following data sources to gather information for analysis: surveys of mass transit users and nonusers; surveys of public officials, transport authorities, and bus drivers; customer complaint files; and accident reports. Data triangulation enhances the validity and reliability of findings because it taps a variety of information sources. Investigator triangulation involves the use of multiple observers for the same research activity. It reduces potential bias that might come from a single observer. Examples include the use of several interviewers, analysts, and decision makers. Methodological triangulation combines two or more information collection methods in the study of a single concept. It uses the strengths of various methods. For example, you might use surveys to gather information about a phenomenon. To complement this method, you might discretely observe and chart the activities of your subjects. This method can compensate for the possible bias that could result from interviews and surveys. Scales and Indices Indices and scales are similar to each other. You create an index when you assign scores based on the combined response of several related questions. You use scales to empirically demonstrate a hierarchical ranking of items. An index is a crude form of scaling because you do not rank the items in the index. Scales, therefore, are more precise measures than indices. In addition, scales involve the principle of split-half method: Calculating reliability by comparing the results of two equivalent measures made at the same time. triangulation: The use of several observers, data collection techniques, or sources of data in an effort to enhance the reliability and validity of a research effort. scales: Combined measures used to operationalize abstract concepts such as racial prejudice, which cannot be adequately measured by a single indicator. index: A multi-item measure in which individual scores on a set of items are combined to form a summary measure.
10 190 Chapter 10 unidimensionality, which implies that the items comprising the scale reflect a single dimension or concept. For the most part, you use scales and indices to measure attitudes and knowledge of a particular subject; for example, ideological attitudes, self-esteem concepts, knowledge about the U.S. Constitution, and knowledge about Third World economies. Indices and scales require you to use several questions to measure a concept. Think of these tools as tests. Your professor does not ask you a single question to determine your comprehension about a subject such as comparative politics. Likewise, you should not ask a single question to determine one s knowledge about a particular phenomenon or attitude about political parties or public policy. A scale or index measuring political participation, for example, might involve questions about a person s voting habits, whether they contribute to campaigns, the extent of their communication with elected officials, whether they attend political rallies or meetings, and whether they run for office. Several questions will increase the reliability of your measurement tool by reducing possible error. They will also enhance the possibility that you are measuring what you intended to measure. composite index: An index developed by using several items (questions) to measure complex concepts. Although somewhat crude, it is an efficient way to summarize information and enhance the validity of an analysis. Likert scale: A multi-item measure in which the items are selected based on their ability to discriminate between those scoring high and those scoring low on the measure. They are not, however, cumulative scales. The Composite Index There are several types of indices or scales we can use. The composite index is the most basic. As with all scales and indices, it uses several questions to measure an attitude or perception. Table 10-1 shows an example of a composite index. You can use this type of an index as an independent or dependent variable. Normally, however, the index is the dependent variable. There are several advantages to using a composite index. First, it is simple to construct. Second, as discussed, the index results in a higher level of measurement. This allows you to use the measure with more sophisticated analytical methods. Third, it is an efficient way to summarize information. Fourth, you can use the Cronbach Alpha statistic to evaluate the internal consistency, or reliability, of the index items. Alpha gives you an idea of how well the index items fit together. Alpha can range from zero, or no reliability, to 1, or perfect reliability. An Alpha of.70 is an acceptable level of internal consistency (Frankfort-Nachmias and Nachmias 2000, 425). This type of index, however, is also somewhat crude. It is difficult to know how to weigh the various components used in the index. Are all questions equal in describing the concept? Excluding the minimum and maximum scores, how do you interpret the other scores of the index? In addition, some criticize the use of composite indices because respondents may fall into a response-set pattern. They select the same responses for each item without thoroughly considering each question. This problem could be addressed by counterbalancing the responses. That is, sometimes a 1 would be a liberal response, and sometimes it would be a conservative response. Of course this would require the researcher to recode responses to derive a respondent s final score. Last, while there are statistics that measure the extent to which the individual index items relate to each other, item selection for the scale is somewhat subjective. These disadvantages could negatively impact the validity of the measurement tool. Therefore, more elaborate types of indices are preferred. Likert Scales Likert scales are particularly useful in measuring people s attitudes. They differ from indices in that not every individual item score is used to calculate the final score. To design a Likert scale, you need to take several steps. First, you need to
11 Understanding Measurement 191 compile several possible scale items that make up your survey questions. You do this by compiling a series of items that express a wide range of attitudes, from extremely negative to extremely positive. You may ask several questions about the media s impact on political socialization, for example. Second, you need to assign numbers to the possible responses. Most Likert scales use a scheme similar to the following: 1) = Strongly disagree 2) = Disagree 3) = Undecided 4) = Agree 5) = Strongly agree Next, you need to administer the survey to a random sample of respondents. You do this for several reasons. First, you want to test the reliability of your scale. You can accomplish this task by using the test-retest method of estimating reliability or by using the split-half technique. Second, you want to compute a total score for each respondent. For example, suppose that a respondent strongly agreed with three statements and agreed with two other statements. If you used the scale in the table just presented, the respondent s score would be 23. We use the total scores to help us determine the discriminative power (DP) of the scale items. Remember we said that Likert scales differ from indices in that not every individual item score is used to calculate the final score. The DP of an item allows us to readily distinguish those items we want to include in our final scale. The DP enables us to separate those scoring high on an attribute from those scoring low on an attribute in our attitude continuum. We retain those items that allow us to discriminate most readily as a part of the final scale. One way to determine the DP of our scale is to use item analysis. This method requires us to compare each individual item to the total scale score. If individuals score high on one item but low on the entire scale, then that one item is not measuring the same thing as the other items. Thus, the item should be dropped. Let s consider our political ideology example again. Only this time, let s use a Likert response scheme to determine a respondent s attitude about the questions. Note that this measurement scheme necessitates a rewording of the questions. Table 10-2 presents an illustration of Likert scaling. One scale item in Table 10-2 asks respondents to show their level of agreement with this statement: The economy is improving. Suppose a respondent strongly agrees and scores a 5 for this item. According to the scale, higher scores reflect a test-retest method: A method to calculate reliability by repeating the same measure at two or more points in time. Table 10-2 Illustration of Likert Scaling Do you strongly disagree, disagree, agree, strongly agree, or are you undecided about the following statements? 1. The United States Supreme Court has ruled that no state or local government may require the reading of the Lord s Prayer or Bible verses in public schools. The Court s decision was correct. 1) Strongly disagree 2) Disagree 3) Undecided 4) Agree 5) Strongly agree 2. Government efforts to implement the preferential hiring and promotion of blacks in the workplace is an important government action. 1) Strongly disagree 2) Disagree 3) Undecided 4) Agree 5) Strongly agree 3. A woman has the right to an abortion if she wants it for any reason. 1) Strongly disagree 2) Disagree 3) Undecided 4) Agree 5) Strongly agree 4. The economy is improving. 1) Strongly disagree 2) Disagree 3) Undecided 4) Agree 5) Strongly agree 5. When a person has a disease that cannot be cured, doctors should be allowed by law to end the patient s life by some painless means if the patient and his family request it. 1) Strongly disagree 2) Disagree 3) Undecided 4) Agree 5) Strongly agree
12 192 Chapter 10 Guttman scale: A multi-item measure in which respondents are presented with increasingly difficult measures of approval for an attitude. Guttman scales are unidimensional and cumulative. liberal ideology. The respondent s total scale score based on other questions, however, was 9 out of a possible maximum score of 25. The overall score of 9 indicates a more conservative stance. So what should you do? If you find that other respondents responded similarly, perhaps you should eliminate the economy question. It does not correlate with the total scale score. Thus, it lacks discriminative power. It may not be measuring the concept of ideology. Common sense tells us this conclusion is correct. Whether the economy is recovering or not has little to do with ideology. Likert scales have some obvious advantages. They are relatively easy to administer, they provide a more rational basis for item selection, and they provide a range of alternative responses to each question. Several scholars, however, have criticized Likert scales. As with composite indices, the problem of the response-set pattern is possible. In addition, the scale relies on the selection of extreme items. Thus, the scale may not be able to satisfactorily differentiate between more moderate respondents. Also, there is no empirical way to determine whether the items finally selected to make up the scale do, in fact, measure the concept of interest (Cole 1996, 125). Therefore, some prefer to use the more precise Guttman scale. Guttman Scales Guttman scales have several characteristics. First, they incorporate an empirical test of unidimensionality. They measure only a single dimension or attitude. Second, Guttman scales are cumulative. Potential scale items are ordered according to the degree of difficulty associated with responding positively to each item. The technique, however, assumes that respondents who answer positively to a difficult item will also respond positively to less difficult items. As a result of the ordering process, Guttman scales, unlike Likert scales, generally yield scale scores resulting from a single set of responses. That is, to get a 20 on the ideological perception scale, a particular pattern of responses is essential. In a Likert scale, different patterns of responses can yield the same scale score. Because the Guttman scale is more complex than other scales, let s take time to construct one. Consider the following hypothetical Scale of Religious Activity constructed from questions asked in the National Opinion Research Center General Social Survey (NORC GSS), currently directed by James A. Davis and Tom W. Smith of the University of Chicago. In the past month, did you spend time (check all that apply). 1. visiting a stranger s home to talk about religion? 2. attending weekly prayer groups or Bible studies? 3. shopping for religious items? 4. attending Sunday church services? 5. praying in your home? If we use the coding methods associated with a composite index or Likert scale, a scale of religious activity could be constructed by summing each individual s response to the preceding questions. Consequently, scores could range from 0 (no religious activity) to 5 (for those participating in each activity). Excluding the minimum and maximum scores, how do you interpret the other scores of the index? That is, scores of 1, 2, 3, and 4 could be achieved through different combinations of responses. In addition, we cannot tell if similar scores measure similar or different dimensions of religious activity. For example, one could score a 2 in several ways. Is one who attends Sunday services and spends time praying as religiously active as one who visits a stranger s home and attends prayer groups? Thus, you can see that a score of 2 could denote different degrees of religious activism.
13 Understanding Measurement 193 Table 10-3 Illustration of Unidimensionality More Difficult Less Difficult Respondent Stranger Prayer Group Shopping Services Pray Score 1 yes yes yes yes yes 5 2 no yes yes yes yes 4 3 no no yes yes yes 3 4 no no no yes yes 2 5 no no no no yes 1 6 no no no no no 0 Table 10-3 illustrates the concept of unidimensionality, a benefit of Guttman scales. The Guttman scaling technique starts by ordering potential items according to the degree of difficulty or effort assumed to be associated with responding positively to a question. Let s assume that we asked six individuals to respond to our survey. Table 10-3 represents a distribution of responses that is perfectly unidimensional. Though such an outcome is unlikely in practice, it provides the essential Guttman baseline that is then compared to the actual responses in your survey. It ranks the items on the single underlying dimension of religious activity. In addition, the scale is cumulative in that none of the respondents has a disagreement response before an agreement response, or vice versa. If you examine the table closely, you will see that information on the position of any respondent s last positive response allows the prediction of all of her responses to the other scale items. For example, if an individual is willing to visit a stranger s home to talk about religion, she would be willing to attend prayer group meetings, shop for religious items, attend religious services, and pray in their home. In addition, with a perfectly unidimensional scale, if you know an individual s total religious activity score, you can accurately predict their response to each subsequent scale item. Knowing that Respondent 4 received a score of 2 also enables you to know which activities she is willing to undertake (praying in her home and attending Sunday services). You also know which activities she does not undertake. Thus, you are able to reproduce each individual s responses to each question because you know each individual s total score. Table 10-3 is an example of 100 percent reproducibility. In the real research world, however, this seldom occurs. Reproducibility is the extent to which you can replicate the total response pattern on a set of scaled items by knowing only the total score. In actuality, you will probably have a number of responses that deviate from the expected pattern. For example, if Respondent 5 responded yes to attending Sunday services but no to praying in his home, a deviation from the expected unidimensionality has occurred. Hence, it is necessary to establish a criterion for evaluating the unidimensionality and cumulativeness of the scale. We do this by determining the ratio of error responses to the total number of possible responses. This ratio is known as the coefficient of reproducibility. The coefficient of reproducibility (CR) measures the degree of conformity to a perfect scalable pattern such as the one we have in Table We calculate CR as follows: CR = 1 Number of inconsistencies Total number of responses (Number of cases times number of scale items) Frankfort-Nachmias and Nachmias wrote that the coefficient you obtain should be.9 or greater to be an acceptable scale (Frankfort-Nachmias and Nachmias 2000, 427). reproducibility The extent to which you can replicate the total response pattern on a set of scaled items by knowing only the total score.
14 194 Chapter 10 Table 10-4 Illustration of Non-Unidimensionality More Difficult Less Difficult Respondent Stranger Prayer Group Shopping Services Pray Score 1 yes yes yes yes yes 5 2 no yes no yes yes 3 3 no no no yes yes 2 4 no no yes no yes 2 5 no no yes no yes 2 6 no no no no no 0 Total Number of Inconsistencies: 3. Total Number of Responses: 30. CR = 1 3/30 =.90. Table 10-4 presents an illustration of non-unidimensionality. In other words, it depicts some inconsistencies in the responses. An examination of Table 10-4 shows the pattern of true responses you might obtain in actual research. Those who agreed with the more difficult questions also agreed with the less difficult ones. Responses to the question about shopping for religious items, however, do not fit the pattern. Respondent 2 agreed with a more difficult question but did not agree with the shopping question. Respondents 4 and 5, on the other hand, did not agree with a less difficult question (attending Sunday church services) and agreed with the shopping question. Therefore, the question about shopping for religious items does not seem to fit the pattern. You should remove it from the scale because it does not measure the concept of religious activity. Once the question is removed, the pattern as depicted in Table 10-5 evolves. Notice that the pattern is unidimensional. Also take a look at Table 10-5 to see how the scale can be revised to meet the criterion of unidimensionality. Table 10-5 Illustration of Revised Scale More Difficult Less Difficult Respondent Stranger Prayer Group Services Pray Score 1 yes yes yes yes 4 2 no yes yes yes 3 3 no no yes yes 2 4 no no no yes 1 5 no no no yes 1 6 no no no no 0 Total Number of Inconsistencies: 0.
15 Understanding Measurement 195 Summary Chapter Summary In this chapter we expanded our discussion in Section 9-2b of Chapter 9 about operational definitions and variables by concentrating on the subject of measurement. We said that you use measurement to answer questions about voting turnout, the governmental structure of other nations, and why people revolt. In addition, we discussed the theory of measurement while giving you a working definition of measurement. We also spent considerable time differentiating between measurement validity and measurement reliability. Our discussion introduced terms such as content validity, discriminate validity, and predictive validity. We also gave you some ways, such as triangulation and the use of indices and scales, to establish measurement validity and measurement reliability. Last, we discussed the different levels of measurement and their importance to the research process. Quiz Chapter Quiz 1. The General Social Survey asked respondents to assess their own health as excellent, good, fair, or poor. The level of measurement of this variable is a. nominal. b. ordinal. c. interval. d. ratio. 2. A student conducted a survey that asked respondents to identify their race as white, African American, or Other. The level of measurement of this variable is a. nominal. b. ordinal. c. interval. d. ratio. 3. AVGTEMP is a variable included in a data set of America s fifty largest cities. The variable represents the average annual temperature (in Fahrenheit) of each city. The level of measurement of this variable is a. nominal. b. ordinal. c. interval. d. ratio. 4. Suppose that a researcher conducting a survey based on a sample of government workers asks respondents their annual incomes using these values: $20,000 or less; $20,000 through $60,000; $60,000 or more. A problem with this set of values is that a. they are measured at the nominal level. b. they are not continuous. c. they are population data. d. they are not collectively exhaustive. e. they are not mutually exclusive. 5. scales incorporate an empirical test of unidimensionality and are cumulative. a. Likert b. Guttman c. Composite d. Simple 6. refers to the extent to which a measurement procedure consistently measures whatever it measures. a. Unidimensionality b. Reliability c. Validity d. Correlation 7. There are two types of measurement: interval measurement and ratio measurement. a. metric b. normative c. unidimensional d. central tendency 8. Measurement is concerned with the effectiveness of the measuring instrument and the extent that the instrument reflects the actual activity or behavior one wants to study. a. unidimensionality b. reliability c. validity d. correlation 9. is an attempt to enhance the reliability and validity of measurement by using multiple and overlapping measurement strategies. a. Likert scaling b. Guttman scaling c. Triangulation d. Unidimensional scaling 10. A(n) variable takes on only certain values within its range. a. discrete b. antecedent c. continuous d. intervening
16 196 Chapter 10 Readings Suggested Readings Babbie, Earl, Fred Halley, and Jeanne Zaino. Adventures in Social Research. Thousand Oaks, CA: Pine Forge Press, Bernstein, Robert A. and James A. Dyer. An Introduction to Political Science Methods, 3rd ed. Englewood Cliffs, NJ: Prentice-Hall, Fox, William. Social Statistics, 3rd ed. Bellevue, WA: Micro- Case, Frankfort-Nachmias, Chava and David Nachmias. Research Methods in the Social Sciences, 6th ed. New York: Worth Publishers, Goldenberg, Sheldon. Thinking Methodologically.New York: HarperCollins, Johnson, Janet Buttolph, Richard A. Joslyn, and H. T. Reynolds. Political Science Research Methods, 4th ed. Washington, D.C.: Congressional Quarterly Press, Katzner, Donald. Our Mad Rush to Measure: How Did We Get There? Methodus 3 (2), 1991: Kay, Susan Ann. Introduction to the Analysis of Political Data. Englewood Cliffs, NJ: Prentice-Hall, Leedy, Paul D. and Jeanne Ellis Ormrod. Practical Research: Planning and Design, 7th ed. Upper Saddle River, NJ: Merrill Prentice Hall, Shively, W. Phillips. The Craft of Political Research, 3rd ed. Englewood Cliffs, NJ: Prentice-Hall, 1990.
Descriptive Statistics and Measurement Scales
Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample
DATA COLLECTION AND ANALYSIS
DATA COLLECTION AND ANALYSIS Quality Education for Minorities (QEM) Network HBCU-UP Fundamentals of Education Research Workshop Gerunda B. Hughes, Ph.D. August 23, 2013 Objectives of the Discussion 2 Discuss
Basic Concepts in Research and Data Analysis
Basic Concepts in Research and Data Analysis Introduction: A Common Language for Researchers...2 Steps to Follow When Conducting Research...3 The Research Question... 3 The Hypothesis... 4 Defining the
SOST 201 September 18-20, 2006. Measurement of Variables 2
1 Social Studies 201 September 18-20, 2006 Measurement of variables See text, chapter 3, pp. 61-86. These notes and Chapter 3 of the text examine ways of measuring variables in order to describe members
Measurement and Measurement Scales
Measurement and Measurement Scales Measurement is the foundation of any scientific investigation Everything we do begins with the measurement of whatever it is we want to study Definition: measurement
II. DISTRIBUTIONS distribution normal distribution. standard scores
Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,
Association Between Variables
Contents 11 Association Between Variables 767 11.1 Introduction............................ 767 11.1.1 Measure of Association................. 768 11.1.2 Chapter Summary.................... 769 11.2 Chi
a. Will the measure employed repeatedly on the same individuals yield similar results? (stability)
INTRODUCTION Sociologist James A. Quinn states that the tasks of scientific method are related directly or indirectly to the study of similarities of various kinds of objects or events. One of the tasks
Chapter 5 Conceptualization, Operationalization, and Measurement
Chapter 5 Conceptualization, Operationalization, and Measurement Chapter Outline Measuring anything that exists Conceptions, concepts, and reality Conceptions as constructs Conceptualization Indicators
Constructing a TpB Questionnaire: Conceptual and Methodological Considerations
Constructing a TpB Questionnaire: Conceptual and Methodological Considerations September, 2002 (Revised January, 2006) Icek Ajzen Brief Description of the Theory of Planned Behavior According to the theory
WHAT IS A JOURNAL CLUB?
WHAT IS A JOURNAL CLUB? With its September 2002 issue, the American Journal of Critical Care debuts a new feature, the AJCC Journal Club. Each issue of the journal will now feature an AJCC Journal Club
CALCULATIONS & STATISTICS
CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents
Measurement. How are variables measured?
Measurement Y520 Strategies for Educational Inquiry Robert S Michael Measurement-1 How are variables measured? First, variables are defined by conceptual definitions (constructs) that explain the concept
Guided Reading 9 th Edition. informed consent, protection from harm, deception, confidentiality, and anonymity.
Guided Reading Educational Research: Competencies for Analysis and Applications 9th Edition EDFS 635: Educational Research Chapter 1: Introduction to Educational Research 1. List and briefly describe the
Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS
Chapter Seven Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS Section : An introduction to multiple regression WHAT IS MULTIPLE REGRESSION? Multiple
Glossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Battery Bias
Glossary of Terms Ability A defined domain of cognitive, perceptual, psychomotor, or physical functioning. Accommodation A change in the content, format, and/or administration of a selection procedure
Elementary Statistics
Elementary Statistics Chapter 1 Dr. Ghamsary Page 1 Elementary Statistics M. Ghamsary, Ph.D. Chap 01 1 Elementary Statistics Chapter 1 Dr. Ghamsary Page 2 Statistics: Statistics is the science of collecting,
Introduction; Descriptive & Univariate Statistics
Introduction; Descriptive & Univariate Statistics I. KEY COCEPTS A. Population. Definitions:. The entire set of members in a group. EXAMPLES: All U.S. citizens; all otre Dame Students. 2. All values of
DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
Lecture 2: Types of Variables
2typesofvariables.pdf Michael Hallstone, Ph.D. [email protected] Lecture 2: Types of Variables Recap what we talked about last time Recall how we study social world using populations and samples. Recall
Introduction... 3. Qualitative Data Collection Methods... 7 In depth interviews... 7 Observation methods... 8 Document review... 8 Focus groups...
1 Table of Contents Introduction... 3 Quantitative Data Collection Methods... 4 Interviews... 4 Telephone interviews... 5 Face to face interviews... 5 Computer Assisted Personal Interviewing (CAPI)...
Session 7 Bivariate Data and Analysis
Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares
DOING YOUR BEST ON YOUR JOB INTERVIEW
CHECKLIST FOR PREPARING FOR THE INTERVIEW Read this pamphlet carefully. Make a list of your good points and think of concrete examples that demonstrate them. Practice answering the questions on page 6.
Descriptive Inferential. The First Measured Century. Statistics. Statistics. We will focus on two types of statistical applications
Introduction: Statistics, Data and Statistical Thinking The First Measured Century FREC 408 Dr. Tom Ilvento 213 Townsend Hall [email protected] http://www.udel.edu/frec/ilvento http://www.pbs.org/fmc/index.htm
Last May, philosopher Thomas Nagel reviewed a book by Michael Sandel titled
Fourth Quarter, 2006 Vol. 29, No. 4 Editor s Watch Sandel and Nagel on Abortion Last May, philosopher Thomas Nagel reviewed a book by Michael Sandel titled Public Philosophy in The New York Review of Books.
Research Methods & Experimental Design
Research Methods & Experimental Design 16.422 Human Supervisory Control April 2004 Research Methods Qualitative vs. quantitative Understanding the relationship between objectives (research question) and
Now, observe again the 10 digits we use to represent numbers. 0 1 2 3 4 5 6 7 8 9 Notice that not only is each digit different from every other
VARIABLES- NOMINAL, ORDINAL and INTERVAL/SCALE LEVELS OF MEASUREMENT Variables: traits or characteristics that vary from one individual, group, or society to another individual, group, or society. Examples:
RESEARCH METHODS IN I/O PSYCHOLOGY
RESEARCH METHODS IN I/O PSYCHOLOGY Objectives Understand Empirical Research Cycle Knowledge of Research Methods Conceptual Understanding of Basic Statistics PSYC 353 11A rsch methods 01/17/11 [Arthur]
Beef Demand: What is Driving the Market?
Beef Demand: What is Driving the Market? Ronald W. Ward Food and Economics Department University of Florida Demand is a term we here everyday. We know it is important but at the same time hard to explain.
Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS)
Mode and Patient-mix Adjustment of the CAHPS Hospital Survey (HCAHPS) April 30, 2008 Abstract A randomized Mode Experiment of 27,229 discharges from 45 hospitals was used to develop adjustments for the
Assessment, Case Conceptualization, Diagnosis, and Treatment Planning Overview
Assessment, Case Conceptualization, Diagnosis, and Treatment Planning Overview The abilities to gather and interpret information, apply counseling and developmental theories, understand diagnostic frameworks,
RESEARCH METHODS IN I/O PSYCHOLOGY
RESEARCH METHODS IN I/O PSYCHOLOGY Objectives Understand Empirical Research Cycle Knowledge of Research Methods Conceptual Understanding of Basic Statistics PSYC 353 11A rsch methods 09/01/11 [Arthur]
Chapter 4. Probability and Probability Distributions
Chapter 4. robability and robability Distributions Importance of Knowing robability To know whether a sample is not identical to the population from which it was selected, it is necessary to assess the
6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
Response to Critiques of Mortgage Discrimination and FHA Loan Performance
A Response to Comments Response to Critiques of Mortgage Discrimination and FHA Loan Performance James A. Berkovec Glenn B. Canner Stuart A. Gabriel Timothy H. Hannan Abstract This response discusses the
WHY STUDY PUBLIC FINANCE?
Solutions and Activities to CHAPTER 1 WHY STUDY PUBLIC FINANCE? Questions and Problems 1. Many states have language in their constitutions that requires the state to provide for an adequate level of education
Levels of measurement in psychological research:
Research Skills: Levels of Measurement. Graham Hole, February 2011 Page 1 Levels of measurement in psychological research: Psychology is a science. As such it generally involves objective measurement of
Prospect Theory Ayelet Gneezy & Nicholas Epley
Prospect Theory Ayelet Gneezy & Nicholas Epley Word Count: 2,486 Definition Prospect Theory is a psychological account that describes how people make decisions under conditions of uncertainty. These may
Measurement with Ratios
Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical
MATH 103/GRACEY PRACTICE QUIZ/CHAPTER 1. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
MATH 103/GRACEY PRACTICE QUIZ/CHAPTER 1 Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Use common sense to determine whether the given event
Fairfield Public Schools
Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity
Introduction to Quantitative Methods
Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................
How to Identify Real Needs WHAT A COMMUNITY NEEDS ASSESSMENT CAN DO FOR YOU:
How to Identify Real Needs Accurately assessing the situation in your community is important when making decisions about what ministries you will provide. Many projects fail because the people who planned
Statistics. Measurement. Scales of Measurement 7/18/2012
Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does
Correlation key concepts:
CORRELATION Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson s coefficient of correlation c) Spearman s Rank correlation coefficient d)
Means, standard deviations and. and standard errors
CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard
Descriptive Statistics
Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize
Test Bias. As we have seen, psychological tests can be well-conceived and well-constructed, but
Test Bias As we have seen, psychological tests can be well-conceived and well-constructed, but none are perfect. The reliability of test scores can be compromised by random measurement error (unsystematic
Problem of the Month: Fair Games
Problem of the Month: The Problems of the Month (POM) are used in a variety of ways to promote problem solving and to foster the first standard of mathematical practice from the Common Core State Standards:
Technical Report. Overview. Revisions in this Edition. Four-Level Assessment Process
Technical Report Overview The Clinical Evaluation of Language Fundamentals Fourth Edition (CELF 4) is an individually administered test for determining if a student (ages 5 through 21 years) has a language
Introduction to Hypothesis Testing
I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true
IMPLEMENTATION NOTE. Validating Risk Rating Systems at IRB Institutions
IMPLEMENTATION NOTE Subject: Category: Capital No: A-1 Date: January 2006 I. Introduction The term rating system comprises all of the methods, processes, controls, data collection and IT systems that support
Levels of Measurement. 1. Purely by the numbers numerical criteria 2. Theoretical considerations conceptual criteria
Levels of Measurement 1. Purely by the numbers numerical criteria 2. Theoretical considerations conceptual criteria Numerical Criteria 1. Nominal = different categories based on some kind of typology 2.
Module 3: Correlation and Covariance
Using Statistical Data to Make Decisions Module 3: Correlation and Covariance Tom Ilvento Dr. Mugdim Pašiƒ University of Delaware Sarajevo Graduate School of Business O ften our interest in data analysis
PURPOSE OF GRAPHS YOU ARE ABOUT TO BUILD. To explore for a relationship between the categories of two discrete variables
3 Stacked Bar Graph PURPOSE OF GRAPHS YOU ARE ABOUT TO BUILD To explore for a relationship between the categories of two discrete variables 3.1 Introduction to the Stacked Bar Graph «As with the simple
Welcome back to EDFR 6700. I m Jeff Oescher, and I ll be discussing quantitative research design with you for the next several lessons.
Welcome back to EDFR 6700. I m Jeff Oescher, and I ll be discussing quantitative research design with you for the next several lessons. I ll follow the text somewhat loosely, discussing some chapters out
CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction
CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous
INTERNATIONAL FRAMEWORK FOR ASSURANCE ENGAGEMENTS CONTENTS
INTERNATIONAL FOR ASSURANCE ENGAGEMENTS (Effective for assurance reports issued on or after January 1, 2005) CONTENTS Paragraph Introduction... 1 6 Definition and Objective of an Assurance Engagement...
FLORIDA: TRUMP WIDENS LEAD OVER RUBIO
Please attribute this information to: Monmouth University Poll West Long Branch, NJ 07764 www.monmouth.edu/polling Follow on Twitter: @MonmouthPoll Released: Monday, March 14, Contact: PATRICK MURRAY 732-979-6769
Sample Size and Power in Clinical Trials
Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance
Evaluation: Designs and Approaches
Evaluation: Designs and Approaches Publication Year: 2004 The choice of a design for an outcome evaluation is often influenced by the need to compromise between cost and certainty. Generally, the more
1.7 Graphs of Functions
64 Relations and Functions 1.7 Graphs of Functions In Section 1.4 we defined a function as a special type of relation; one in which each x-coordinate was matched with only one y-coordinate. We spent most
An Introduction to Secondary Data Analysis
1 An Introduction to Secondary Data Analysis What Are Secondary Data? In the fields of epidemiology and public health, the distinction between primary and secondary data depends on the relationship between
Validity, Fairness, and Testing
Validity, Fairness, and Testing Michael Kane Educational Testing Service Conference on Conversations on Validity Around the World Teachers College, New York March 2012 Unpublished Work Copyright 2010 by
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus
Facebook Friend Suggestion Eytan Daniyalzade and Tim Lipus 1. Introduction Facebook is a social networking website with an open platform that enables developers to extract and utilize user information
DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION
DATA ANALYSIS AND INTERPRETATION OF EMPLOYEES PERSPECTIVES ON HIGH ATTRITION Analysis is the key element of any research as it is the reliable way to test the hypotheses framed by the investigator. This
THE FIELD POLL. By Mark DiCamillo, Director, The Field Poll
THE FIELD POLL THE INDEPENDENT AND NON-PARTISAN SURVEY OF PUBLIC OPINION ESTABLISHED IN 1947 AS THE CALIFORNIA POLL BY MERVIN FIELD Field Research Corporation 601 California Street, Suite 210 San Francisco,
Statistics, Research, & SPSS: The Basics
Statistics, Research, & SPSS: The Basics SPSS (Statistical Package for the Social Sciences) is a software program that makes the calculation and presentation of statistics relatively easy. It is an incredibly
MAY 2004. Legal Risks of Applicant Selection and Assessment
MAY 2004 Legal Risks of Applicant Selection and Assessment 2 Legal Risks of Applicant Selection and Assessment Effective personnel screening and selection processes are an important first step toward ensuring
THE ACT INTEREST INVENTORY AND THE WORLD-OF-WORK MAP
THE ACT INTEREST INVENTORY AND THE WORLD-OF-WORK MAP Contents The ACT Interest Inventory........................................ 3 The World-of-Work Map......................................... 8 Summary.....................................................
Mind on Statistics. Chapter 4
Mind on Statistics Chapter 4 Sections 4.1 Questions 1 to 4: The table below shows the counts by gender and highest degree attained for 498 respondents in the General Social Survey. Highest Degree Gender
Economic inequality and educational attainment across a generation
Economic inequality and educational attainment across a generation Mary Campbell, Robert Haveman, Gary Sandefur, and Barbara Wolfe Mary Campbell is an assistant professor of sociology at the University
Logic Models, Human Service Programs, and Performance Measurement
Three Logic Models, Human Service Programs, and Performance Measurement Introduction Although the literature on ment has been around for over two decades now, scholars and practitioners still continue
Reliability Analysis
Measures of Reliability Reliability Analysis Reliability: the fact that a scale should consistently reflect the construct it is measuring. One way to think of reliability is that other things being equal,
Stigmatisation of people with mental illness
Stigmatisation of people with mental illness Report of the research carried out in July 1998 and July 2003 by the Office for National Statistics (ONS) on behalf of the Royal College of Psychiatrists Changing
Practical Research. Paul D. Leedy Jeanne Ellis Ormrod. Planning and Design. Tenth Edition
Practical Research Planning and Design Tenth Edition Paul D. Leedy Jeanne Ellis Ormrod 2013, 2010, 2005, 2001, 1997 Pearson Education, Inc. All rights reserved. Chapter 1 The Nature and Tools of Research
Understanding Financial Management: A Practical Guide Guideline Answers to the Concept Check Questions
Understanding Financial Management: A Practical Guide Guideline Answers to the Concept Check Questions Chapter 8 Capital Budgeting Concept Check 8.1 1. What is the difference between independent and mutually
Quantitative Research: Reliability and Validity
Quantitative Research: Reliability and Validity Reliability Definition: Reliability is the consistency of your measurement, or the degree to which an instrument measures the same way each time it is used
EDUCATION POST 2015 Parent Attitudes Survey
EDUCATION POST 2015 Parent Attitudes Survey About the Survey The following analysis contains the results of the 2015 Parent Attitudes Survey, conducted on behalf of Education Post, via an online survey
GUIDE TO WRITING YOUR RESEARCH PAPER Ashley Leeds Rice University
GUIDE TO WRITING YOUR RESEARCH PAPER Ashley Leeds Rice University Here are some basic tips to help you in writing your research paper. The guide is divided into six sections covering distinct aspects of
SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question.
Ch. 1 Introduction to Statistics 1.1 An Overview of Statistics 1 Distinguish Between a Population and a Sample Identify the population and the sample. survey of 1353 American households found that 18%
There are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month
Test-Retest Reliability and The Birkman Method Frank R. Larkey & Jennifer L. Knight, 2002
Test-Retest Reliability and The Birkman Method Frank R. Larkey & Jennifer L. Knight, 2002 Consultants, HR professionals, and decision makers often are asked an important question by the client concerning
Published entries to the three competitions on Tricky Stats in The Psychologist
Published entries to the three competitions on Tricky Stats in The Psychologist Author s manuscript Published entry (within announced maximum of 250 words) to competition on Tricky Stats (no. 1) on confounds,
Chapter 6: The Information Function 129. CHAPTER 7 Test Calibration
Chapter 6: The Information Function 129 CHAPTER 7 Test Calibration 130 Chapter 7: Test Calibration CHAPTER 7 Test Calibration For didactic purposes, all of the preceding chapters have assumed that the
WRITING A RESEARCH PAPER FOR A GRADUATE SEMINAR IN POLITICAL SCIENCE Ashley Leeds Rice University
WRITING A RESEARCH PAPER FOR A GRADUATE SEMINAR IN POLITICAL SCIENCE Ashley Leeds Rice University Here are some basic tips to help you in writing your research paper. The guide is divided into six sections
Sampling and Sampling Distributions
Sampling and Sampling Distributions Random Sampling A sample is a group of objects or readings taken from a population for counting or measurement. We shall distinguish between two kinds of populations
Non-random/non-probability sampling designs in quantitative research
206 RESEARCH MET HODOLOGY Non-random/non-probability sampling designs in quantitative research N on-probability sampling designs do not follow the theory of probability in the choice of elements from the
Latino Decisions Poll of Non-Voters November 2014
MAIN QUESTIONNAIRE 1. Even though you don t plan to vote, thinking about the 2014 election, what are the most important issues facing the [Latino/Hispanic] community that our politicians should address?
Neutrality s Much Needed Place In Dewey s Two-Part Criterion For Democratic Education
Neutrality s Much Needed Place In Dewey s Two-Part Criterion For Democratic Education Taylor Wisneski, Kansas State University Abstract This paper examines methods provided by both John Dewey and Amy Gutmann.
Midterm Review Problems
Midterm Review Problems October 19, 2013 1. Consider the following research title: Cooperation among nursery school children under two types of instruction. In this study, what is the independent variable?
Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach
Paid and Unpaid Work inequalities 1 Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach Paid and Unpaid Labor in Developing Countries: an inequalities in time use approach
Chapter 1: The Nature of Probability and Statistics
Chapter 1: The Nature of Probability and Statistics Learning Objectives Upon successful completion of Chapter 1, you will have applicable knowledge of the following concepts: Statistics: An Overview and
Chapter 9 Assessing Studies Based on Multiple Regression
Chapter 9 Assessing Studies Based on Multiple Regression Solutions to Empirical Exercises 1. Age 0.439** (0.030) Age 2 Data from 2004 (1) (2) (3) (4) (5) (6) (7) (8) Dependent Variable AHE ln(ahe) ln(ahe)
Introduction to Hypothesis Testing OPRE 6301
Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about
APPENDIX. Interest Concepts of Future and Present Value. Concept of Interest TIME VALUE OF MONEY BASIC INTEREST CONCEPTS
CHAPTER 8 Current Monetary Balances 395 APPENDIX Interest Concepts of Future and Present Value TIME VALUE OF MONEY In general business terms, interest is defined as the cost of using money over time. Economists
