Appendix B: Grouped Frequency Distributions and Central Tendency

OBJECTIVES FOR APPENDIX B

After studying the text and working the problems, you should be able to:

1. Use four conventions for constructing grouped frequency distributions
2. Arrange raw data into a grouped frequency distribution
3. Find the mean, median, and mode of a grouped frequency distribution

(In writing this appendix, I assumed that you have studied Chapter 2, Frequency Distributions and Graphs, and the central tendency sections of Chapter 3.)

As you know from Chapter 2, converting a batch of raw scores into a simple frequency distribution brings order out of apparent chaos. For some distributions even more order can be obtained if the raw scores are arranged into a grouped frequency distribution. The order becomes even more apparent when grouped frequency distributions are graphed. In addition to grouping and graphing, this appendix covers the calculation of the mean, median, and mode of grouped frequency distributions.

Grouped frequency distributions are used when the range of scores is too large for a simple frequency distribution. How large is too large? A rule of thumb is that grouped frequency distributions are appropriate when the range of scores is greater than 20. At times, however, ignoring this rule of thumb produces an improved analysis.

Grouped Frequency Distributions

As you may recall from Chapter 2, the only difference between simple frequency distributions and grouped frequency distributions is that grouped frequency distributions have class intervals in the place of scores. Each class interval in a grouped frequency distribution covers the same number of scores. The number of scores in the interval is symbolized i (interval size).

Establishing Class Intervals

There are no hard and fast rules for establishing class intervals. The ones that follow are used by many researchers, but some computer programs do not follow them.

1. The number of class intervals. The number of class intervals should be 10 to 20. On the one hand, with fewer than 10 intervals, the extreme scores in the data are not as apparent because they are clustered with more frequently occurring scores. On the other hand, more than 20 class intervals often make it difficult to see the shape of the distribution.
2. The size of i. If i is odd, the midpoint of the class interval will be a whole number, and whole numbers look better on graphs than decimal numbers. Three and five often work well as interval sizes. You may find that i = 2 is needed if you are to have 10 to 20 class intervals. If an i of 5 produces more than 20 class intervals, data groupers usually jump to an i of 10 or some multiple of 10. An interval size of 5 is popular.
3. The lower limit of a class interval. Begin each class interval with a multiple of i. For example, if the lowest score is 5 and i = 3 (as happened with the Satisfaction With Life Scale (SWLS) scores in Table 2.4), the first class interval should be 3–5. An exception to this convention occurs when i = 5. When the interval size is 5, it is usually better to use a multiple of 5 as the midpoint because multiples of 5 are easier to read on graphs.
4. The order of the intervals. The largest scores go at the top of the table. (This is a convention not followed by some computer programs.)

Converting Unorganized Scores into a Grouped Frequency Distribution

With the conventions for establishing class intervals in mind, here are the steps for converting unorganized data into a grouped frequency distribution.
As an example, I will use the raw data in Table 2.1 and describe converting it into Table 2.4.

1. Find the highest and lowest scores. In Table 2.1, the highest score is 35 and the lowest score is 5.
2. Find the range of the scores by subtracting the lowest score from the highest score (35 - 5 = 30).
3. Determine i by a trial-and-error procedure. Remember that there are to be 10 to 20 class intervals and that the interval size should be convenient (3, 5, 10, or a multiple of 10). Dividing the range by a potential i value gives the approximate number of class intervals. Dividing the range, 30, by 3 gives 10, which is a recommended number of class intervals.
4. Establish the lowest interval. Begin the interval with a multiple of i, which may or may not be an actual raw score. End the interval so that it contains i scores (but not necessarily i frequencies). For Table 2.4, the lowest interval is 3–5. (Note that 3 is not an actual score but is a multiple of i.) Each interval above the lowest one begins with a multiple of i. Continue building the class intervals.
5. With the class intervals written, underline each score (Table 2.1) and put a tally mark beside its class interval (Table 2.4).
6. As a check on your work, add up the frequency column. The sum should be N, the number of scores in the unorganized data.

PROBLEMS

*B.1. A sociology professor was deciding what statistics to present in her introduction to sociology classes. She developed a test that covered concepts such as the median, graphs, standard deviation, and correlation. She tested one class of 50 students, and on the basis of the results, planned a course syllabus for that class and the other six intro sections. Arrange the data into an appropriate rough-draft frequency distribution.

0 56 48 13 30 39 5 41 5 44 7 36 54 46 59 4 17 63 50 4 31 19 38 10 43 31 34 3 15 47 40 36 5 31 53 4 31 41 49 1 6 35 8 37 5 33 7 38 34

*B.2. The measurements that follow are weights in pounds of a sample of college men in one study. Arrange them into a grouped frequency distribution. If these data are skewed, tell the direction of the skew.

164 158 156 148 180 176 171 150 15 155 161 168 148 175 154 155 149 149 151 160 157 158 161 167 15 168 151 157 150 154 189

Central Tendency of Grouped Frequency Distributions

Mean

Finding the mean of a grouped frequency distribution involves the same arithmetic as that for a simple frequency distribution. Setting up the problem, however, requires one additional step. Look at Table B.1, which has four columns (compared to the three in Table 3.3). For a grouped frequency distribution, the midpoint of the interval represents all the scores in the interval. Thus, multiplying the midpoint by its f value includes all the scores in that interval.
As you can see at the bottom of Table B.1, summing the fX column gives ΣfX, which, when divided by N, yields the mean.
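The arithmetic just described can be sketched in a few lines of Python. This is only an illustration: the function name grouped_mean is hypothetical, and the midpoints and frequencies are those of Table B.1, listed from the top interval down.

```python
# Midpoints (X) and frequencies (f) from Table B.1 (i = 3), top interval first.
midpoints = [34, 31, 28, 25, 22, 19, 16, 13, 10, 7, 4]
frequencies = [5, 11, 23, 24, 14, 8, 5, 3, 5, 0, 2]


def grouped_mean(midpoints, frequencies):
    """Return (N, sum of fX, mean) for a grouped frequency distribution.

    Each midpoint stands in for every score in its class interval,
    so midpoint * f approximates the sum of the scores in that interval.
    """
    n = sum(frequencies)
    sum_fx = sum(x * f for x, f in zip(midpoints, frequencies))
    return n, sum_fx, sum_fx / n


n, sum_fx, mean = grouped_mean(midpoints, frequencies)
print(n, sum_fx, mean)  # 100 2392 23.92
```

Because the midpoint replaces the actual scores, the result is an approximation of the mean of the ungrouped data.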

TABLE B.1 A grouped frequency distribution of Satisfaction With Life Scale scores with i = 3

SWLS scores         Midpoint
(class interval)    (X)          f      fX
33–35               34           5     170
30–32               31          11     341
27–29               28          23     644
24–26               25          24     600
21–23               22          14     308
18–20               19           8     152
15–17               16           5      80
12–14               13           3      39
9–11                10           5      50
6–8                  7           0       0
3–5                  4           2       8
                           N = 100   ΣfX = 2392

In terms of a formula,

$$\mu \text{ or } \bar{X} = \frac{\sum fX}{N}$$

For Table B.1,

$$\mu \text{ or } \bar{X} = \frac{\sum fX}{N} = \frac{2392}{100} = 23.92$$

Note that the mean of the grouped data is 23.92 but the mean of the simple frequency distribution is 24.00. The mean of grouped scores is often different, but seldom is this difference of any consequence.

Median

Finding the median of a grouped distribution is almost the same as finding the median of a simple frequency distribution. Of course, you are looking for a point that has as many frequencies above it as below it. To locate the median, use the formula

$$\text{Median location} = \frac{N + 1}{2}$$

For the data in Table B.1,

$$\text{Median location} = \frac{N + 1}{2} = \frac{100 + 1}{2} = 50.5$$

As before, look for a point with 50 frequencies above it and 50 frequencies below it. Adding frequencies from the bottom of the distribution, you find that there are 37 scores below the interval 24–26 and 24 scores in that interval. Because 37 is less than 50.5 and 37 + 24 = 61 is greater, the 50.5th score is in the interval 24–26. The midpoint of the interval is the median. For the grouped SWLS scores in Table B.1, the median is 25.

Thus, to find the median of a grouped frequency distribution, locate the class interval that is the location of the middle score. The midpoint of that interval is the median.

Mode

The mode is the midpoint of the interval that has the highest frequency. In Table B.1 the highest frequency count is 24. The interval with 24 scores is 24–26. The midpoint of that interval, 25, is the mode.

PROBLEMS

B.3. Find the mean, median, and mode of the grouped frequency distribution you constructed from the statistics questionnaire data (problem B.1).

B.4. Find the mean, median, and mode of the weight data in problem B.2.
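
As a rough check on answers to problems like these, the median and mode rules of this appendix can be sketched in Python and tried on the class intervals of Table B.1. The function names are hypothetical, and two choices are assumptions the text does not cover: what to report when the median location falls between two intervals, and what to report when two intervals tie for the highest frequency.

```python
# Class intervals of Table B.1 as (lower limit, upper limit, f), lowest first.
intervals = [
    (3, 5, 2), (6, 8, 0), (9, 11, 5), (12, 14, 3), (15, 17, 5), (18, 20, 8),
    (21, 23, 14), (24, 26, 24), (27, 29, 23), (30, 32, 11), (33, 35, 5),
]


def midpoint(lower, upper):
    """Midpoint of a class interval."""
    return (lower + upper) / 2


def grouped_median(intervals):
    """Midpoint of the interval that contains the (N + 1) / 2 location."""
    n = sum(f for _, _, f in intervals)
    location = (n + 1) / 2
    cumulative = 0
    for lower, upper, f in intervals:
        cumulative += f  # frequencies counted from the bottom up
        # Assumption: if the location falls between two intervals,
        # this sketch reports the higher one.
        if cumulative >= location:
            return midpoint(lower, upper)
    raise ValueError("empty distribution")


def grouped_mode(intervals):
    """Midpoint of the interval with the highest frequency."""
    # Assumption: on a tie, max() keeps the first (lowest) interval.
    lower, upper, _ = max(intervals, key=lambda row: row[2])
    return midpoint(lower, upper)


print(grouped_median(intervals))  # 25.0
print(grouped_mode(intervals))    # 25.0
```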