Math
Elementary Statistics: A Brief Version, 5/e, Bluman

Ch. 3.1 # 3, 4,, 30, 3, 3

Find (a) the mean, (b) the median, (c) the mode, and (d) the midrange.

3) High Temperatures
The reported high temperatures (in degrees Fahrenheit) for selected world cities on an October day are shown below. Which measure of central tendency do you think best describes these data?

6 66 9 83 6 6 85 64 4 4 38 9 66 90 4 63 64 68 4

(a) Mean = $\dfrac{\sum x}{n} = \dfrac{1566}{23} \approx 68.1$
(b) Median: in order, the values run 38, 4, 4, ..., 68, ..., 85, 90, 91, so Median = 68.
(c) Mode = 4, 6, 64, 66,, 4
(d) Midrange = $\dfrac{38 + 91}{2} = 64.5$

For the best measure of average, answers will vary.

4) Observers in the Frogwatch Program
The number of observers in the Frogwatch USA program (a wildlife conservation program dedicated to helping conserve frogs and toads) for the top 10 states with the most observers is 484, 483, 4, 396, 38, 35, 338, 33, 38, and 30. The top 10 states with the most active watchers list these numbers of visits: 634, 464, 406, 6, 9, 94, 9, 50, 30, and 4. Compare the measures of central tendency for these two groups of data.
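The four measures requested in these exercises can be sketched in a few lines of Python. This is only an illustration: the function name `central_tendency` and the sample values are assumptions made for the example and are not taken from the exercises. The sketch reports no mode when no value repeats.

```python
from collections import Counter
from statistics import median


def central_tendency(values):
    """Return the mean, median, mode(s), and midrange of a list of numbers."""
    counts = Counter(values)
    top = max(counts.values())
    # Report no mode when no value occurs more than once.
    modes = sorted(v for v, c in counts.items() if c == top) if top > 1 else []
    return {
        "mean": sum(values) / len(values),
        "median": median(values),
        "modes": modes,
        # Midrange: average of the smallest and largest values.
        "midrange": (min(values) + max(values)) / 2,
    }


# Illustrative sample (hypothetical values, not from the exercises).
sample = [2, 3, 3, 5, 7, 10]
print(central_tendency(sample))
```

Because the midrange uses only the two extreme values, a single outlier moves it much more than it moves the median, which is one reason the best measure depends on the data.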
Earthquake Strengths
Twelve major earthquakes had Richter magnitudes shown here.

5.4 5.4 6.2 6.2 6.4 6.4 6.5 7.0 7.2 7.2 7.7 8.0

a) Mean = $\dfrac{\sum x}{n} = \dfrac{79.6}{12} \approx 6.63$
b) Median = $\dfrac{6.4 + 6.5}{2} = 6.45$
c) Mode = 5.4, 6.2, 6.4, and 7.2 (each of these values occurs twice)
d) Midrange = $\dfrac{5.4 + 8.0}{2} = 6.7$

Since almost all of the data lie close to the mean (6.63), the mean is a reasonable choice for the measure of average.

30) Final Grade
An instructor grades exams, 20%; term paper, 30%; final exam, 50%. A student had grades of 83,, and 90, respectively, for exams, term paper, and final exam. Find the student's final average. Use the weighted mean.

3) Final Grade
Another instructor gives four -hour exams and one final exam, which counts as two -hour exams. Find a student's grade if she received 6, 83, 9, and 90 on the -hour exams and 8 on the final exam.

3) For these situations, state which measure of central tendency (mean, median, or mode) should be used.
a. The most typical case is desired. Mode
b. The distribution is open-ended. Median
c. There is an extreme value in the data set. Median
d. The data are categorical. Mode
e. Further statistical computations will be needed. Mean
f. The values are to be divided into two approximately equal groups, one group containing the larger values and one containing the smaller values. Median

Section 3-2 #, 9,, 3,30, 3, 33, 34, 35,3, 4, 4

For exercises -3, find the range, variance, and standard deviation. Assume the data represent samples, and use the shortcut formula for the unbiased estimator to compute the variance and standard deviation.
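The shortcut formula named in the instructions, $s^2 = \dfrac{\sum X^2 - (\sum X)^2 / n}{n - 1}$, can be sketched as below. The function name `sample_spread` and the sample values are assumptions made for illustration. The coefficient of variation is included because later exercises compare variability with it, and it assumes a nonzero mean.

```python
import math


def sample_spread(values):
    """Range, sample variance (shortcut formula), standard deviation, and CVar."""
    n = len(values)
    sum_x = sum(values)
    sum_x2 = sum(x * x for x in values)
    # Shortcut formula for the unbiased (n - 1) sample variance.
    variance = (sum_x2 - sum_x ** 2 / n) / (n - 1)
    std_dev = math.sqrt(variance)
    mean = sum_x / n
    return {
        "range": max(values) - min(values),
        "variance": variance,
        "std_dev": std_dev,
        # Coefficient of variation as a percentage; assumes mean != 0.
        "cvar_percent": std_dev / mean * 100,
    }


# Illustrative sample (hypothetical values, not from the exercises).
sample = [10, 60, 50, 30, 40, 20]
print(sample_spread(sample))
```

The shortcut form needs only the running totals of X and X squared, which is why the worked solutions tabulate those two columns.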
mean = $\bar{X} = \dfrac{\sum X}{n}$

variance = $s^2 = \dfrac{\sum (X - \bar{X})^2}{n - 1}$ or $s^2 = \dfrac{\sum X^2 - (\sum X)^2 / n}{n - 1}$

standard deviation = $s = \sqrt{\text{variance}}$

Police Calls in Schools
The number of incidents where police were needed for a sample of 10 schools in Allegheny County is 0, 3, 3, 6, 7, 8, 10, 11, 37, 48 (shown here in ascending order). Are the data consistent or do they vary? Explain your answer.

X       X^2
0       0
3       9
3       9
6       36
7       49
8       64
10      100
11      121
37      1369
48      2304
Sum 133 4061

n = 10
Mean = $\dfrac{\sum X}{n} = \dfrac{133}{10} = 13.3$
Range = 48 - 0 = 48
Variance = $s^2 = \dfrac{\sum X^2 - (\sum X)^2 / n}{n - 1} = \dfrac{4061 - (133)^2 / 10}{9} = \dfrac{4061 - 1768.9}{9} = \dfrac{2292.1}{9} \approx 254.7$
Standard deviation = $\sqrt{254.7} \approx 16$

The data vary widely: the standard deviation (about 16) is larger than the mean (13.3).

9) Precipitation and High Temperatures
The normal daily high temperatures (in degrees Fahrenheit) in January for 10 selected cities are as follows.
50, 37, 29, 54, 30, 61, 47, 38, 34, 61
The normal monthly precipitation (in inches) for these same 10 cities is listed here.
4.8, 2.6, 1.5, 1.8, 1.8, 3.3, 5.1, 1.1, 1.8, 2.5
Which set is more variable?

For the temperature:

X       X^2
29      841
30      900
34      1156
37      1369
38      1444
47      2209
50      2500
54      2916
61      3721
61      3721
Sum 441 20,777

n = 10
Mean = $\dfrac{\sum X}{n} = \dfrac{441}{10} = 44.1$
Range = 61 - 29 = 32
Variance = $s^2 = \dfrac{\sum X^2 - (\sum X)^2 / n}{n - 1} = \dfrac{20{,}777 - (441)^2 / 10}{9} = \dfrac{20{,}777 - 19{,}448.1}{9} = \dfrac{1328.9}{9} \approx 147.7$
Standard deviation = $\sqrt{147.7} \approx 12.15$
CVar = $\dfrac{s}{\bar{X}} = \dfrac{12.15}{44.1} \approx 27.6\%$

For the precipitation:

X       X^2
4.8     23.04
2.6     6.76
1.5     2.25
1.8     3.24
1.8     3.24
3.3     10.89
5.1     26.01
1.1     1.21
1.8     3.24
2.5     6.25
Sum 26.3 86.13

n = 10
Mean = $\dfrac{\sum X}{n} = \dfrac{26.3}{10} = 2.63$
Range = 5.1 - 1.1 = 4.0
Variance = $s^2 = \dfrac{\sum X^2 - (\sum X)^2 / n}{n - 1} = \dfrac{86.13 - (26.3)^2 / 10}{9} = \dfrac{86.13 - 69.17}{9} = \dfrac{16.96}{9} \approx 1.88$
Standard deviation = $\sqrt{1.88} \approx 1.37$
CVar = $\dfrac{s}{\bar{X}} = \dfrac{1.37}{2.63} \approx 52.1\%$

Precipitation is more variable, since its CVar is higher.

Stories in the Tallest Buildings
The number of stories in the 13 tallest buildings for two different cities is listed below. Which set of data is more variable?
Houston: 5,, 64, 56, 53, 55, 4, 55, 5, 50, 50, 50, 4
Pittsburgh: 64, 54, 40, 3, 46, 44, 4, 4, 40, 40, 34, 3, 30

3) Hardcover Bestsellers
The number of weeks on The New York Times Best Sellers list for hardcover fiction is
1 4 2 2 3 18 5 5 10 4 3 6 2 2 22
Use the range rule of thumb to estimate the standard deviation. Compare the estimate to the actual standard deviation.

X       X^2
1       1
4       16
2       4
2       4
3       9
18      324
5       25
5       25
10      100
4       16
3       9
6       36
2       4
2       4
22      484
Sum 89  1061

Range rule of thumb: $s \approx \dfrac{\text{range}}{4} = \dfrac{22 - 1}{4} = \dfrac{21}{4} = 5.25$

Actual value: $s^2 = \dfrac{\sum X^2 - (\sum X)^2 / n}{n - 1} = \dfrac{1061 - (89)^2 / 15}{14} \approx 38.1$, so $s = \sqrt{38.1} \approx 6.2$

By the range rule of thumb the estimate of the standard deviation is 5.25, and the computed standard deviation is about 6.2. The estimate is reasonably close to the actual standard deviation.

30) Exam Scores
The average score on an English final examination was 85, with a standard deviation of 5; the average score on a history final exam was 110, with a standard deviation of 8. Which class was more variable?

The coefficients of variation are:
English final: CVar = $\dfrac{s}{\bar{X}} \cdot 100\% = \dfrac{5}{85} \cdot 100\% \approx 5.9\%$
History final: CVar = $\dfrac{s}{\bar{X}} \cdot 100\% = \dfrac{8}{110} \cdot 100\% \approx 7.3\%$
The history class was more variable.

3) Ages of Accountants
The average age of the accountants at Three Rivers Corp. is 26 years, with a standard deviation of 6 years; the average salary of the accountants is $31,000, with a standard deviation of $4000. Compare the variations of age and income.

Age: CVar = $\dfrac{s}{\bar{X}} \cdot 100\% = \dfrac{6}{26} \cdot 100\% \approx 23.1\%$
Salary: CVar = $\dfrac{s}{\bar{X}} \cdot 100\% = \dfrac{4000}{31{,}000} \cdot 100\% \approx 12.9\%$
Age is more variable.

33) The mean of a distribution is 20 and the standard deviation is 2. Use Chebyshev's theorem.

a) At least what percentage of the values will fall between 10 and 30?
The interval is 20 ± 10, so $k = 10 / 2 = 5$.
$\dfrac{1}{k^2} = \dfrac{1}{5^2} = \dfrac{1}{25} = 0.04$, so $1 - \dfrac{1}{k^2} = 1 - 0.04 = 0.96 = 96\%$

b) At least what percentage of the values will fall between 12 and 28?
The interval is 20 ± 8, so $k = 8 / 2 = 4$.
$\dfrac{1}{k^2} = \dfrac{1}{4^2} = \dfrac{1}{16} = 0.0625$, so $1 - \dfrac{1}{k^2} = 1 - 0.0625 = 0.9375 = 93.75\%$

34) In a distribution of 160 values with a mean of 72, at least 120 fall within the interval 67 to 77. Approximately what percentage of values should fall in the interval 62 to 82? Use Chebyshev's theorem.
$\dfrac{120}{160} = 0.75 = 75\%$

Setting $1 - \dfrac{1}{k^2} = 0.75$ gives $\dfrac{1}{k^2} = 0.25$, so $k^2 = \dfrac{1}{0.25} = 4$ and $k = 2$.
The upper limit of the first interval is $72 + 2s = 77$, so $s = 2.5$.
For the second interval, $72 + 2.5k = 82$, so $k = 4$ and $1 - \dfrac{1}{4^2} = 0.9375$, or 93.75%.
At least 93.75% of the data values will fall in the interval 62 to 82.

35) Calories
The average number of calories in a regular size bagel is 240. If the standard deviation is 38 calories, find the range in which at least 75% of the data will lie. Use Chebyshev's theorem.

$1 - \dfrac{1}{k^2} = 0.75$, so $\dfrac{1}{k^2} = 0.25$, $k^2 = 4$, and $k = 2$.
At least 75% of the data values will fall within two standard deviations of the mean; hence
$(240 - 2(38),\ 240 + 2(38)) = (164,\ 316)$
At least 75% of the data values will fall between 164 and 316 calories.

3) Solid Waste Production
The average college student produces 640 pounds of solid waste each year including 500 disposable cups and 30 pounds of paper. If the standard deviation is approximately 85 pounds, within what weight limits will at least 88.89% of all students' garbage lie?

$1 - \dfrac{1}{k^2} = 0.8889$, so $\dfrac{1}{k^2} = 1 - 0.8889 = 0.1111$, $k^2 = \dfrac{1}{0.1111} \approx 9$, and $k \approx 3$.
$(640 - 3(85),\ 640 + 3(85)) = (385,\ 895)$
Therefore at least 88.89% of the data values will fall between 385 and 895 pounds.

4) Citrus Fruit Consumption
The average U.S. yearly per capita consumption of citrus fruit is 26.8 pounds. Suppose that the distribution of fruit amounts consumed is bell-shaped with a standard deviation equal to 4.2 pounds. What percentage of Americans would you expect to consume more than 31 pounds of citrus fruit per year?

Since 26.8 + 4.2 = 31, the value 31 lies 1 standard deviation above the mean. By the Empirical Rule, 68% of consumption is within 1 standard deviation of the mean. The remaining 32% is split evenly between the two tails, so half of 32%, or 16%, of Americans would be expected to consume more than 31 pounds of citrus fruit per year.

4) Work Hours for College Faculty
The average full-time faculty member in a post-secondary degree-granting institution works an average of 53 hours per week.

a) If we assume the standard deviation is 2.8 hours, what percentage of faculty members work more than 58.6 hours a week?
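$k = \dfrac{58.6 - 53}{2.8} = 2$, so $\dfrac{1}{k^2} = \dfrac{1}{4} = 0.25$.
By Chebyshev's theorem, at most 25% of the values lie more than 2 standard deviations from the mean. Splitting this equally between the two tails gives $\dfrac{0.25}{2} = 0.125 = 12.5\%$.
At most about 12.5% of faculty members work more than 58.6 hours a week, assuming the two tails are equal.

b) If we assume a bell-shaped distribution, what percentage of faculty members work more than 58.6 hours a week?
By the Empirical Rule, 95% of faculty members work within 2 standard deviations of the mean. Then half of the remaining 5%, or 2.5%, of faculty members work more than 58.6 hours a week.

Ch 3-3 #, 3, 4,, 3, 4, 5,

Exam Scores
A final examination for a psychology course has a mean of 84 and a standard deviation of 4. Find the corresponding z score for each raw score.

a) $z = \dfrac{X - \mu}{\sigma} = \dfrac{87 - 84}{4} = 0.75$
b) $z = \dfrac{X - \mu}{\sigma} = \dfrac{79 - 84}{4} = -1.25$
c) $z = \dfrac{X - \mu}{\sigma} = \dfrac{93 - 84}{4} = 2.25$
d) $z = \dfrac{X - \mu}{\sigma} = \dfrac{76 - 84}{4} = -2$
e) $z = \dfrac{X - \mu}{\sigma} = \dfrac{82 - 84}{4} = -0.5$

3) Which has a better relative position: a score of 75 on a statistics test with a mean of 60 and a standard deviation of 10 or a score of 36 on an accounting test with a mean of 30 and a variance of 16?

a. A score of 75 on an exam with $\bar{X} = 60$ and $s = 10$:
$z = \dfrac{X - \bar{X}}{s} = \dfrac{75 - 60}{10} = 1.5$
b. A score of 36 on an exam with $\bar{X} = 30$ and $s = \sqrt{16} = 4$:
$z = \dfrac{X - \bar{X}}{s} = \dfrac{36 - 30}{4} = 1.5$

Neither. The scores have the same relative position.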
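The relative-position comparison above can be checked with a short sketch. The helper name `z_score` is an assumption made for illustration. The inputs are the values used in the comparison: a score of 75 with mean 60 and standard deviation 10, and a score of 36 with mean 30 and variance 16.

```python
import math


def z_score(value, mean, std_dev):
    """Number of standard deviations that a value lies above or below the mean."""
    return (value - mean) / std_dev


# Statistics test: the standard deviation is given directly.
stats_z = z_score(75, 60, 10)

# Accounting test: the variance is given, so take its square root first.
acct_z = z_score(36, 30, math.sqrt(16))

print(stats_z, acct_z)
if stats_z == acct_z:
    print("Same relative position")
```

Converting the variance to a standard deviation before dividing is the step most easily missed in this exercise; dividing by 16 instead of 4 would make the accounting score look much weaker than it is.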
4) Test Scores
A student scores 60 on a mathematics test that has a mean of 54 and a standard deviation of 3, and she scores 80 on a history test with a mean of 75 and a standard deviation of 2. On which test did she perform better?

Mathematics: $z = \dfrac{X - \mu}{\sigma} = \dfrac{60 - 54}{3} = 2.0$
History: $z = \dfrac{X - \mu}{\sigma} = \dfrac{80 - 75}{2} = 2.5$
The student performed better on the history test.

Average Weekly Earnings
The average weekly earnings in dollars for various industries are listed below. Find the percentile rank of each value.
804 36 659 489 63 59 54 8
Reorder: 8, 489, 54, 59, 63, 659, 36,, 804

$\text{Percentile} = \dfrac{(\text{number of values below } X) + 0.5}{\text{total number of values}} \cdot 100\%$

3) In exercise, what value corresponds to the 40th percentile?
What is 40% of n? 0.40 x 9 = 3.6, round up to 4. Therefore, the value is the number located in the 4th place, which is 59.

4) Test Scores
Find the percentile rank for each test score in the data set.
12 28 35 42 47 49 50

$\text{Percentile} = \dfrac{(\text{number of values below } X) + 0.5}{\text{total number of values}} \cdot 100\%$

12: percentile = $\dfrac{0 + 0.5}{7} \cdot 100\% \approx$ 7th
28: percentile = $\dfrac{1 + 0.5}{7} \cdot 100\% \approx$ 21st
35: percentile = $\dfrac{2 + 0.5}{7} \cdot 100\% \approx$ 36th
42: percentile = $\dfrac{3 + 0.5}{7} \cdot 100\% =$ 50th
47: percentile = $\dfrac{4 + 0.5}{7} \cdot 100\% \approx$ 64th
49: percentile = $\dfrac{5 + 0.5}{7} \cdot 100\% \approx$ 79th
50: percentile = $\dfrac{6 + 0.5}{7} \cdot 100\% \approx$ 93rd

5) In exercise 4, what value corresponds to the 60th percentile?
What is 60% of n? 0.6 x 7 = 4.2, round it up to 5. Therefore the number in the 5th location, which is 47, corresponds to the 60th percentile.

What value in Exercise 6 corresponds to the 40th percentile?
Values in Exercise 6: 1.1 1.7 1.9 2.1 2.2 2.5 3.3 6.2 6.8 20.3
What is 40% of n? 0.40 x 10 = 4. Because this is a whole number, the value halfway between the 4th and 5th places (2.1 and 2.2) is used: 2.15 corresponds to the 40th percentile.

3.4 #, 3,, 9,, 3, 5

Identify the five-number summary and find the interquartile range.

8, 12, 32, 6, 27, 19, 54
Reorder: 6, 8, 12, 19, 27, 32, 54
Low = 6
Q1 = 8
Median = Q2 = 19
Q3 = 32
High = 54
IQR = Q3 - Q1 = 32 - 8 = 24

3) 188 192 316 362 437 589
Low = 188
Q1 = 192
Median = $\dfrac{316 + 362}{2} = 339$
Q3 = 437
High = 589
IQR = Q3 - Q1 = 437 - 192 = 245

Use each box plot to identify the maximum value, minimum value, median, first quartile, third quartile, and interquartile range.

) Low = 3, High = , median = 8, Q1 = 5, Q3 = 9, and IQR = Q3 - Q1 = 4

9) Low = 00, High = 35, median = 5, Q1 = 5, Q3 = 300, and IQR = Q3 - Q1 = 5

Earned Run Average: Number of Games Pitched
Construct a boxplot for the following data and comment on the shape of the distribution representing the number of games pitched by major league baseball's earned run average (ERA) leaders for the past few years.
30 34 9 30 34 9 3 33 34 30 34 3
Reorder: ,, 9, 9, 30, 30, 30, 31, 3, 33, 34, 34, 34, 34
Low = , Q1 = 9, median = $\dfrac{30 + 31}{2} = 30.5$, Q3 = 34, High = 34

3) State Sites for Frogwatch
Construct a boxplot for these numbers of state sites for Frogwatch USA. Is the distribution symmetric?
4 395 34 94 89 53 4 38 35 99
Reorder: 99 35 38 4 53 89 94 34 395 4
5) Tornadoes in 2005
Construct a boxplot and comment on its skewness for the number of tornadoes recorded each month in 2005.
33 0 6 3 3 36 38 3 33 8
Reorder: 0 8 33 6 3 3 3 33 38 36
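The five-number summary needed before drawing a boxplot can be sketched as follows. The function name `five_number_summary` and the sample values are assumptions made for illustration. The quartiles are taken as the medians of the lower and upper halves of the ordered data, leaving out the middle value when n is odd; other quartile conventions exist and may give slightly different results.

```python
from statistics import median


def five_number_summary(values):
    """Low, Q1, median, Q3, high, and IQR using the median-of-halves method."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]           # values below the median position
    upper = data[half + n % 2:]   # values above the median position
    q1, q2, q3 = median(lower), median(data), median(upper)
    return {
        "low": data[0],
        "q1": q1,
        "median": q2,
        "q3": q3,
        "high": data[-1],
        "iqr": q3 - q1,
    }


# Illustrative sample (hypothetical values for demonstration).
sample = [8, 12, 32, 6, 27, 19, 54]
print(five_number_summary(sample))
```

A rough reading of skewness from the summary: if the median sits closer to Q1 than to Q3 and the upper whisker is longer than the lower one, the distribution is positively skewed; the reverse pattern suggests negative skew.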