A.2 Measures of Central Tendency and Dispersion




What you should learn
How to find and interpret the mean, median, and mode of a set of data
How to determine the measure of central tendency that best represents a set of data
How to find the standard deviation of a set of data
How to create and use box-and-whisker plots

Why you should learn it
Measures of central tendency and dispersion provide a convenient way to describe and compare sets of data. For instance, in Exercise 36, the mean and standard deviation are used to analyze the price of gold for the years 1981 through 2000.

Mean, Median, and Mode
In many real-life situations, it is helpful to describe data by a single number that is most representative of the entire collection of numbers. Such a number is called a measure of central tendency. The most commonly used measures are as follows.
1. The mean, or average, of n numbers is the sum of the numbers divided by n.
2. The median of n numbers is the middle number when the numbers are written in order. If n is even, the median is the average of the two middle numbers.
3. The mode of n numbers is the number that occurs most frequently. If two numbers tie for most frequent occurrence, the collection has two modes and is called bimodal.

Example 1 Comparing Measures of Central Tendency
On an interview for a job, the interviewer tells you that the average annual income of the company's employees is $60,89. The actual annual incomes of the employees are shown below. What are the mean, median, and mode of the incomes? Was the person telling you the truth?
$7,0, $78,0, $,678, $8,980, $7,08, $,676, $8,906, $,00, $,0, $,0, $,00, $,8, $7,0, $0,, $8,96, $,98, $6,0, $0,9, $6,8, $6,0, $,6, $98,, $8,980, $9,0, $,67

The mean of the incomes is the sum of the 25 incomes divided by 25, which gives $60,89.
To find the median, order the incomes as follows.
$,00, $,00, $6,0, $7,0, $7,08, $8,980, $0,, $,0, $,676, $8,906, $8,96, $,6, $,0, $,8, $,98, $,67, $6,0, $6,8, $7,0, $,678, $8,980, $9,0, $98,, $0,9, $78,0
From this list, you can see that the median (the middle number) is $,0. From the same list, you can see that $,00 is the only income that occurs more than once. So, the mode is $,00. Technically, the person was telling the truth because the average is (generally) defined to be the mean. However, of the three measures of central tendency
Mean: $60,89 Median: $,0 Mode: $,00
it seems clear that the median is most representative. The mean is inflated by the two highest salaries.
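The three definitions above translate directly into code. The following is a minimal Python sketch using the standard library's statistics module. The helper name central_tendency and the income figures are hypothetical, chosen only to show how one very large value pulls the mean away from the median; they are not the figures from Example 1.

```python
from statistics import mean, median, multimode

def central_tendency(data):
    """Return (mean, median, modes) for a non-empty list of numbers."""
    # multimode returns every value tied for most frequent occurrence,
    # so a bimodal collection yields a list with two entries.
    return mean(data), median(data), multimode(data)

# Hypothetical annual incomes, made up for illustration.
incomes = [30_000, 32_000, 32_000, 35_000, 38_000, 40_000, 250_000]

avg, mid, modes = central_tendency(incomes)
print("mean:", round(avg, 2))
print("median:", mid)
print("mode(s):", modes)
```

In this made-up data the single income of 250,000 raises the mean well above every other income, while the median stays in the middle of the group, which mirrors the conclusion of Example 1.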

A6 Appendix A Concepts in Statistics

Choosing a Measure of Central Tendency
Which of the three measures of central tendency is the most representative? The answer is that it depends on the distribution of the data and the way in which you plan to use the data. For instance, in Example 1, the mean salary of $60,89 does not seem very representative to a potential employee. To a city income tax collector who wants to estimate a fixed percent of the total income of the employees, however, the mean is precisely the right measure.

Example 2 Choosing a Measure of Central Tendency
Which measure of central tendency is the most representative of the data shown in each frequency distribution?
a. Number Tally b. Number Tally c. Number Tally 7 9 6 0 8 7 6 8 6 6 6 6 7 7 7 7 8 0 8 8 8 9 9 9 9 0
a. For this data, the mean is., the median is, and the mode is. Of these, the mode is probably the most representative.
b. For this data, the mean and median are each and the modes are and 9 (the distribution is bimodal). Of these, the mean or median is the most representative.
c. For this data, the mean is.9, the median is, and the mode is. Of these, the mean or median is the most representative.

Variance and Standard Deviation
Very different sets of numbers can have the same mean. You will now study two measures of dispersion, which give you an idea of how much the numbers in a set differ from the mean of the set. These two measures are called the variance of the set and the standard deviation of the set.

Definitions of Variance and Standard Deviation
Consider a set of numbers \(\{x_1, x_2, \ldots, x_n\}\) with a mean of \(\bar{x}\). The variance of the set is
\[ v = \frac{(x_1 - \bar{x})^2 + (x_2 - \bar{x})^2 + \cdots + (x_n - \bar{x})^2}{n} \]
and the standard deviation of the set is
\[ \sigma = \sqrt{v} \]
(\(\sigma\) is the lowercase Greek letter sigma).
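As a sketch of how these definitions can be computed, the Python functions below follow the formulas literally, dividing by n as the definition above does. The function names and the sample data are illustrative assumptions, not part of the text.

```python
import math

def variance(data):
    """Variance as defined above: mean of the squared deviations from the mean."""
    n = len(data)
    x_bar = sum(data) / n
    return sum((x - x_bar) ** 2 for x in data) / n

def standard_deviation(data):
    """Standard deviation: the square root of the variance."""
    return math.sqrt(variance(data))

# Illustrative data with mean 5; the squared deviations sum to 32.
sample = [2, 4, 4, 4, 5, 5, 7, 9]
print(variance(sample), standard_deviation(sample))  # 4.0 2.0
```

Note that many software tools also offer a "sample" variance that divides by n - 1 instead of n; that variant gives slightly larger values and is not the definition used in this section.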

The standard deviation of a set is a measure of how much a typical number in the set differs from the mean. The greater the standard deviation, the more the numbers in the set vary from the mean. For instance, each of the following sets has a mean of 5.
{5, 5, 5, 5}, {4, 4, 6, 6}, and {3, 3, 7, 7}
The standard deviations of the sets are 0, 1, and 2.
\[ \sigma_1 = \sqrt{\frac{(5-5)^2 + (5-5)^2 + (5-5)^2 + (5-5)^2}{4}} = 0 \]
\[ \sigma_2 = \sqrt{\frac{(4-5)^2 + (4-5)^2 + (6-5)^2 + (6-5)^2}{4}} = 1 \]
\[ \sigma_3 = \sqrt{\frac{(3-5)^2 + (3-5)^2 + (7-5)^2 + (7-5)^2}{4}} = 2 \]

Example 3 Estimations of Standard Deviation
Consider the three sets of data represented by the bar graphs in Figure A.. Which set has the smallest standard deviation? Which has the largest?
[Figure A.: bar graphs of Set A, Set B, and Set C]
Of the three sets, the numbers in set A are grouped most closely to the center and the numbers in set C are the most dispersed. So, set A has the smallest standard deviation and set C has the largest standard deviation.

Example 4 Finding Standard Deviation
Find the standard deviation of each set shown in Example 3.
Because of the symmetry of each bar graph, you can conclude that each has a mean of x. The standard deviation of set A is ( 0.. The standard deviation of set B is 0. The standard deviation of set C is 0.. These values confirm the results of Example 3. That is, set A has the smallest standard deviation and set C has the largest.

The following alternative formula provides a more efficient way to compute the standard deviation.

Alternative Formula for Standard Deviation
The standard deviation of \(\{x_1, x_2, \ldots, x_n\}\) is
\[ \sigma = \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n} - \bar{x}^2}. \]

Because of messy computations, this formula is difficult to verify. Conceptually, however, the process is straightforward. It consists of showing that the expressions
\[ \sqrt{\frac{(x_1 - \bar{x})^2 + (x_2 - \bar{x})^2 + \cdots + (x_n - \bar{x})^2}{n}} \quad \text{and} \quad \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n} - \bar{x}^2} \]
are equivalent. Try verifying this equivalence for the set \(\{x_1, x_2, x_3\}\) with \(\bar{x} = (x_1 + x_2 + x_3)/3\).
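One way to carry out the verification for general n is sketched here. It relies only on the definition of the mean, \(x_1 + x_2 + \cdots + x_n = n\bar{x}\); the summation notation is introduced for brevity and is an addition to the notation used in this section.

```latex
\begin{align*}
\sum_{i=1}^{n} (x_i - \bar{x})^2
  &= \sum_{i=1}^{n} x_i^2 \;-\; 2\bar{x}\sum_{i=1}^{n} x_i \;+\; n\bar{x}^2 \\
  &= \sum_{i=1}^{n} x_i^2 \;-\; 2\bar{x}\,(n\bar{x}) \;+\; n\bar{x}^2 \\
  &= \sum_{i=1}^{n} x_i^2 \;-\; n\bar{x}^2 .
\end{align*}
```

Dividing both sides by n and taking square roots turns the left side into the defining expression for the standard deviation and the right side into the alternative formula, so the two agree.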

AK 7 AL 09 AR 8 AZ 6 CA 9 CO 67 CT DC DE 6 FL 0 GA HI IA ID IL 98 IN KS KY 0 LA MA 79 MD 9 ME 7 MI MN MO 8 MS 96
MT NC ND NE 8 NH 8 NJ 8 NM 6 NV NY 8 OH 67 OK 09 OR 9 PA 0 RI SC 6 SD 8 TN TX 08 UT VA 89 VT WA 86 WI WV 8 WY
[Figure A.: histogram; horizontal axis: Number of hospitals (in thousands); vertical axis: Number of states]

Example 5 Using the Alternative Formula
Use the alternative formula for standard deviation to find the standard deviation of the following set of numbers.
, 6, 6, 7, 7, 8, 8, 8, 9, 0
Begin by finding the mean of the set, which is 7.. So, with n = 10, the standard deviation is
\[ \sigma = \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_{10}^2}{10} - \bar{x}^2}. \]
You can use the statistical features of a graphing utility to check this result.

A well-known theorem in statistics, called Chebychev's Theorem, states that at least
\[ 1 - \frac{1}{k^2} \]
of the numbers in a distribution must lie within k standard deviations of the mean. So, at least 75% of the numbers in a set must lie within two standard deviations of the mean, and at least 88.9% of the numbers must lie within three standard deviations of the mean. For most distributions, these percentages are low. For instance, in all three distributions shown in Example 3, 100% of the numbers lie within two standard deviations of the mean.

Example 6 Describing a Distribution
The table above shows the number of hospitals (in thousands) in each state and the District of Columbia in 1999. Find the mean and standard deviation of the numbers. What percent of the numbers lie within two standard deviations of the mean? (Source: Health Forum)
Begin by entering the numbers into a graphing utility that has a standard deviation program. After running the program, you should obtain x 97.8 and 8.99. The interval that contains all numbers that lie within two standard deviations of the mean is 97.8 8.99, 97.8 8.99 or 66.80, 6.6.
From the histogram in Figure A., you can see that all but two of the numbers (96%) lie in this interval, that is, all but the numbers that correspond to the number of hospitals (in thousands) in California and Texas.
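The comparison made in Example 6 between Chebychev's guaranteed minimum and the fraction actually observed can be sketched in Python as follows. The function names and the data set are hypothetical (the data is not the hospital table); pstdev from the standard library divides by n, matching the definition of standard deviation used in this section.

```python
from statistics import fmean, pstdev

def chebychev_bound(k):
    """Minimum fraction of values within k standard deviations of the mean."""
    return 1 - 1 / k**2

def fraction_within(data, k):
    """Observed fraction of values within k standard deviations of the mean."""
    x_bar = fmean(data)
    sigma = pstdev(data)  # divides by n, as in this section's definition
    inside = [x for x in data if abs(x - x_bar) <= k * sigma]
    return len(inside) / len(data)

# Hypothetical data with one extreme value.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]

for k in (2, 3):
    print(k, round(chebychev_bound(k), 3), fraction_within(data, k))
```

For this made-up data the observed fractions exceed the guaranteed minimums for both values of k, which is the typical situation the text describes when it says the percentages in the theorem are low for most distributions.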

Box-and-Whisker Plots
Standard deviation is the measure of dispersion that is associated with the mean. Quartiles measure dispersion associated with the median.

Definition of Quartiles
Consider an ordered set of numbers whose median is m. The lower quartile is the median of the numbers that occur before m. The upper quartile is the median of the numbers that occur after m.

Example 7 Finding Quartiles of a Set
Find the lower and upper quartiles for the set.
,,, 6,, 8, 0,, 6, 6,, 7
Begin by ordering the set.
,,, 6, 6, 8, 0,,, 6, 7,
1st 25% 2nd 25% 3rd 25% 4th 25%
The median of the entire set is 9. The median of the six numbers that are less than 9 is. So, the lower quartile is. The median of the six numbers that are greater than 9 is. So, the upper quartile is.

Quartiles are represented graphically by a box-and-whisker plot, as shown in Figure A.6. In the plot, notice that five numbers are listed: the smallest number, the lower quartile, the median, the upper quartile, and the largest number. Also notice that the numbers are spaced proportionally, as though they were on a real number line.
[Figure A.6: box-and-whisker plot]

The next example shows how to find quartiles when the number of elements in a set is not divisible by 4.
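The definition of quartiles above, together with the five numbers shown in a box-and-whisker plot, can be sketched in Python as follows. The function name five_number_summary and the test data are illustrative assumptions. The sketch takes "the numbers that occur before m" to mean the lower half of the ordered list, leaving out the middle element when the count is odd.

```python
from statistics import median

def five_number_summary(data):
    """Return (smallest, lower quartile, median, upper quartile, largest).

    Assumes at least two values. The quartiles are the medians of the
    lower and upper halves of the ordered data; when the count is odd,
    the middle value belongs to neither half.
    """
    s = sorted(data)
    n = len(s)
    lower_half = s[: n // 2]
    upper_half = s[(n + 1) // 2 :]
    return s[0], median(lower_half), median(s), median(upper_half), s[-1]

# Illustrative sets with 11 and 10 elements (neither count is divisible by 4).
print(five_number_summary(range(1, 12)))  # (1, 3, 6, 9, 11)
print(five_number_summary(range(1, 11)))  # (1, 3, 5.5, 8, 10)
```

Other quartile conventions exist (for example, interpolation-based methods in statistical software), so a graphing utility or library may report slightly different quartiles for the same data.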

Example 8 Sketching Box-and-Whisker Plots
Sketch a box-and-whisker plot for each set.
a. 7, 8, 0,,, 0, 0, 6, 6, 6, 66
b. 8, 8, 8, 8, 87, 89, 90, 9, 9, 9, 96, 98, 99
c.,,,, 7, 8, 0,,, 7

a. This set has 11 numbers. The median is 0 (the sixth number). The lower quartile is 0 (the median of the first five numbers). The upper quartile is 6 (the median of the last five numbers). See Figure A.7.
[Figure A.7: box-and-whisker plot]
b. This set has 13 numbers. The median is 90 (the seventh number). The lower quartile is 8 (the median of the first six numbers). The upper quartile is 9. (the median of the last six numbers). See Figure A.8.
[Figure A.8: box-and-whisker plot]
c. This set has 10 numbers. The median is 7. (the average of the fifth and sixth numbers). The lower quartile is (the median of the first five numbers). The upper quartile is (the median of the last five numbers). See Figure A.9.
[Figure A.9: box-and-whisker plot]

A.2 Exercises

In Exercises 1-6, find the mean, median, and mode of the set of measurements.
1. ,, 7,, 8, 9, 7
2. 0, 7,, 9,,,
3. ,, 7,, 8, 9, 7
4. 0, 7,, 9,,,
5. ,, 7,, 9, 7
6. 0, 7,, 9,,
7. Reasoning Compare your answers for Exercises and with those for Exercises and. Which of the measures of central tendency is sensitive to extreme measurements? Explain your reasoning.
8. Reasoning (a) Add 6 to each measurement in Exercise and calculate the mean, median, and mode of the revised measurements. How are the measures of central tendency changed? (b) If a constant k is added to each measurement in a set of data, how will the measures of central tendency change?

9. Electric Bills A person had the following monthly bills for electricity. What are the mean and median of the collection of bills?
January $67.9 February $9.8 March $.00 April $.0 May $7.99 June $6. July $8.76 August $7.98 September $87.8 October $8.8 November $6. December $7.00
10. Car Rental A car rental company kept the following record of the numbers of miles a rental car was driven. What are the mean, median, and mode of this data?
Monday 0 Tuesday 60 Wednesday 0 Thursday 0 Friday 60 Saturday 0
11. Six-Child Families A study was done on families having six children. The table shows the numbers of families in the study with the indicated numbers of girls. Determine the mean, median, and mode of this set of data.
Number of girls 0 6 Frequency 0 9 7
12. Sports A baseball fan examined the records of a favorite baseball player's performance during his last 0 games. The numbers of games in which the player had 0,,,, and hits are recorded in the table.
Number of hits 0 Frequency 6 7
(a) Determine the average number of hits per game. (b) Determine the player's batting average if he had 00 at-bats during the 0-game series.
13. Think About It Construct a collection of numbers that has the following properties. If this is not possible, explain why it is not. Mean 6, median, mode
14. Think About It Construct a collection of numbers that has the following properties. If this is not possible, explain why it is not. Mean 6, median 6, mode
15. Test Scores A professor records the following scores for a 00-point exam.
99, 6, 80, 77, 9, 7, 87, 79, 9, 88, 90,, 0, 89,, 00, 98, 8, 78, 9
Which measure of central tendency best describes these test scores?
16. Shoe Sales A salesman sold eight pairs of men's black dress shoes. The sizes of the eight pairs were as follows: 0 8,, 0,, and 0, 0, 9,. Which measure (or measures) of central tendency best describes the typical shoe size for this data?

In Exercises 17-24, find the mean \(\bar{x}\), variance \(v\), and standard deviation \(\sigma\) of the set.
7., 0, 8, 8.,, 6, 9, 9. 0,,,,,,,, 0.,,,,,.,,,,, 6, 7.,,,,,. 9, 6, 0, 9,, 70.., 0.,., 0.7, 0.8

In Exercises 25-30, use the alternative formula to find the standard deviation of the set.
.,, 6, 6,, 6. 0,, 0, 6,,, 9, 7. 6, 6, 7, 67, 9, 9 8. 6.0, 9.,., 8.7, 0. 9. 8., 6.9,.7,., 6. 0. 9.0, 7.,., 7., 6.0

In Exercises 31 and 32, line plots of sets of data are given. Determine the mean and standard deviation of each set.
31. [Line plots (a), (b), (c), and (d)]

32. [Line plots (a), (b), (c), and (d)]
33. Reasoning Without calculating the standard deviation, explain why the set,, 0, 0 has a standard deviation of 8.
34. Reasoning If the standard deviation of a set of numbers is 0, what does this imply about the set?
35. Test Scores An instructor adds five points to each student's exam score. Will this change the mean or standard deviation of the exam scores? Explain.
36. Price of Gold The following data represents the average prices of gold (in dollars per fine ounce) for the years 1981 to 2000. Use a computer or graphing utility to find the mean, variance, and standard deviation of the data. What percent of the data lies within two standard deviations of the mean? (Source: U.S. Bureau of Mines and U.S. Geological Survey)
60, 76,, 6, 8, 68, 78, 8, 8, 8, 6,, 6, 8, 86, 89,, 9, 80, 80
37. Think About It The histograms represent the test scores of two classes of a college course in mathematics. Which histogram has the smaller standard deviation?
[Two histograms of test scores; horizontal axis: Score; vertical axis: Frequency]
38. Test Scores The scores of a mathematics exam given to 600 science and engineering students at a college had a mean and standard deviation of and 8, respectively. Use Chebychev's Theorem to determine the intervals containing at least 3/4 and at least 8/9 of the scores. How would the intervals change if the standard deviation were 6?

In Exercises 39-42, sketch a box-and-whisker plot for the data without the aid of a graphing utility.
9.,,,,,,, 0, 0., 0,,, 7, 6,,, 8,, 0. 6, 8, 8, 0,, 7,, 7, 9,., 0,, 8,, 8,, 9, 7, 9, 8,

In Exercises 43-46, use a graphing utility to create a box-and-whisker plot for the data.
. 9,,, 9,,, 7,, 9,, 0, 9. 9,,,, 6,,,, 7, 0, 7,, 8, 9, 9. 0.,.,.9,.9,.,.,.,.,.7, 7.,.8,., 7., 6.,.8 6. 78., 76., 07., 78., 9., 90., 77.8, 7., 97., 7., 8.8, 6.6

47. Product Lifetime A company has redesigned a product in an attempt to increase the lifetime of the product. The two sets of data list the lifetimes (in months) of 0 units with the original design and 0 units with the new design. Create a box-and-whisker plot for each set of data, and then comment on the differences between the plots.
Original Design. 78. 6. 68.9 0.6 7...7 7.7 0..0..0 8. 8. 0.8 8. 8. 0.0.6
New Design.8 7..6 9.0. 7. 60.0. 8.9 80. 6.7. 67.9. 99..0...8 87.8