MEASURES OF CENTRAL TENDENCY INTRODUCTION

Size: px
Start display at page:

Download "MEASURES OF CENTRAL TENDENCY INTRODUCTION"

Transcription

1 2 MEASURES OF CENTRAL TENDENCY INTRODUCTION A requency distribution, in general, shows clustering o the data around some central value. Finding o this central value or the average is o importance, as it gives the most representative value o the whole group. Dierent methods give dierent averages which are known as the measures o central tendency. The commonly used measures o central values are mean, median, mode, geometric mean and harmonic mean. There are two main objectives o the study o averages. (i) To get single value that describes the characteristic o the entire group. Measures o central value, by condensing the mass o data in one single value, enable us to get a bird s-eye view o the the entire data. (ii) To acilitate comparison, measures o central value, by reducing the mass o data to one single igure, enable comparison to be made. Comparison can be made either at a point o time or over a period o time. For example, we can compare the percentage results o the students o dierent colleges in a certain examination. Since an average is a single value representing a group o values, it is desired that the value satisies the ollowing properties: (i) Easy to understand (ii) Simple to compute (iii) Based on all the items (iv) Not be undully aected by extreme observations (v) Capable o urther algebraic treatment (vi) Sampling stability Deinition A numerical expression which is used to present a whole series should neither have the lowest value nor the heighest value in the series, but a value somewhere between these two limits, possibly in the centre, where most o the items o the series cluster, such igures are called measures o central tendency or measures o locations or averages.

2 40 Statistics or Managers GM = [(x 1 ) 1 (x 2 ) (xn) n ] n \ log GM = n 1 [ 1 log x log x n log x n ] \ GM = Antilog ( S i log x i n ) Harmonic Mean I x 1, x 2, x 3... x n be a set o n observations then the harmonic mean is deined as the reciprocal o the (arithmetic) mean o the reciproculs o the quantities. 1 Thus, HM = 1 n ( x x x n ) where In a requency distribution harmonic mean is given by 1 HM = 1 N ( 1 x x n 2 x n ) MEDIAN N = Â i I the values o the variable are arranged in the ascending or descending order o magnitude, the median is the middle item i the number is odd and is the mean o the two middle items i the number is even. Thus, the median is equal to the mid-value i.e. the value which divides the total requency into two equal parts. For computation o median, it is necessary that the items be arranged in ascending order. 1. Ungrouped requency distribution: Arrange the n values o the variate in ascending or descending order. (a) When n is odd, the middle value, i.e. ( n ) th value gives the median. (b) When n is even, there are two middle values n 2 th and ( n ) th. The arithmetic mean o these two values gives the median. 2. For grouped requency distribution ( N 2 C ) Median = l + h where l is the lower limit o the median class N is the total requency is the requency o this class h is the width o the class and C is the cumulative requency upto the class preceding the median class.

3 44 Statistics or Managers S i = 90 S i u i = 13 Here h = 8 A = 32.5 Mean = A + hs i u i S i = = Example 4 The mean o 200 items was 50. Later on, it was ound that two items were read as 92 and 8 instead o 192 and 88. Find the correct mean. Solution Here n = 200 and incorrect value o _ x = 50 Since _ Sx x = n \ Sx = n _ x \ Incorrect total = = 10,000 \ Correct total = 10,000 (92 + 8) + ( ) = \ Correct mean = = 50.9 Example 5 Find the missing requency rom the ollowing data. Marks No. o students Given AM is 34 marks. Solution Let 1 be the missing requency. Now, AM = _ x = 34 _ S i x i x = S i \ 34 = \ 34 ( ) = \ i = 180 Example 6 Calculate the missing requency rom the ollowing data. Class Frequency

4 Measures o Central Tendency 45 Given AM is 37 Solution Let i be the missing requency. Now AM _ x = 37 _ S i x i x = S i = \ 37(59 + i ) = i \ 37 i 35 i = \ 2 i = 772 \ i = 386 Example 7 Daily income o ten amilies o a particular place is given below. Find geometric mean. 85, 70, 15, 75, 500, 8, 45, 250, 40, 60. Solution To ind GM irst, we calculate log x as ollows. x log x \ S log x = \ GM = Antilog ( S log x n ) = Antilog ( ) = Example 8 Calculate the geometric mean rom the ollowing data. x 125, 1462, 38, 7, 0.22, 0.08, 12.75, 0.5 Solution First ind log x x _ log x \ Sx = and n = 8 \ GM = Antilog ( S log x = n ) = Antilog ( ) Example 9 Find the geometric mean or the ollowing data given below. Marks Frequency Solution To ind GM, we orm the table as ollows.

5 46 Statistics or Managers Class Midpoint m log m log m N = 109 S log m = S \ GM = Antilog log m ( N ) = Antilog ( ) = Antilog (1.2587) = Example 10 Find the harmonic mean rom the ollowing data. x Solution To ind harmonic mean, calculate 1 x. x x Here n = 8 HM = N S 1 x = = Example 11 From the ollowing data, compare the harmonic mean. Marks Students Solution To ind HM we orm the table as ollows. Marks (x) Students () /x Here N = 120 and S/x = \ N HM = S/x = =

6 Example 12 Compute the harmonic mean rom the ollowing data. Measures o Central Tendency 47 Class Frequency Solution To ind the harmonic mean, we orm the table as ollows. Class Freq. Midpoint /m m N = 30 S/m = \ Harmonic mean = N S/m = = Example 13 An incomplete requency distribution is given below Class Frequency Given that the total requency is 229 and median is 46. Find the missing requencies. Solution Let 1 and 2 be the missing requencies o the classes and respectively. Since the median lies in the class ( N 2 C ) Median = l + h [ ( ] ) 46 = [ ( ] ) \ = \ = \ 1 = = \ 1 = 34 And 2 = 229 ( ) = 45 \ Missing requencies are 1 = 34 and 2 = 45

7 48 Statistics or Managers Example 14 Calculate median, the lower quartile, upper quartile, semi-interquartile range and the mode or the ollowing data. Class Marks Frequency Solution We orm the table as ollows. Class Frequency Cumulative requency N = 49 (1) Median ( or N 2 = 49 2 ) = 24.5 alls in the class and is given by N 2 C ) Median = l + h where l = 15, N = 49, C = 11, h = 5 and = 15 ( 49 2 ) 11 \ Median = = 19.5 marks. 15 (2) Lower quartile (Q 1 ) ( or N 4 = ) alls in the class ( N 4 C ) \ Q 1 = l + h where l = 15, N = 49, C = 11, h = 5 and = 15 ( 49 4 ) 11 \ Q 1 = = 15.4 marks 15 3N (3) Upper quartile (Q 3 ) ( or 4 ) = alls in the class N 4 ) C Q 3 = l + h

8 Measures o Central Tendency 49 where l = 25, N = 49, C = 36, h = 5 and = 5 \ Q 3 = 25 + ( ) 36 5 = marks (4) Semi-interquartile range Q = 1 2 (Q 3 Q 1 ) = 1 ( ) = (5) Mode: It is seen that the mode value alls in the class m 1 Mode = l + h 2 m 1 2 where l = 15, m = 15, 1 = 6, 2 = 10 and h= = = 18.2 marks Example 15 From the ollowing marks ind Q 1, Q 2 and Q 3 marks. 23, 48, 34, 68, 15, 36, 24, 54, 65, 75, 92, 10, 70, 61, 20, 47, 83, 19, 77 Solution Let us irst arrange the given data in the ascending order. SN x SN x Here n = 19 (i) Q 1 is the size o ( n ) ( th item = = 5th item = 23 (ii) Q 2 is the size o ( n + 1 (iii) Q 3 is size o 3 ( n ) ( th item ) th item 2 ) th item = 10th item = 48 4 ) = 15th item = 70 4 ) th item = 3 ( Example 16 Find the median value o the ollwing data. 43, 62, 15, 80, 56, 72, 34, 8, 25 Solution Arrange the data in an array orm with serial numbers S. No x

9 50 Statistics or Managers Median is the size o ( n ) th item \ median = 43 = ( ) th Example 17 Find the median value o the ollowing. 36, 5, 19, 26, 6, 28, 56, 18, 63, 4 Solution Arrange the data in an array orm with serial numbers S. No x Median is the size o ( n ) th item = = 5.5th item 2 The item lies in between 5th and 6th item. \ Median = = = 22.5 Example 18 Locate median and quartiles rom the ollowing data. Size o shoes Frequency Solution x CF (1) Q 1 is the size o ( n ) th item = = 80.25th item (lies in 100 CF). 4 Against 100 CF, the value is 5.0 \ Q 1 = 5.

10 (2) Median Q 2 = ( n ) th item = th item (lies in 230 CF). Against 230 CF the value is 6 \ Q 2 = 6 3(n + 1) (3) Q 3 is the size o th item = th item (lies in 260 CF) 4 Against 260 CF, the value is 6.5 \ Q 3 = 6.5 Measures o Central Tendency 51 Example 19 The mean height o 25 male workers in a actory is 61 inches and the mean height o 35 emale workers in the same actory is 58. Find the combined mean o 60 workers in the actory. Solution Given n 1 = 25, _ x 1 = 61, n 2 = 35 and _ x 2 = 58 The combined mean is given by _ x = n 1 _ x 1 + n 2 x 2 n 1 + n = = 3555/ = Thus, the combined mean height is inches. Example 20 Compute mean, median and mode or the ollowing data. Class Freq Solution Form the table as ollows Class Freq Mid values d i = x i A h i d i CF x i A Here S = 90 = N S i d i = 75 h = 10 (1) Mean = A + h S id i S i = = = 33.83

11 52 Statistics or Managers (2) Median ( N 2 = 90 2 ) = 45 alls in the class ( N 2 C ) \ Median = l + h ( 90 2 ) 35 = = = (3) Mode: The maximum requuency is in the class Mode = l + m 1 2 m 1 2 h = = Example 21 Find the numbers whose arithmetic mean is 12.5 and geometric mean is 10. Solution Let the two numbers be a and b Then, AM = a + b and GM = ab 2 a + b \ = 12.5 ab = 10 2 \ a + b = 25 or, ab = 100 Consider (a + b) 2 (a b) 2 = 4ab \ (25) 2 (a b) 2 = \ (a b) = = 225 \ a b = 15. \ a + b = 25 Solving, a = 20, b = 5 a b = 15 Example 22 An aeroplane covers the our sides o a square at speeds o 1,000, 2,000, 3000 and 4000 km per hour respectively. What is the average speed o the plane in its light around the square? Solution For arithmetic man, we compute as _ x = = 2,500 km per hour. 4

12 Measures o Central Tendency 53 However, that is not the correct answer in such a problem harmonic mean is an appropriate average. 4 HM = = 25 12, ,000 = = 1,920 km per hour. 25 Note: The harmonic mean is a measure o central tendency or data expressed as rates such as km per hour, km per litre, hours per semester tonnes per month, etc. Example 23 The mean annual salaries paid to 100 employees o a company was Rs. 5,000. The mean annual salaries paid to male and emale employees were Rs. 5,200 and Rs. 4,200 respectively. Determine the percentage o males and emales employed by the company. Solution Let n 1 represent percentage o males and n 2 represent precentage o emales so that n 1 + n 2 = 100 Given, combined mean _ x = 5,000, _ x 1 = 5,200 and _ x 2 = 4200 Consider, _ x = \ 5000 = n _ 1 x 1 + n 2 x 2 n 1 + n 2 n n n 1 + n 2 \ (n 1 + n 2 ) 5000 = 5200 n n 2 \ 5000 [n 1 + (100 n 1 )] = 5200 n (100 n 1 ) 5000 n n 1 = 5200 n n 1 \ 1,000 n 1 = 80,000 \ n 1 = 80 and n 2 = 20 Thus, the percentage o males and emales employed is 80 and 20 respectively. General merits and uses o Averages So ar we have discussed the methods o computing the various types o averages and also their distinctive eatures. At this point, we have a problem which o these is the best average to be used. To have a solution to this we must know the ollowing: (i) The purpose which the average is designed to serve. (ii) Would the average be used or urther computations? (iii) The type o data available. I they are badly skewed (avoid the mean). gappy around the middle (avoid the median) and unequal in class interval (avoid the mode). (iv) The typical value required in the particular problem. Within the ramework o descriptive statistics the main requirement is to know what each average means and then select one that uluils the purpose. (v) In some cases it is required to work out more than one averages and present them.

13 54 Statistics or Managers Median The median is the best average in open-end grouped distribution especially where i plotted as a requency curve. For example, in case o price distribution or income distribution. In such cases very high or very low values would cause the mean to be higher to lower than the most common values. In such cases the median or middle value o the series may be a more representative igure to use in describing the mass o data. Mode Mode can be used to describe quantitative data. The mode can be used in problems involving the expression o preerences where quantitative measurements are not possible. Thus, the preerred type o package design among a number o alternative design would be the model design. I we want to compare consumer preerences or dierent kinds o products or dierent kinds o advertising we can compare the model preerences expressed by dierent groups o people but we cannot calculate the median or mean. Mode is a particularly useul average or discrete, series and best suited or large requency. Geometric mean Geometric mean is useul in averaging ratio and percentages and in computing average rates o increase or decrease. It is particularly important in economics and business statistics in index number construction. Harmonic mean Harmonic mean is useul in problems in which values o a variable are compared with a constant quantity o another variable, i.e. rates distance covered within certain time and quantity purchased or sold per unit, etc. In the ollowing cases arithmetic mean should not be used. (a) In highly skewed distributions. (b) In distributions with open end intervals. (c) When the distribution is unevenly spread. (d) The arithmetic mean should not be used to average ratios and rates o change. In such cases, the geometric mean is more suitable. (e) When there are very large and very small items, arithmetic mean would be seriously misleading on account o undue inluence rom extreme items. General demerits o averages: (a) Since an average is a single value representing a group o values. Otherwise there is every possibility o jumping to wrong conclusions. For example, a person had to cross the river rom one bank to another. He was not aware o the depth o the river so he enquired o another man who told him that the average depth o water is 5 t. 4 inches. The man was 5 t. and 8 inches and he thought that he can very easily cross the river because at all times the would be above the level o water so he started. In the beginning, the level o water was very low but it as he reached the middle the water was 15 t. deep and he lost his lie. The man was drowned because he had a misconception that average depth means uniorm depth throughout but it is not so. An average represents a group o values and lies somewhere in between the two extremes, i.e. the largest and the smallest items o the series. (b) An average may give us a value that does not exist in the data. For example, the arithmetic mean o 100, 300, 250, 50 and 100 is 160 a value that does not exist in the data. (c) At times the average may give a very absurd result. For example. It we calculate size o a amily we may get a value 5.6. But this is impossible as persons cannot be in ractions. However, we should remember that it is an average value representing the entire group.

14 Measures o Central Tendency 55 (d) Measures o central tendency ail to give an idea about the ormation o the series. Two or more series may have the same central value but may dier widely in composition. (e) We must remember that an average is a measure o central tendency. Hence, unless the data show a clear single concentration o observations an average may not be meaningul at all. This evidently precludes the use o any average to typiy a bimodal, a U-shaped or J-shaped distribution. QUESTION BANK A. Choose the correct answer rom the given alternatives. 1. Dierent methods give dierent averages which are known as the (a) measures o central tendency (b) statistics (c) measures o dispersion (d) skewness 2. I x 1, x 2, x 3 x n are a set o n values o a variate, then the mean is given by (a) n Sx i (b) Sx i n n 1 x 1 + n 2 x 2 n 1 + n 2 (c) nsx i (d) 3. I x 1, x 2 x 3 x n are a set o n observations then the geometric mean is n (a) n Slog x i (b) Slog x i (c) Slog x i n (d) Antilog Slog x i n 4. I the values o the variables are arranged in ascending order o magnitude, the middle term is (a) mean (b) mode (c) median (d) quartile 5. In a grouped distribution, the value o the variable o maximum requency is called (a) mode (b) mean (c) median (d) quartile 6. In a symmetrical distribution, the mean, median and mode (a) dier (b) coincide (c) mean-median = mode (d) dier by In case o individual observations and discrete series, the size can be determined by using the ( n ) th item or (a) D 10 (b) D 9 (c) D 1 (d) D 2 8. For grouped requency distribution the mode is calculated by using the ormula m 1 (a) l + h (b) l 2 m 1 2 n + m 1 1 2

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

Module 4: Data Exploration

Module 4: Data Exploration Module 4: Data Exploration Now that you have your data downloaded from the Streams Project database, the detective work can begin! Before computing any advanced statistics, we will first use descriptive

More information

Summarizing and Displaying Categorical Data

Summarizing and Displaying Categorical Data Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency

More information

Means, standard deviations and. and standard errors

Means, standard deviations and. and standard errors CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo) COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY Prepared by: Jess Roel Q. Pesole CENTRAL TENDENCY -what is average or typical in a distribution Commonly Measures: 1. Mode. Median 3. Mean quantified

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

AP * Statistics Review. Descriptive Statistics

AP * Statistics Review. Descriptive Statistics AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

THE MODELING AND CALCULATION OF SOUND RADIATION FROM FACILITIES WITH GAS FLOWED PIPES INTRODUCTION

THE MODELING AND CALCULATION OF SOUND RADIATION FROM FACILITIES WITH GAS FLOWED PIPES INTRODUCTION THE MODELING AND CALCULATION OF SOUND ADIATION FOM FACILITIES WITH GAS FLOWED PIPES INTODUCTION Analysis o the emission caused by industrial acilities like chemical plants, reineries or other production

More information

Lesson outline 1: Mean, median, mode

Lesson outline 1: Mean, median, mode 4. Give 5 different numbers such that their average is 21. The numbers are: I found these numbers by: 5. A median of a set of scores is the value in the middle when the scores are placed in order. In case

More information

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

More information

Statistics Revision Sheet Question 6 of Paper 2

Statistics Revision Sheet Question 6 of Paper 2 Statistics Revision Sheet Question 6 of Paper The Statistics question is concerned mainly with the following terms. The Mean and the Median and are two ways of measuring the average. sumof values no. of

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Lesson 4 Measures of Central Tendency

Lesson 4 Measures of Central Tendency Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

Measures of Central Tendency and Variability: Summarizing your Data for Others

Measures of Central Tendency and Variability: Summarizing your Data for Others Measures of Central Tendency and Variability: Summarizing your Data for Others 1 I. Measures of Central Tendency: -Allow us to summarize an entire data set with a single value (the midpoint). 1. Mode :

More information

Northumberland Knowledge

Northumberland Knowledge Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Chapter 1 Review 1. As part of survey of college students a researcher is interested in the variable class standing. She records a 1 if the student is a freshman, a 2 if the student

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics

More information

3: Summary Statistics

3: Summary Statistics 3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes

More information

AP Statistics Solutions to Packet 2

AP Statistics Solutions to Packet 2 AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that

More information

10 20 30 40 50 60 Mark. Use this information and the cumulative frequency graph to draw a box plot showing information about the students marks.

10 20 30 40 50 60 Mark. Use this information and the cumulative frequency graph to draw a box plot showing information about the students marks. GCSE Exam Questions on Frequency (Grade B) 1. 200 students took a test. The cumulative graph gives information about their marks. 200 160 120 80 0 10 20 30 50 60 Mark The lowest mark scored in the test

More information

Describing and presenting data

Describing and presenting data Describing and presenting data All epidemiological studies involve the collection of data on the exposures and outcomes of interest. In a well planned study, the raw observations that constitute the data

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

1.2 GRAPHS OF EQUATIONS. Copyright Cengage Learning. All rights reserved.

1.2 GRAPHS OF EQUATIONS. Copyright Cengage Learning. All rights reserved. 1.2 GRAPHS OF EQUATIONS Copyright Cengage Learning. All rights reserved. What You Should Learn Sketch graphs of equations. Find x- and y-intercepts of graphs of equations. Use symmetry to sketch graphs

More information

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

Calculation example mean, median, midrange, mode, variance, and standard deviation for raw and grouped data

Calculation example mean, median, midrange, mode, variance, and standard deviation for raw and grouped data Calculation example mean, median, midrange, mode, variance, and standard deviation for raw and grouped data Raw data: 7, 8, 6, 3, 5, 5, 1, 6, 4, 10 Sorted data: 1, 3, 4, 5, 5, 6, 6, 7, 8, 10 Number of

More information

FIXED INCOME ATTRIBUTION

FIXED INCOME ATTRIBUTION Sotware Requirement Speciication FIXED INCOME ATTRIBUTION Authors Risto Lehtinen Version Date Comment 0.1 2007/02/20 First Drat Table o Contents 1 Introduction... 3 1.1 Purpose o Document... 3 1.2 Glossary,

More information

Rainfall generator for the Meuse basin

Rainfall generator for the Meuse basin KNMI publication; 196 - IV Rainall generator or the Meuse basin Description o 20 000-year simulations R. Leander and T.A. Buishand De Bilt, 2008 KNMI publication = KNMI publicatie; 196 - IV De Bilt, 2008

More information

Manufacturing and Fractional Cell Formation Using the Modified Binary Digit Grouping Algorithm

Manufacturing and Fractional Cell Formation Using the Modified Binary Digit Grouping Algorithm Proceedings o the 2014 International Conerence on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Manuacturing and Fractional Cell Formation Using the Modiied Binary

More information

Descriptive statistics parameters: Measures of centrality

Descriptive statistics parameters: Measures of centrality Descriptive statistics parameters: Measures of centrality Contents Definitions... 3 Classification of descriptive statistics parameters... 4 More about central tendency estimators... 5 Relationship between

More information

How To: Analyse & Present Data

How To: Analyse & Present Data INTRODUCTION The aim of this How To guide is to provide advice on how to analyse your data and how to present it. If you require any help with your data analysis please discuss with your divisional Clinical

More information

How To Write A Data Analysis

How To Write A Data Analysis Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

Is the trailing-stop strategy always good for stock trading?

Is the trailing-stop strategy always good for stock trading? Is the trailing-stop strategy always good or stock trading? Zhe George Zhang, Yu Benjamin Fu December 27, 2011 Abstract This paper characterizes the trailing-stop strategy or stock trading and provides

More information

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 OVERVIEW STATISTICS PANIK...THE THEORY AND METHODS OF COLLECTING, ORGANIZING, PRESENTING, ANALYZING, AND INTERPRETING DATA SETS SO AS TO DETERMINE THEIR ESSENTIAL

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

MEASURES OF CENTER AND SPREAD MEASURES OF CENTER 11/20/2014. What is a measure of center? a value at the center or middle of a data set

MEASURES OF CENTER AND SPREAD MEASURES OF CENTER 11/20/2014. What is a measure of center? a value at the center or middle of a data set MEASURES OF CENTER AND SPREAD Mean and Median MEASURES OF CENTER What is a measure of center? a value at the center or middle of a data set Several different ways to determine the center: Mode Median Mean

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13

CHAPTER THREE COMMON DESCRIPTIVE STATISTICS COMMON DESCRIPTIVE STATISTICS / 13 COMMON DESCRIPTIVE STATISTICS / 13 CHAPTER THREE COMMON DESCRIPTIVE STATISTICS The analysis of data begins with descriptive statistics such as the mean, median, mode, range, standard deviation, variance,

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques

More information

FACTOR COMPONENTS OF INEQUALITY. Cecilia García-Peñalosa and Elsa Orgiazzi GINI DISCUSSION PAPER 12 JULY 2011 GROWING INEQUALITIES IMPACTS

FACTOR COMPONENTS OF INEQUALITY. Cecilia García-Peñalosa and Elsa Orgiazzi GINI DISCUSSION PAPER 12 JULY 2011 GROWING INEQUALITIES IMPACTS FACTOR COMPONENTS OF INEQUALITY Cecilia García-Peñalosa and Elsa Orgiazzi GINI DISCUSSION PAPER 12 JULY 2011 GROWING INEQUALITIES IMPACTS Acknowledgement This research was partly supported by the Institut

More information

Power functions: f(x) = x n, n is a natural number The graphs of some power functions are given below. n- even n- odd

Power functions: f(x) = x n, n is a natural number The graphs of some power functions are given below. n- even n- odd 5.1 Polynomial Functions A polynomial unctions is a unction o the orm = a n n + a n-1 n-1 + + a 1 + a 0 Eample: = 3 3 + 5 - The domain o a polynomial unction is the set o all real numbers. The -intercepts

More information

LINEAR EQUATIONS IN TWO VARIABLES

LINEAR EQUATIONS IN TWO VARIABLES 66 MATHEMATICS CHAPTER 4 LINEAR EQUATIONS IN TWO VARIABLES The principal use of the Analytic Art is to bring Mathematical Problems to Equations and to exhibit those Equations in the most simple terms that

More information

19 - RAY OPTICS Page 1 ( Answers at the end of all questions )

19 - RAY OPTICS Page 1 ( Answers at the end of all questions ) 19 - RAY OPTICS Page 1 1 ) A ish looking up through the water sees the outside world contained in a circular horizon. I the reractive index o water is 4 / 3 and the ish is 1 cm below the surace, the radius

More information

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence

More information

The Seven Basic Tools. QUALITY CONTROL TOOLS (The Seven Basic Tools) What are check sheets? CHECK SHEET. Illustration (Painting defects)

The Seven Basic Tools. QUALITY CONTROL TOOLS (The Seven Basic Tools) What are check sheets? CHECK SHEET. Illustration (Painting defects) QUALITY CONTROL TOOLS (The Seven Basic Tools) Dr. Ömer Yağız Department of Business Administration Eastern Mediterranean University TRNC The Seven Basic Tools The seven basic tools are: Check sheet Flow

More information

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics. Business Course Text Bowerman, Bruce L., Richard T. O'Connell, J. B. Orris, and Dawn C. Porter. Essentials of Business, 2nd edition, McGraw-Hill/Irwin, 2008, ISBN: 978-0-07-331988-9. Required Computing

More information

Algebra 2 Chapter 1 Vocabulary. identity - A statement that equates two equivalent expressions.

Algebra 2 Chapter 1 Vocabulary. identity - A statement that equates two equivalent expressions. Chapter 1 Vocabulary identity - A statement that equates two equivalent expressions. verbal model- A word equation that represents a real-life problem. algebraic expression - An expression with variables.

More information

Statistics 151 Practice Midterm 1 Mike Kowalski

Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and

More information

Section 3: Examining Center, Spread, and Shape with Box Plots

Section 3: Examining Center, Spread, and Shape with Box Plots Section 3: Examining Center, Spread, and Shape with Box Plots Q32. So far with my examination of the data, most of the data seems to be skewed. Expenditure per student and revenue per student are both

More information

Big Ideas in Mathematics

Big Ideas in Mathematics Big Ideas in Mathematics which are important to all mathematics learning. (Adapted from the NCTM Curriculum Focal Points, 2006) The Mathematics Big Ideas are organized using the PA Mathematics Standards

More information

Bar Graphs and Dot Plots

Bar Graphs and Dot Plots CONDENSED L E S S O N 1.1 Bar Graphs and Dot Plots In this lesson you will interpret and create a variety of graphs find some summary values for a data set draw conclusions about a data set based on graphs

More information

Variables. Exploratory Data Analysis

Variables. Exploratory Data Analysis Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is

More information

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: What do the data look like? Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses

More information

CHAPTER THREE. Key Concepts

CHAPTER THREE. Key Concepts CHAPTER THREE Key Concepts interval, ordinal, and nominal scale quantitative, qualitative continuous data, categorical or discrete data table, frequency distribution histogram, bar graph, frequency polygon,

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

CHAPTER 14 STATISTICS. 14.1 Introduction

CHAPTER 14 STATISTICS. 14.1 Introduction 238 MATHEMATICS STATISTICS CHAPTER 14 14.1 Introduction Everyday we come across a lot of information in the form of facts, numerical figures, tables, graphs, etc. These are provided by newspapers, televisions,

More information

Ch. 3.1 # 3, 4, 7, 30, 31, 32

Ch. 3.1 # 3, 4, 7, 30, 31, 32 Math Elementary Statistics: A Brief Version, 5/e Bluman Ch. 3. # 3, 4,, 30, 3, 3 Find (a) the mean, (b) the median, (c) the mode, and (d) the midrange. 3) High Temperatures The reported high temperatures

More information

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics Course Text Business Statistics Lind, Douglas A., Marchal, William A. and Samuel A. Wathen. Basic Statistics for Business and Economics, 7th edition, McGraw-Hill/Irwin, 2010, ISBN: 9780077384470 [This

More information

Algebra I Vocabulary Cards

Algebra I Vocabulary Cards Algebra I Vocabulary Cards Table of Contents Expressions and Operations Natural Numbers Whole Numbers Integers Rational Numbers Irrational Numbers Real Numbers Absolute Value Order of Operations Expression

More information

Mathematical Conventions Large Print (18 point) Edition

Mathematical Conventions Large Print (18 point) Edition GRADUATE RECORD EXAMINATIONS Mathematical Conventions Large Print (18 point) Edition Copyright 2010 by Educational Testing Service. All rights reserved. ETS, the ETS logo, GRADUATE RECORD EXAMINATIONS,

More information

Mathematical Conventions. for the Quantitative Reasoning Measure of the GRE revised General Test

Mathematical Conventions. for the Quantitative Reasoning Measure of the GRE revised General Test Mathematical Conventions for the Quantitative Reasoning Measure of the GRE revised General Test www.ets.org Overview The mathematical symbols and terminology used in the Quantitative Reasoning measure

More information

Sampling and Descriptive Statistics

Sampling and Descriptive Statistics Sampling and Descriptive Statistics Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University Reference: 1. W. Navidi. Statistics for Engineering and Scientists.

More information

Data Exploration Data Visualization

Data Exploration Data Visualization Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select

More information

Accommodating Transient Connectivity in Ad Hoc and Mobile Settings

Accommodating Transient Connectivity in Ad Hoc and Mobile Settings Accommodating Transient Connectivity in Ad Hoc and Mobile Settings Radu Handorean, Christopher Gill and Gruia-Catalin Roman Department o Computer Science and Engineering Washington University in St. Louis

More information

consider the number of math classes taken by math 150 students. how can we represent the results in one number?

consider the number of math classes taken by math 150 students. how can we represent the results in one number? ch 3: numerically summarizing data - center, spread, shape 3.1 measure of central tendency or, give me one number that represents all the data consider the number of math classes taken by math 150 students.

More information

DATA INTERPRETATION AND STATISTICS

DATA INTERPRETATION AND STATISTICS PholC60 September 001 DATA INTERPRETATION AND STATISTICS Books A easy and systematic introductory text is Essentials of Medical Statistics by Betty Kirkwood, published by Blackwell at about 14. DESCRIPTIVE

More information

THE BINOMIAL DISTRIBUTION & PROBABILITY

THE BINOMIAL DISTRIBUTION & PROBABILITY REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution

More information

2010 Solutions. a + b. a + b 1. (a + b)2 + (b a) 2. (b2 + a 2 ) 2 (a 2 b 2 ) 2

2010 Solutions. a + b. a + b 1. (a + b)2 + (b a) 2. (b2 + a 2 ) 2 (a 2 b 2 ) 2 00 Problem If a and b are nonzero real numbers such that a b, compute the value of the expression ( ) ( b a + a a + b b b a + b a ) ( + ) a b b a + b a +. b a a b Answer: 8. Solution: Let s simplify the

More information

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple. Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

Physics Notes Class 11 CHAPTER 3 MOTION IN A STRAIGHT LINE

Physics Notes Class 11 CHAPTER 3 MOTION IN A STRAIGHT LINE 1 P a g e Motion Physics Notes Class 11 CHAPTER 3 MOTION IN A STRAIGHT LINE If an object changes its position with respect to its surroundings with time, then it is called in motion. Rest If an object

More information

Algebra 1 Course Information

Algebra 1 Course Information Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through

More information

Introduction; Descriptive & Univariate Statistics

Introduction; Descriptive & Univariate Statistics Introduction; Descriptive & Univariate Statistics I. KEY COCEPTS A. Population. Definitions:. The entire set of members in a group. EXAMPLES: All U.S. citizens; all otre Dame Students. 2. All values of

More information

Name: Date: Use the following to answer questions 2-3:

Name: Date: Use the following to answer questions 2-3: Name: Date: 1. A study is conducted on students taking a statistics class. Several variables are recorded in the survey. Identify each variable as categorical or quantitative. A) Type of car the student

More information

How do you compare numbers? On a number line, larger numbers are to the right and smaller numbers are to the left.

How do you compare numbers? On a number line, larger numbers are to the right and smaller numbers are to the left. The verbal answers to all of the following questions should be memorized before completion of pre-algebra. Answers that are not memorized will hinder your ability to succeed in algebra 1. Number Basics

More information

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Options on Stock Indices, Currencies and Futures

Options on Stock Indices, Currencies and Futures Options on Stock Indices, Currencies and utures It turns out that options on stock indices, currencies and utures all have something in common. In each o these cases the holder o the option does not get

More information

Exploratory Data Analysis. Psychology 3256

Exploratory Data Analysis. Psychology 3256 Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find

More information

Interpreting Data in Normal Distributions

Interpreting Data in Normal Distributions Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,

More information

2 Describing, Exploring, and

2 Describing, Exploring, and 2 Describing, Exploring, and Comparing Data This chapter introduces the graphical plotting and summary statistics capabilities of the TI- 83 Plus. First row keys like \ R (67$73/276 are used to obtain

More information

Financial Services [Applications]

Financial Services [Applications] Financial Services [Applications] Tomáš Sedliačik Institute o Finance University o Vienna tomas.sedliacik@univie.ac.at 1 Organization Overall there will be 14 units (12 regular units + 2 exams) Course

More information