Chapter 3: Data Description Numerical Methods


 Miles Patterson
 10 months ago
Learning Objectives
Upon successful completion of Chapter 3, you will be able to:
  Summarize data using measures of central tendency, such as the mean, median, mode, and midrange.
  Describe data using measures of variation, such as the range, variance, and standard deviation.
  Identify the position of a data value in a data set, using various measures of position, such as percentiles, deciles, and quartiles.
  Use the techniques of exploratory data analysis, including boxplots and five-number summaries, to discover various aspects of data.

I. Basic Vocabulary
A. Statistic vs. Parameter
A statistic is a numerical characteristic or numerical summary obtained by using the data values from a sample.
A parameter is a numerical characteristic or numerical summary obtained by using all the data values for the entire population.

B. Numerical Summaries of Quantitative Data
1. Measures of the average or center: mean, median, mode, and midrange.
2. Measures of variation (spread, variability, or dispersion): range, variance, and standard deviation.

THE GENERAL ROUNDING RULE: Always round to one more place than the data when the final answer is computed.

C. Notation for numerical summaries (the symbol indicates whether the summary is a parameter or a statistic)
  N: population size
  n: sample size
  $\mu$: population mean
  $\bar{x}$: sample mean
  Note: The mean is found the same way for the sample or population.

Dr. Janet Winter, Stat 200
  $\sigma^2$: population variance
  $\sigma$: population standard deviation
  $s^2$: sample variance
  $s$: sample standard deviation
  $p$: population proportion
  $\hat{p}$: sample proportion
  $x$: a value of the variable or an answer to the question asked

II. Measures of the Center
A. Mean
I. Mean (ungrouped data)
To calculate the mean, take the sum of all data values, and then divide by the number of values:
  Sample mean: $\bar{x} = \frac{\sum x}{n}$
  Population mean: $\mu = \frac{\sum x}{N}$
Note: The mean is found the same way for the sample and population.

Example: Mean I
Note: The answer is rounded to one more place than the data.

Example: Mean II
Note: The answer is rounded to one more place than the data.
II. Mean for Grouped Data
To calculate the mean of grouped data:
1. Use the class midpoint ($x_i$) for each class.
2. Use the class frequency ($f$) for each class with the formula
  $\bar{x} = \frac{\sum x_i f}{n}$, where $n = \sum f$.

a) Example: Mean for Grouped Data
  Class | Midpoint (x) | Frequency (f) | xf
  Totals
Note: The answer is rounded to one more place than the data.

III. Weighted Mean
A weighted mean has an additional factor or weight ($w$) for each value:
  $\bar{x}_w = \frac{\sum xw}{\sum w}$

a) Example: Weighted Mean
PSU Grade Point Average (GPA): grades are weighted by their quality points.
  Course  | Credit (w) | Grade (x) | xw
  English | 3.0        | B         |
  Stat    | 4.0        | A         |
  History | 3.0        | C         |
  Totals
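The three mean formulas above can be sketched in a few lines of Python. This is only an illustration: the function names and the small data sets are hypothetical, and the grade-to-number mapping (A = 4.0, B = 3.0, C = 2.0) is an assumption, since the numeric grade values did not survive in the table.

```python
def mean(values):
    """Ungrouped mean: sum of the values divided by the number of values."""
    return sum(values) / len(values)


def grouped_mean(midpoints, frequencies):
    """Grouped mean: sum of midpoint * frequency divided by n = sum of frequencies."""
    n = sum(frequencies)
    return sum(x * f for x, f in zip(midpoints, frequencies)) / n


def weighted_mean(values, weights):
    """Weighted mean: sum of value * weight divided by the sum of the weights."""
    return sum(x * w for x, w in zip(values, weights)) / sum(weights)


# Hypothetical ungrouped data.
print(mean([2, 4, 6, 8]))                       # 5.0

# Hypothetical grouped data: three classes with midpoints 5, 15, 25.
print(grouped_mean([5, 15, 25], [2, 5, 3]))     # 16.0

# GPA sketch: credits 3.0, 4.0, 3.0 from the table; grade values are ASSUMED
# to be B = 3.0, A = 4.0, C = 2.0.
gpa = weighted_mean([3.0, 4.0, 2.0], [3.0, 4.0, 3.0])
print(round(gpa, 2))                            # 3.1
```

Under that assumed grade scale the weighted mean is (9 + 16 + 6)/10 = 3.1, which differs from the plain average of the three grades (3.0) because the four-credit course counts more.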
B. Median (MD or $\tilde{x}$)
I. Median
To calculate the median, place the data in increasing order and find the value in the center of the ordered list.

II. Median: The middle value in an ordered list of data.
It is the value with the same number of data values above and below it.
It is used for data sets with outliers. In the absence of outliers, use the mean.

Process:
1. Order the data values from the smallest to the largest.
2. When the sample size n is odd, the median is the data value located in the exact middle.
3. When the sample size n is even, there are two data values in the middle. The median is the average of the two data values in the middle.

Example: Median (n odd)
Answer: 14 is the data value in the center of the ordered data.

Example: Median (n even)
The middle of the ordered data is between 14 and 23. Solve for the average:
(14 + 23)/2 = 37/2 = 18.5
MD = 18.5 (round to one more place, or tenths)

C. Mode
For ungrouped data, the mode is the value that occurs most often in a data set.
For grouped data, the mode is the class with the highest frequency and is called the modal class.
  Bimodal: two classes, each with the largest frequency, OR two data values, each with the largest frequency.
  No mode: no value is repeated.
  Multimodal: more than two data values or more than two classes with the same greatest frequency.
There is no symbol for the mode.
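The median process and the mode definitions above translate directly into code. The following is a minimal sketch; the function names and the data lists are hypothetical (the lists were chosen so that the middle values are 14 and 23, as in the examples, but they are not the original example data).

```python
from collections import Counter


def median(values):
    """Order the data, then take the middle value (odd n) or the
    average of the two middle values (even n)."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


def modes(values):
    """Return every value tied for the highest frequency.
    An empty list means no value is repeated (no mode)."""
    counts = Counter(values)
    highest = max(counts.values())
    if highest == 1:
        return []
    return sorted(v for v, c in counts.items() if c == highest)


print(median([23, 7, 14, 30, 9]))        # 14   (odd n)
print(median([23, 7, 14, 30, 9, 41]))    # 18.5 (even n: (14 + 23) / 2)
print(modes([4, 4, 7, 7, 9]))            # [4, 7] (bimodal)
print(modes([1, 2, 3]))                  # []     (no mode)
```

Both functions assume a non-empty list; an empty input would raise an error rather than return a value.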
Question 1
When a person says that the average age of a group of workers is 35, the average
a) is the mean of the ages.
b) is the median of the ages.
c) could be either the mean or the median of the ages.
d) do not know.

Question 2
If we are taking a test and we wish to score in the upper half of the students, then we wish to be higher than the
a) mean of the test scores.
b) median of the test scores.
c) either the mean or the median of the test scores.
d) do not know.

D. Midrange (MR)
I. Midrange (MR): The value in the middle of the range, that is, the value midway between the lowest and highest data values.
  $MR = \frac{\text{lowest value} + \text{highest value}}{2}$

II. Example: Midrange
Find the midrange for: 2, 13, 1, 25, 45, 67, ...
1. Order the data.
2. Take the average of 1 (lowest) and 90 (highest).
3. (1 + 90)/2 = 45.5 (round to one more place, or tenths)
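As a quick check of the midrange example, here is a small sketch. The function name is hypothetical, and the second data set is invented to show how a single extreme value moves the midrange.

```python
def midrange(values):
    """Midrange: the average of the lowest and highest data values."""
    return (min(values) + max(values)) / 2


# Only the extremes matter; the example above states them as 1 and 90.
print(midrange([1, 90]))             # 45.5

# Hypothetical data: one extreme value pulls the midrange far from the bulk.
print(midrange([10, 11, 12, 13]))    # 11.5
print(midrange([10, 11, 12, 100]))   # 55.0
```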
E. Comparison of Mean, Median, Mode, and Midrange

  Measure of Center | Mean | Median | Mode | Midrange
  Definition | $\bar{x} = \frac{\sum x}{n}$ | middle value | most frequent data value | (lowest + highest)/2
  How Common? | most familiar average | commonly used | sometimes used | rarely used
  Existence | always exists | always exists | might not exist; may be more than one mode | always exists
  Takes Every Value into Account? | yes | no | no | no
  Affected by Extreme Values? | yes | no | no | yes
  Advantages and Disadvantages | used throughout this book; works well with many statistical methods | often a good choice if there are some extreme values | appropriate for data at the nominal level | very sensitive to extreme values

General Comments:
For a data collection that is approximately symmetric with one mode, the mean, median, mode, and midrange tend to be about the same.
For a data collection that is obviously asymmetric, it would be good to report both the mean and median.
The mean is relatively reliable. That is, when samples are drawn from the same population, the sample means tend to be more consistent than the other measures of center (consistent in the sense that the means of samples drawn from the same population don't vary as much as the other measures of center). (Triola & Triola, 2006)

Question 3
Which measures of the center are influenced by outliers?
a) Mean
b) Median
c) Mode
d) Midrange
e) Both A & D
Question 4
If we tally the votes in an election, then the winner would be the candidate corresponding to
a) the mean of the number of votes.
b) the median of the number of votes.
c) the mode of the number of votes.
d) do not know.

III. Shapes of Distributions
A. Symmetrical
Symmetrical shapes have data values evenly distributed on both sides of the mean. The mean, median, and mode are all equal.

B. Positively skewed or Right skewed
In positively skewed (right-skewed) shapes, the majority of data values fall to the left of the mean and cluster at the lower end of the distribution, with the tail to the right. The mean and median are to the right of the mode.
C. Negatively skewed or Left skewed
In negatively skewed (left-skewed) shapes, the majority of the data values fall to the right of the mean and cluster at the upper end of the distribution, with the tail to the left. The mean and median are to the left of the mode.

IV. Measures of Spread (dispersion or variability)
A. Types
1. Range (R): the highest value minus the lowest value in a data set
2. Variance
3. Standard Deviation

Question 5
An entertainment event advertises that people ages 1 to 100 would enjoy the event. The advertisement specifically describes a set of people with
a) a large number of ages.
b) a large range of ages.
c) a large mean of ages.
d) do not know.

B. Variance
I. Population Variance ($\sigma^2$)
  $\sigma^2 = \frac{\sum (x_i - \mu)^2}{N}$
To calculate population variance:
1. Find the mean.
2. Subtract the mean from each data value.
3. Square each difference.
4. Add the squared differences and divide the sum by the number of data values.
II. Sample Variance ($s^2$)
To calculate sample variance:
1. Find the mean.
2. Subtract the mean from each data value.
3. Square each difference.
4. Add the squared differences and divide the sum by the number of data values minus one.
  $s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1}$

III. Example: Sample Variance
Data table: values not recoverable; the surviving column total is 10.
Note: Round to one more place than the original data.

C. Standard Deviation
I. Population Standard Deviation ($\sigma$)
The population standard deviation ($\sigma$) is the square root of the population variance ($\sigma^2$):
  $\sigma = \sqrt{\frac{\sum (x_i - \mu)^2}{N}}$
Same rounding rule: Round the final answer to one more decimal place than the original data.

II. Sample Standard Deviation ($s$)
a) Deviation Formula
The sample standard deviation ($s$) is the square root of the sample variance ($s^2$):
  $s = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n - 1}}$
Same rounding rule: Round the final answer to one more decimal place than the original data.
b) Computational Formula
  $s = \sqrt{\frac{n \sum x^2 - (\sum x)^2}{n(n - 1)}}$
Note: This formula gives better accuracy in hand calculation when the mean has several decimal places, because those decimals are likely to be rounded off or dropped when computing each deviation.
Process:
1. Find the sum of all of the data values.
2. Find the sum of the squared data values.
3. Multiply the sum of the squared data values by the number of data values.
4. Square the sum of the data values from step 1.
5. Subtract the step 4 answer from the step 3 answer.
6. Divide the difference in step 5 by n times (n - 1).
7. Take the square root of the quotient.

c) Example: Standard Deviation (again with the computational formula)
  Data: x | $x^2$
  Total

D. Range Rule of Thumb
A rough estimate of the standard deviation is:
  $s \approx \frac{\text{range}}{4}$
where range is the highest data value minus the lowest data value.
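The variance and standard deviation formulas in sections B through D can be compared side by side in code. This is a sketch under stated assumptions: the function names are hypothetical, the data set [1, 2, 3, 4, 5] is invented for illustration, and the range rule of thumb is taken as range divided by 4.

```python
import math


def population_variance(values):
    """Sum of squared deviations from the mean, divided by N."""
    mu = sum(values) / len(values)
    return sum((x - mu) ** 2 for x in values) / len(values)


def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    xbar = sum(values) / n
    return sum((x - xbar) ** 2 for x in values) / (n - 1)


def sample_sd(values):
    """Deviation formula: square root of the sample variance."""
    return math.sqrt(sample_variance(values))


def sample_sd_computational(values):
    """Computational formula: sqrt((n * sum(x^2) - (sum x)^2) / (n (n - 1)))."""
    n = len(values)
    total = sum(values)                      # step 1
    total_sq = sum(x * x for x in values)    # step 2
    return math.sqrt((n * total_sq - total ** 2) / (n * (n - 1)))  # steps 3-7


def range_rule_of_thumb(values):
    """Rough estimate of the standard deviation: range / 4."""
    return (max(values) - min(values)) / 4


data = [1, 2, 3, 4, 5]                       # hypothetical data
print(population_variance(data))             # 2.0
print(sample_variance(data))                 # 2.5
print(round(sample_sd(data), 2))             # 1.58
print(round(sample_sd_computational(data), 2))  # 1.58
print(range_rule_of_thumb(data))             # 1.0
```

The two standard deviation functions agree because the formulas are algebraically equivalent; the computational form simply avoids computing each deviation from a rounded mean.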
E. Standard Deviation for Grouped Data
Use the class midpoints and frequencies:
  $s = \sqrt{\frac{\sum (x_i - \bar{x})^2 f_i}{n - 1}}$

I. Example: Standard Deviation for Grouped Data
  Data: Midpoint (x) | Frequency (f) | xf | $x^2 f$

Question 6
If we know the variance of a set of data, then to calculate the standard deviation of this data
a) is a long process because of the many operations needed.
b) is a short process because the standard deviation is equal to the variance.
c) is a short process because the standard deviation is the square root of the variance.
d) do not know.

F. Uses for Variance and Standard Deviation
1. Measures of spread, variability, and consistency.
2. To complete inferential statistics.
3. To understand data distributions using Chebyshev's theorem and the Empirical Rule.
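The grouped-data formula from section E can be sketched as follows. The function name and the three-class data set are hypothetical; the original example table did not survive.

```python
import math


def grouped_sd(midpoints, frequencies):
    """Sample standard deviation from class midpoints and frequencies:
    sqrt(sum(f * (x - xbar)^2) / (n - 1)), with n = sum of frequencies."""
    n = sum(frequencies)
    xbar = sum(x * f for x, f in zip(midpoints, frequencies)) / n
    squared = sum(f * (x - xbar) ** 2 for x, f in zip(midpoints, frequencies))
    return math.sqrt(squared / (n - 1))


# Hypothetical classes: midpoints 5, 15, 25 with frequencies 2, 5, 3.
# n = 10, mean = 160/10 = 16, weighted squared deviations = 242 + 5 + 243 = 490.
s = grouped_sd([5, 15, 25], [2, 5, 3])
print(round(s, 1))   # 7.4
```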
G. Coefficient of Variation (cvar): Comparing Standard Deviations for Different Distributions
To compare standard deviations for different distributions, use the coefficient of variation. The coefficient of variation is the standard deviation divided by the mean and multiplied by 100%. It is free of measurement units.
  $\text{cvar} = \frac{\text{standard deviation}}{\text{mean}} \cdot 100\%$

I. Example: Comparing Standard Deviations for Different Distributions I
The mean of the number of sales of cars over a 3-month period is 87, and the standard deviation is 5. The mean of the commissions is $5225, and the standard deviation is $773. Compare the variations of the two.
  Sales: cvar = (5/87)(100%) ≈ 5.7%
  Commissions: cvar = (773/5225)(100%) ≈ 14.8%
The commissions are more variable than the sales.

II. Example: Comparing Standard Deviations for Different Distributions II
John took two tests last week. The average for the history test was 61.3 and the standard deviation was [missing]. The average for the math test was 81.5 and the standard deviation was [missing]. Compare the variation for the two tests.
  History Test
  Math Test
The history test is more variable than the math test.

V. Calculator
A. TI-83 Key Strokes to Clear Lists
ALWAYS clear out lists before entering data.
1. STAT
2. CLRLIST (L1, L2, L3, ...) Use second function 1 for L1, second function 2 for L2, etc. Be sure to include commas and end with a closing parenthesis.
3. ENTER
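For readers checking the coefficient-of-variation comparison in section G without a calculator, a short sketch follows. The function name is hypothetical; the numbers are the ones stated in Example I (sales and commissions).

```python
def coefficient_of_variation(sd, mean):
    """Standard deviation divided by the mean, expressed as a percentage."""
    return sd / mean * 100


sales = coefficient_of_variation(5, 87)
commissions = coefficient_of_variation(773, 5225)

print(round(sales, 1))        # 5.7
print(round(commissions, 1))  # 14.8
print(commissions > sales)    # True: commissions are more variable
```

Because the result is a unit-free percentage, a count of cars and a dollar amount can be compared directly.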
B. TI-83 Key Strokes to Enter Data
Enter data into a cleared list.
1. STAT
2. EDIT
3. Enter the data in the lists as needed, pressing ENTER after each data value.

C. TI-83 Basic Statistics for Ungrouped Data
1. Clear L1 and enter the data in L1
2. STAT
3. CALC
4. 1-VAR STATS L1
5. ENTER

D. TI-83 Basic Statistics for Grouped Data
1. Clear L1, L2
2. STAT
3. EDIT
4. Enter midpoints in L1 and enter their corresponding frequencies in L2
5. STAT
6. CALC
7. 1-VAR STATS L1, L2
8. Check that n is the sum of the frequencies

VI. Rules For Data Distribution
For all data sets, use Chebyshev's Theorem.
For bell-shaped or approximately normally distributed data sets, use the Empirical Rule (68-95-99.7 Rule).

A. Chebyshev's Theorem for All Distributions
For any distribution, the proportion of values from a data set that will fall within k standard deviations of the mean will be at least $1 - 1/k^2$, where k is a number greater than 1.

I. Process
Select values for k and compute $1 - 1/k^2$:

  k   | $1 - 1/k^2$           | Interpretation
  2.1 | 1 - 1/2.1^2 ≈ 0.773 | At least 77.3% of the data is within 2.1 standard deviations of the mean, or in the interval ($\bar{X} - 2.1S$, $\bar{X} + 2.1S$)
  2   | 1 - 1/2^2 = 0.75    | At least 75% of the data is within 2 standard deviations of the mean, or in the interval ($\bar{X} - 2S$, $\bar{X} + 2S$)
  3.3 | 1 - 1/3.3^2 ≈ 0.908 | At least 90.8% of the data is within 3.3 standard deviations of the mean, or in the interval ($\bar{X} - 3.3S$, $\bar{X} + 3.3S$)
II. Example: Chebyshev's Theorem for All Distributions with k = 2
The mean price of houses in a certain neighborhood is $150,000, and the standard deviation is $10,000. Find the price range for which at least 75% of the houses will sell.
Since $1 - 1/k^2 = 0.75$ when k = 2, use k = 2.
  $150,000 + 2($10,000) = $150,000 + $20,000 = $170,000
  $150,000 - 2($10,000) = $150,000 - $20,000 = $130,000
At least 75% of the houses cost between $130,000 and $170,000.

B. Empirical Rule for Bell-Shaped Distributions
Approximately 68% of the data values fall within one standard deviation of the mean.
Approximately 95% of the data values fall within two standard deviations of the mean.
Approximately 99.7% of the data values fall within three standard deviations of the mean.

VII. Measures of Position or Relative Standing
Measures of position give the relative position of one data value in comparison with the entire set of data values.
  Z-score
  Percentiles
  Quartiles
  Deciles
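The house-price example for Chebyshev's theorem in Section VI can be reproduced with a short sketch. The function names are hypothetical; the mean, standard deviation, and k come from that example.

```python
def chebyshev_bound(k):
    """Minimum proportion of data within k standard deviations of the mean.
    The theorem applies only for k greater than 1."""
    if k <= 1:
        raise ValueError("k must be greater than 1")
    return 1 - 1 / k ** 2


def chebyshev_interval(mean, sd, k):
    """Interval (mean - k*sd, mean + k*sd) that the bound refers to."""
    return (mean - k * sd, mean + k * sd)


print(chebyshev_bound(2))                      # 0.75
print(chebyshev_interval(150_000, 10_000, 2))  # (130000, 170000)
print(round(chebyshev_bound(2.1), 3))          # 0.773
```

The bound is a guaranteed minimum for any distribution; for bell-shaped data the Empirical Rule gives the much tighter figure of about 95% within two standard deviations.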
A. Standard Scores (used to compare data values between two groups)
To compare data values, subtract the mean from the data value and divide by the standard deviation.

I. Z-score Formulas
  For samples: $z = \frac{x - \bar{x}}{s}$
  For populations: $z = \frac{x - \mu}{\sigma}$

II. Example: Z-score I
A student scored 75 on a calculus test that had a mean of 50 and a standard deviation of 10; she scored 80 on a history test with a mean of 75 and a standard deviation of 6.1. Compare her relative positions on the two tests.
  Calculus: z = (75 - 50)/10 = 2.50
  History: z = (80 - 75)/6.1 = 0.82
The first z-score is larger. Thus, the 75 in calculus is a better grade as a standard score, that is, compared to her classmates, than the 80 on the history test.
Note: z-scores are always given to two-place accuracy.

III. Understanding Z-scores
a) Z-scores have a mean of 0 and a standard deviation of 1.
b) A z-score is the number of standard deviations a value is away from the mean for a specific distribution.
d) Ordinary and unusual z-scores
  Ordinary values: -2 < z < 2
  Unusual values: z < -2 or z > 2
e) Whenever a value is less than the mean, its corresponding z-score is negative.
f) Example: Z-score II
Using the information below, compare Joe's height of 78 inches to Susan's height of 73 inches.
Men have heights with a mean of 69.0 inches and a standard deviation of 2.8 inches.
Women have heights with a mean of 63.6 inches and a standard deviation of 2.5 inches.
  Joe: z = (78 - 69.0)/2.8 = 3.21
  Susan: z = (73 - 63.6)/2.5 = 3.76
Susan is taller compared to other women than Joe is compared to other men.

B. Percentiles (the position of a data value within its group)
A percentile, P, is an integer between 1 and 99 such that P% of the data values are less than or equal to the value and (100 - P)% of the data values are greater than or equal to the value.

I. Given a data value x, find the percentile P
  $P = \frac{(\text{number of values below } x) + 0.5}{n} \cdot 100\%$
1. Count the number of data values below x.
2. Add 0.5.
3. Divide the sum by the number of data values n.
4. Multiply by 100%.
5. Round to an integer using regular rounding rules.

II. Given the percentile P, find the data value x
  n: the total number of data values
  p: the percentile
  c: used to find the position of the data value
1. Order the data from lowest to highest.
2. To find the position of the data value x, let $c = \frac{n \cdot p}{100}$.
3. To find the data value, use the position value c:
  If c is not a whole number, round up to the next larger whole number. Starting at the lowest data value, count to the position that corresponds to the rounded-up value of c.
  If c is a whole number, use the value halfway between the c-th and (c + 1)-st values when counting up from the lowest value.
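The z-score comparison and the two percentile procedures above can be sketched together. The function names are hypothetical. The heights come from the Joe and Susan example, and the ten scores are the test scores used in the percentile examples of this section. Rounding half up is an assumption about what "regular rounding rules" means (Python's built-in `round` rounds halves to even, so it is avoided here).

```python
import math


def z_score(x, mean, sd):
    """Number of standard deviations x lies from the mean."""
    return (x - mean) / sd


def percentile_rank(values, x):
    """(number of values below x + 0.5) / n * 100, rounded half up."""
    below = sum(1 for v in values if v < x)
    percent = (below + 0.5) * 100 / len(values)
    return math.floor(percent + 0.5)


def value_at_percentile(values, p):
    """Data value at integer percentile p (1 to 99), using c = n * p / 100."""
    ordered = sorted(values)
    product = len(ordered) * p
    if product % 100 != 0:
        # c is not whole: round up and count to that position (1-based).
        position = product // 100 + 1
        return ordered[position - 1]
    # c is whole: halfway between the c-th and (c + 1)-st values.
    c = product // 100
    return (ordered[c - 1] + ordered[c]) / 2


print(round(z_score(78, 69.0, 2.8), 2))   # 3.21 (Joe)
print(round(z_score(73, 63.6, 2.5), 2))   # 3.76 (Susan)

scores = [18, 15, 12, 6, 8, 2, 3, 5, 20, 10]
print(value_at_percentile(scores, 13))    # 3   (c = 1.3, rounded up to 2)
print(percentile_rank(scores, 12))        # 65  (6 values below 12)
print(value_at_percentile(scores, 50))    # 9.0 (c = 5, halfway between 8 and 10)
```

Other textbooks and software packages define percentiles slightly differently, so results from a calculator or library may not match this procedure exactly.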
III. Example: Percentiles I
Find the value corresponding to the 13th percentile.
  Unordered data: 18, 15, 12, 6, 8, 2, 3, 5, 20, 10
  Ordered data: 2, 3, 5, 6, 8, 10, 12, 15, 18, 20
  $c = \frac{n \cdot p}{100} = \frac{10 \cdot 13}{100} = 1.3$
Since c is not a whole number, round up to 2. Start at the lowest score and count to the second value, which is 3. Thus 3 is the 13th percentile value.

IV. Example: Percentiles II
A teacher gives a 20-point test to 10 students. The scores are shown below. Find the percentile rank of a score of 12.
  Unordered data: 18, 15, 12, 6, 8, 2, 3, 5, 20, 10
  Ordered data: 2, 3, 5, 6, 8, 10, 12, 15, 18, 20
Six values are below 12, so
  Percentile = $\frac{6 + 0.5}{10} \cdot 100\%$ = 65th percentile

C. Quartiles
Quartiles divide the ordered list of data values into four groups.
  Q1 is the same as the 25th percentile.
  Q2 is the 50th percentile, or the median.
  Q3 is the 75th percentile.

Question 7
If a botanist measures the length of flower petals and finds that 75% of the lengths are 1.5 cm or longer, then 1.5 is
a) the first quartile of lengths of petals.
b) the 25th percentile of lengths of petals.
c) both of the above.
d) do not know.

D. Deciles
Deciles divide the distribution into 10 groups. They are denoted by D1, D2, ..., D10. How do deciles relate to percentiles?

Question 8
The percentile that corresponds to the mean is
a) the 50th percentile.
b) the 100th percentile.
c) no particular percentile corresponds to the mean.
d) do not know.

VIII. Exploratory Data Analysis
A. Introduction
I. Purpose:
  Examine data patterns when the mean is affected by outliers.
  Find gaps in the data.
  Find patterns.
  Compare data sets.
  Identify outliers (values located far away from other values).
II. Exploratory Data Analysis uses:
1. Five-number summary
2. Box plot

B. Five-Number Summary
A five-number summary is a list of:
  The lowest value of the data set (L or minimum)
  Q1 (25th percentile)
  The median (MD or 50th percentile)
  Q3 (75th percentile)
  The highest value of the data set (H or maximum)
A box plot is a graphical representation of a five-number summary on a scaled axis. Be sure the box is above the scaled line and drawn to scale (see the example in the text and Section X).
Question 9
A box plot can be drawn from data in a stem and leaf plot by
a) counting the values in the stem and leaf plot to determine the five-number summary.
b) adding the values in the stem and leaf plot to determine the five-number summary.
c) graphing only the stems and not the leaves from the stem and leaf plot.
d) do not know.

IX. Outliers
An outlier is an extremely high or an extremely low data value when compared with the rest of the data values.
  Can be the result of measurement or observational error.
  Can also indicate something else in the data.
  Can have a dramatic effect on the mean.
  Can have a dramatic effect on the standard deviation.
  Can have a dramatic effect on the scale of the histogram, so that the shape of the distribution is obscured.

A. Outliers for Normally Distributed Data
Any data value more than three standard deviations away from the mean is considered an outlier.

B. Outliers for Other Distributions
1. Arrange the data in order.
2. Find Quartile 1 and Quartile 3.
3. Find the interquartile range: IQR = Q3 - Q1
4. Outliers are:
  Any data value larger than Q3 + 1.5(IQR)
  Any data value smaller than Q1 - 1.5(IQR)
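The five-number summary and the IQR outlier procedure above can be sketched as follows. The function names and the data set are hypothetical, quartiles are found with the chapter's percentile position rule (c = n * p / 100), and the fence multiplier of 1.5 is the conventional choice, stated here as an assumption.

```python
def value_at_percentile(ordered, p):
    """Value at integer percentile p in already-ordered data, using the
    position c = n * p / 100 (round up if not whole, else average two values)."""
    product = len(ordered) * p
    if product % 100 != 0:
        return ordered[product // 100]       # 1-based position c rounded up
    c = product // 100
    return (ordered[c - 1] + ordered[c]) / 2


def five_number_summary(values):
    """(minimum, Q1, median, Q3, maximum)."""
    ordered = sorted(values)
    return (
        ordered[0],
        value_at_percentile(ordered, 25),
        value_at_percentile(ordered, 50),
        value_at_percentile(ordered, 75),
        ordered[-1],
    )


def iqr_outliers(values, multiplier=1.5):
    """Values below Q1 - multiplier*IQR or above Q3 + multiplier*IQR."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    low, high = q1 - multiplier * iqr, q3 + multiplier * iqr
    return [v for v in sorted(values) if v < low or v > high]


data = [2, 3, 5, 6, 8, 10, 12, 15, 18, 60]   # hypothetical data
print(five_number_summary(data))             # (2, 5, 9.0, 15, 60)
print(iqr_outliers(data))                    # [60]
```

Here Q1 = 5 and Q3 = 15, so IQR = 10 and the fences are -10 and 30; only the value 60 lies outside them.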
X. Box Plots (Box and Whisker Plots)
A box plot is a scaled graph of the five-number summary.
Process:
1. Find the five-number summary (minimum, Q1, Q2, Q3, and maximum).
2. Construct a horizontal scale that includes the minimum and the maximum data values. Start the scale at or below the lowest data value and end it slightly above the largest data value.
3. Construct a rectangle floating above the line with the left end at Quartile 1 and the right end at Quartile 3.
4. Construct a vertical line segment inside the box at the median.
5. Construct a horizontal line segment from the center of the left vertical box edge to the lowest data value that is not an outlier. Construct a second horizontal line segment from the center of the right vertical box edge to the highest data value that is not an outlier.
6. Graph mild outliers with a solid dot. Graph extreme outliers with an open dot.

XI. Summary
Histograms, frequency polygons, and ogives are used for quantitative data organized in a grouped frequency distribution.
Pareto charts and bar graphs are frequency graphs for qualitative variables.
Time series graphs are used to show a pattern or trend that occurs over time.
Pie graphs are used to show the relationship between the parts and the whole for qualitative or categorical data.
Data can be organized in meaningful ways using frequency distributions and graphs.
In descriptive statistics, we use all of these numerical and graphical techniques with sampling methods to collect, organize, summarize, and present data.
Data is organized for interpretation and inference.
Answer: Question 1
When a person says that the average age of a group of workers is 35, the average
C: could be either the mean or the median of the ages.

Answer: Question 2
If we are taking a test and we wish to score in the upper half of the students, then we wish to be higher than the
B: the median of the test scores.

Answer: Question 3
Which measures of the center are influenced by outliers?
E: both A & D.

Answer: Question 4
If we tally the votes in an election, then the winner would be the candidate corresponding to
C: the mode of the number of votes.

Answer: Question 5
An entertainment event advertises that people ages 1 to 100 would enjoy the event. The advertisement specifically describes a set of people with
B: a large range of ages.

Answer: Question 6
If we know the variance of a set of data, then to calculate the standard deviation of this data
C: is a short process because the standard deviation is the square root of the variance.

Answer: Question 7
If a botanist measures the length of flower petals and finds that 75% of the lengths are 1.5 cm or longer, then 1.5 is
C: both of the above (the first quartile and the 25th percentile).

Answer: Question 8
The percentile that corresponds to the mean is
C: no particular percentile corresponds to the mean.

Answer: Question 9
A box plot can be drawn from data in a stem and leaf plot by
A: counting the values in the stem and leaf plot to determine the five-number summary.
More information! x sum of the entries
3.1 Measures of Central Tendency (Page 1 of 16) 3.1 Measures of Central Tendency Mean, Median and Mode! x sum of the entries a. mean, x = = n number of entries Example 1 Find the mean of 26, 18, 12, 31,
More informationGCSE HIGHER Statistics Key Facts
GCSE HIGHER Statistics Key Facts Collecting Data When writing questions for questionnaires, always ensure that: 1. the question is worded so that it will allow the recipient to give you the information
More information1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics)
1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics) As well as displaying data graphically we will often wish to summarise it numerically particularly if we wish to compare two or more data sets.
More informationSTATISTICS FOR PSYCH MATH REVIEW GUIDE
STATISTICS FOR PSYCH MATH REVIEW GUIDE ORDER OF OPERATIONS Although remembering the order of operations as BEDMAS may seem simple, it is definitely worth reviewing in a new context such as statistics formulae.
More informationIntroduction to Descriptive Statistics
Mathematics Learning Centre Introduction to Descriptive Statistics Jackie Nicholas c 1999 University of Sydney Acknowledgements Parts of this booklet were previously published in a booklet of the same
More informationFrequency Distributions
Displaying Data Frequency Distributions After collecting data, the first task for a researcher is to organize and summarize the data to get a general overview of the results. Remember, this is the goal
More informationSampling, frequency distribution, graphs, measures of central tendency, measures of dispersion
Statistics Basics Sampling, frequency distribution, graphs, measures of central tendency, measures of dispersion Part 1: Sampling, Frequency Distributions, and Graphs The method of collecting, organizing,
More informationBNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
More informationChapter 2 Summarizing and Graphing Data
Chapter 2 Summarizing and Graphing Data 21 Review and Preview 22 Frequency Distributions 23 Histograms 24 Graphs that Enlighten and Graphs that Deceive Preview Characteristics of Data 1. Center: A
More informationDescribe what is meant by a placebo Contrast the doubleblind procedure with the singleblind procedure Review the structure for organizing a memo
Readings: Ha and Ha Textbook  Chapters 1 8 Appendix D & E (online) Plous  Chapters 10, 11, 12 and 14 Chapter 10: The Representativeness Heuristic Chapter 11: The Availability Heuristic Chapter 12: Probability
More informationCentral Tendency. n Measures of Central Tendency: n Mean. n Median. n Mode
Central Tendency Central Tendency n A single summary score that best describes the central location of an entire distribution of scores. n Measures of Central Tendency: n Mean n The sum of all scores divided
More informationSummarizing and Displaying Categorical Data
Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course
More informationDescriptive Statistics
Chapter 2 Descriptive Statistics 2.1 Descriptive Statistics 1 2.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Display data graphically and interpret graphs:
More informationChapter 7 What to do when you have the data
Chapter 7 What to do when you have the data We saw in the previous chapters how to collect data. We will spend the rest of this course looking at how to analyse the data that we have collected. Stem and
More informationMathematics. Probability and Statistics Curriculum Guide. Revised 2010
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
More informationGraphical and Tabular. Summarization of Data OPRE 6301
Graphical and Tabular Summarization of Data OPRE 6301 Introduction and Recap... Descriptive statistics involves arranging, summarizing, and presenting a set of data in such a way that useful information
More information3: Summary Statistics
3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes
More informationChapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
More informationNumerical Summarization of Data OPRE 6301
Numerical Summarization of Data OPRE 6301 Motivation... In the previous session, we used graphical techniques to describe data. For example: While this histogram provides useful insight, other interesting
More informationMath Chapter 2 review
Math 116  Chapter 2 review Name Provide an appropriate response. 1) Suppose that a data set has a minimum value of 28 and a max of 73 and that you want 5 classes. Explain how to find the class width for
More informationIntroduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data
A Few Sources for Data Examples Used Introduction to Environmental Statistics Professor Jessica Utts University of California, Irvine jutts@uci.edu 1. Statistical Methods in Water Resources by D.R. Helsel
More informationChapter 2: Frequency Distributions and Graphs (or making pretty tables and pretty pictures)
Chapter 2: Frequency Distributions and Graphs (or making pretty tables and pretty pictures) Example: Titanic passenger data is available for 1310 individuals for 14 variables, though not all variables
More informationLesson 3 Measures of Central Location and Dispersion
Lesson 3 Measures of Central Location and Dispersion As epidemiologists, we use a variety of methods to summarize data. In Lesson 2, you learned about frequency distributions, ratios, proportions, and
More informationMathematics Teachers Self Study Guide on the national Curriculum Statement. Book 2 of 2
Mathematics Teachers Self Study Guide on the national Curriculum Statement Book 2 of 2 1 WORKING WITH GROUPED DATA Material written by Meg Dickson and Jackie Scheiber RADMASTE Centre, University of the
More informationDescriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
More information3.2 Measures of Spread
3.2 Measures of Spread In some data sets the observations are close together, while in others they are more spread out. In addition to measures of the center, it's often important to measure the spread
More informationMEI Statistics 1. Exploring data. Section 1: Introduction. Looking at data
MEI Statistics Exploring data Section : Introduction Notes and Examples These notes have subsections on: Looking at data Stemandleaf diagrams Types of data Measures of central tendency Comparison of
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x  x) B. x 3 x C. 3x  x D. x  3x 2) Write the following as an algebraic expression
More informationIntroduction to Statistics for Psychology. Quantitative Methods for Human Sciences
Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html
More informationStatistics Chapter 2
Statistics Chapter 2 Frequency Tables A frequency table organizes quantitative data. partitions data into classes (intervals). shows how many data values are in each class. Test Score Number of Students
More information1 Measures for location and dispersion of a sample
Statistical Geophysics WS 2008/09 7..2008 Christian Heumann und Helmut Küchenhoff Measures for location and dispersion of a sample Measures for location and dispersion of a sample In the following: Variable
More informationCh. 3.1 # 3, 4, 7, 30, 31, 32
Math Elementary Statistics: A Brief Version, 5/e Bluman Ch. 3. # 3, 4,, 30, 3, 3 Find (a) the mean, (b) the median, (c) the mode, and (d) the midrange. 3) High Temperatures The reported high temperatures
More informationLecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
More informationSession 1.6 Measures of Central Tendency
Session 1.6 Measures of Central Tendency Measures of location (Indices of central tendency) These indices locate the center of the frequency distribution curve. The mode, median, and mean are three indices
More informationData Analysis: Describing Data  Descriptive Statistics
WHAT IT IS Return to Table of ontents Descriptive statistics include the numbers, tables, charts, and graphs used to describe, organize, summarize, and present raw data. Descriptive statistics are most
More informationReport of for Chapter 2 pretest
Report of for Chapter 2 pretest Exam: Chapter 2 pretest Category: Organizing and Graphing Data 1. "For our study of driving habits, we recorded the speed of every fifth vehicle on Drury Lane. Nearly every
More informationMeasures of Center Section 32 Definitions Mean (Arithmetic Mean)
Measures of Center Section 31 Mean (Arithmetic Mean) AVERAGE the number obtained by adding the values and dividing the total by the number of values 1 Mean as a Balance Point 3 Mean as a Balance Point
More information( ) ( ) Central Tendency. Central Tendency
1 Central Tendency CENTRAL TENDENCY: A statistical measure that identifies a single score that is most typical or representative of the entire group Usually, a value that reflects the middle of the distribution
More informationCH.6 Random Sampling and Descriptive Statistics
CH.6 Random Sampling and Descriptive Statistics Population vs Sample Random sampling Numerical summaries : sample mean, sample variance, sample range StemandLeaf Diagrams Median, quartiles, percentiles,
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationStatistical Concepts and Market Return
Statistical Concepts and Market Return 2014 Level I Quantitative Methods IFT Notes for the CFA exam Contents 1. Introduction... 2 2. Some Fundamental Concepts... 2 3. Summarizing Data Using Frequency Distributions...
More informationNorthumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data  November 2012  This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
More information909 responses responded via telephone survey in U.S. Results were shown by political affiliations (show graph on the board)
1 21 Overview Chapter 2: Learn the methods of organizing, summarizing, and graphing sets of data, ultimately, to understand the data characteristics: Center, Variation, Distribution, Outliers, Time. (Computer
More informationEach exam covers lectures from since the previous exam and up to the exam date.
Sociology 301 Exam Review Liying Luo 03.22 Exam Review: Logistics Exams must be taken at the scheduled date and time unless 1. You provide verifiable documents of unforeseen illness or family emergency,
More informationMathematics. GSE Algebra II/ Advanced Algebra Unit 7: Inferences & Conclusions from Data
Georgia Standards of Excellence Curriculum Frameworks Mathematics GSE Algebra II/ Advanced Algebra Unit 7: Inferences & Conclusions from Data These materials are for nonprofit educational purposes only.
More informationFind the median temperature. A) 33 F B) 59 F C) 51 F D) 67 F Answer: B
Review for TEST 2 STA 2023 FALL 2013 Name Find the mean of the data summarized in the given frequency distribution. 1) A company had 80 employees whose salaries are summarized in the frequency distribution
More informationDiagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
More informationChapter 15 Multiple Choice Questions (The answers are provided after the last question.)
Chapter 15 Multiple Choice Questions (The answers are provided after the last question.) 1. What is the median of the following set of scores? 18, 6, 12, 10, 14? a. 10 b. 14 c. 18 d. 12 2. Approximately
More informationDomain Essential Question Common Core Standards Resources
Middle School Math 20162017 Domain Essential Question Common Core Standards First Ratios and Proportional Relationships How can you use mathematics to describe change and model real world solutions? How
More informationMath Lesson 3: Displaying Data Graphically
Math Lesson 3: Displaying Data Graphically Hawaii DOE Content Standards: Math standard: [Data Analysis, Statistics, and Probability]Pose questions and collect, organize, and represent data to answer those
More informationCentral Tendency and Variation
Contents 5 Central Tendency and Variation 161 5.1 Introduction............................ 161 5.2 The Mode............................. 163 5.2.1 Mode for Ungrouped Data................ 163 5.2.2 Mode
More informationSTAT 155 Introductory Statistics. Lecture 5: Density Curves and Normal Distributions (I)
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 5: Density Curves and Normal Distributions (I) 9/12/06 Lecture 5 1 A problem about Standard Deviation A variable
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More informationDescribing Data. Carolyn J. Anderson EdPsych 580 Fall Describing Data p. 1/42
Describing Data Carolyn J. Anderson EdPsych 580 Fall 2005 Describing Data p. 1/42 Describing Data Numerical Descriptions Single Variable Relationship Graphical displays Single variable. Relationships in
More informationHome Runs, Statistics, and Probability
NATIONAL MATH + SCIENCE INITIATIVE Mathematics American League AL Central AL West AL East National League NL West NL East Level 7 th grade in a unit on graphical displays Connection to AP* Graphical Display
More information2 Describing, Exploring, and
2 Describing, Exploring, and Comparing Data This chapter introduces the graphical plotting and summary statistics capabilities of the TI 83 Plus. First row keys like \ R (67$73/276 are used to obtain
More information3.1 Measures of central tendency: mode, median, mean, midrange Dana Lee Ling (2012)
3.1 Measures of central tendency: mode, median, mean, midrange Dana Lee Ling (2012) Mode The mode is the value that occurs most frequently in the data. Spreadsheet programs such as Microsoft Excel or OpenOffice.org
More informationCHINHOYI UNIVERSITY OF TECHNOLOGY
CHINHOYI UNIVERSITY OF TECHNOLOGY SCHOOL OF NATURAL SCIENCES AND MATHEMATICS DEPARTMENT OF MATHEMATICS MEASURES OF CENTRAL TENDENCY AND DISPERSION INTRODUCTION From the previous unit, the Graphical displays
More informationData. ECON 251 Research Methods. 1. Data and Descriptive Statistics (Review) CrossSectional and TimeSeries Data. Population vs.
ECO 51 Research Methods 1. Data and Descriptive Statistics (Review) Data A variable  a characteristic of population or sample that is of interest for us. Data  the actual values of variables Quantitative
More informationM 225 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT!
M 225 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 114 14 15 3 16 5 17 4 18 4 19 11 20 9 21 8 22 16 Total 75 1 Multiple choice questions (1 point each) 1. Look
More informationSlides by. JOHN LOUCKS St. Edward s University
s by JOHN LOUCKS St. Edward s University 1 Chapter 2, Part A Descriptive Statistics: Tabular and Graphical Presentations Summarizing Qualitative Data Summarizing Quantitative Data 2 Summarizing Qualitative
More information