13.2 Measures of Central Tendency

Size: px
Start display at page:

Download "13.2 Measures of Central Tendency"

Transcription

1 13.2 Measures of Central Tendency Measures of Central Tendency For a given set of numbers, it may be desirable to have a single number to serve as a kind of representative value around which all the numbers in the set tend to cluster, a kind of middle number or a measure of central tendency. Three such measures are discussed in this section. Mean The mean (more properly called the arithmetic mean) of a set of data items is found by adding up all the items and then dividing the sum by the number of items. (The mean is what most people associate with the word average. ) The mean of a sample is denoted (read x bar ), while the mean of a complete population is denoted (the lower case Greek letter mu). The mean of n data items x 1, x 2,, x n, is given by the formula We use the symbol for summation, (the Greek letter sigma): x x n x x1 x2... xn Ex 1) Ten students in a math class were polled as to the number of siblings in their individual families and the results were: 3, 2, 2, 1, 3, 6, 3, 3, 4, 2. Find the mean number of siblings for the ten students. Ex 2) Researchers at the University of Maryland collected body temperature readings from a sample of adults, and eight of those temperatures are listed below (in degrees Fahrenheit). Does the mean of this sample equal 98.6, which is commonly believed to be the mean body temperature of adults? Weighted Mean x f w. f The weighted mean of n numbers x 1, x 2,, x n, that are weighted by the respective factors f 1, f 2,, f n is given by the formula above. In a common system for finding a grade-point average, an A is assigned 4 points, with 3 points for a B, 2 for C, and 1 for D. Find the grade-point average by multiplying the number of units for a course and the number assigned to each grade, and then adding these products. Finally, divide this sum by the total number of units. Ex 3) Find the grade-point average (weighted mean) for the grades below. Course Grade Points Units (credits) Math 4 (A) 5 History 3 (B) 3 Health 4 (A) 2 Art 2 (C) 2 Cosner - Math Notes - 1

2 Median Another measure of central tendency, which is not so sensitive to extreme values, is the median. This measure divides a group of numbers into two parts, with half the numbers below the median and half above it. To find the median of a group of items: 1) Rank the items. (Put in order.) 2) If the number of items is, the median is the middle item in the list. 3) If the number of items is, the median is the mean of the two middle numbers. Ex 4) Ten students in a math class were polled as to the number of siblings in their individual families and the results were: 3, 2, 2, 1, 1, 6, 3, 3, 4, 2. Find the median number of siblings for the ten students. Position of the Median in a Frequency Distribution Position of median = Notice that this formula gives the position, and not the actual value. Ex 5) Find the median for the distribution. Value Frequency n 1 f Mode The mode of a data set is the value that occurs most frequently. When two values occur with the same greatest frequency, each one is a mode and the data set is bimodal. When more than two values occur with the same greatest frequency, each ins a mode and the data set is multimodal. When no value is repeated, we say that there is no mode. Ex 6) Ten students in a math class were polled as to the number of siblings in their individual families and the results were: 3, 2, 2, 1, 3, 6, 3, 3, 4, 2. Find the mode for the number of siblings. Cosner - Math Notes - 2

3 Ex 7) Find the mode for the distribution. Value Frequency Central Tendency from Stem-and-Leaf Displays We can calculate measures of central tendency from a stem-and-leaf display. The median and mode are easily identified when the leaves are ranked (in numerical order) on their stems. Ex 8) Given the stem and leaf plot below, calculate the three measures of central tendency. Symmetry in Data Sets The most useful way to analyze a data set often depends on whether the distribution is symmetric or nonsymmetric. In a symmetric distribution, as we move out from a central point, the pattern of frequencies is the same (or nearly so) to the left and right. In a non-symmetric distribution, the patterns to the left and right are different. A non-symmetric distribution with a tail extending out to the left, shaped like a J, is called skewed to the left. If the tail extends out to the right, the distribution is skewed to the right. Cosner - Math Notes - 3

4 13.3 Measures of Dispersion 13.3 Book HW: 4 (by hand), 12 The mean is a good indicator of the central tendency of a set of data values, but it does not give the whole story about the data. Ex 1) Below are scores on quizzes in 2 classes. Which class did better? Class 1 Class x 6.3 x 6. 3 mode = 6,8 mode = 6 median = 6 median = We cannot tell by just the measures of central tendency. So we must look at the measure of dispersion, or spread, of the data. Two of the most common measures of dispersion are the range and the standard deviation. The range of a set of data is the difference between the maximum value and minimum value. The range is easy to compute; however the range does not take into account all values. So, outliers can affect the range. The standard deviation of a set of sample values is a measure of variation of values about the mean. x x Sample standard deviation: s n 1 The standard deviation is a measure of variation of all values from the mean. Outliers can dramatically increase the standard deviation. The units of the standard deviation, s, are the same as the units of the original data values. The value of the standard deviation is usually positive. Question: When is the standard deviation not positive? 2 Ex 2) Find the mean and standard deviation of the given data (in minutes) by hand and then with calculator: a),,,, x (min) x x x x 2 Totals Cosner - Math Notes - 4

5 b) 46,,,, 54 x (min) x x x x Totals c) 5,,,, 95 x (min) x x x x Totals Ex 3) Find the sample standard deviation for the frequency distribution. Value Frequency Cosner - Math Notes - 5

6 The sample variance of a set of sample values is a measure of variation equal to the square of the standard deviation. Ex 4) In the preceding example we found that for 46,,,, 54 (minutes), the standard deviation was about 2.8 minutes. Find the variance of that same example. Central tendency and dispersion are different independent aspects of a set of data. Which one is more critical can depend on the situation. For example, suppose muffins sell by the basket. If you want the most muffins possible per dollar spent, you would look for the basket with the highest average weight per muffin. Or, if the muffins are to be served on a tray where presentation is important, you would look for a basket with uniform-sized muffins that is a basket with the lowest weight dispersion. So the more desirable the basket depends on your objective. Ex 5) Two companies, A and B, sell 12 ounce jars of instant coffee. Five jars of each were randomly selected from markets, and the contents were carefully weighed, shown below: A B a) Find which company provides more coffee in their jars. b) Find which company fills its jars more consistently. It is clear that a larger dispersion value means more spread than a smaller one. But it is difficult to say exactly what a single dispersion value says about a data set. Cosner - Math Notes - 6

7 13.4 Measures of Position 13.4 Book HW: 5, 6, 7, 28, 34 In some cases we are interested in certain individual items in the data set, rather than in the set as a whole. We need a way of measuring how an item fits into the collection, how it compares to other items in the collection, or even how it compares to another item in another collection. There are several common ways of creating such measures and they are usually called measures of position. If x is a data item in a sample with mean x and standard deviation s, then the z-score of x is given by z = A z-value (or standardized value), is the number of standard deviations that a given value x is above or below the mean. (Round z to two decimal places.) It is found using the following: Ex 1) Two students, who take different history classes, had exams on the same day. Jen s score was 83 while Joy s score was 78. Which student did relatively better, given the class data shown below? Jen Joy Class mean Class standard deviation 4 5 Ex 2) Stanford Binet IQ scores have a mean of 100 and a standard deviation of 16. Albert Einstein reportedly had an IQ of 160. a) What is the difference between Einstein s IQ and the mean? b) Convert Einstein s IQ score to a z-score. c) How many standard deviations is that (the difference in part a)? d) If we consider usual IQ scores to be those that covert to z scores between -2 and 2, is Einstein s IQ usual or unusual? Percentiles A percentile is a measure of location which divides the data set into 100 groups with 1% of the values in each group. We denote percentiles as P1, P2, P3,..., P 99. If we talk about P 35, then we mean the 35 th percentile and this means that about 35% of the data values will lie below this value. If we talk about P, then we mean the th percentile and this means that % of the data values lie below this value the median. Additionally, just as when finding the median, when we find percentiles we must order the data set first. Cosner - Math Notes - 7

8 There are two ways that we want to talk about percentiles: 1. We may want to know what percentile corresponds to a known data value; or 2. We may want to know what data value corresponds to a particular percentile. To find the percentile corresponding to a known data value we compute number of values less than x percentile of value x = 100 total number of data values (Round to the nearest whole number) Ex 3) In a particular class of thirty math students, they took an exam over Chapter 3. The exam scores are as follows: a) What percentile corresponds to an exam score of 73? b) What percentile corresponds to an exam score of 45? To find the data value corresponding to a stated percentile we must define a few notations: n = total number of values in the data set k = percentile being used L = location of the desired data value in the ORDERED data set th P = k percentile k First, we calculate the value of the location or position, L, as follows: k L n 100 Now, there are two options for L: 1. It is NOT a whole number and then we round UP to the next whole number and locate the data value that occupies that position; OR 2. It is a whole number and then we find the average of the Lth and (L + 1)st numbers in the ordered data set. Cosner - Math Notes - 8

9 Ex 4) In a particular class of thirty math students, they took an exam over Chapter 3. The exam scores are as follows: Find the 23 rd percentile for the Chapter 3 exam scores. Find the 70 th percentile for the Chapter 3 exam scores. Quartiles Recall that there are 99 percentiles that divide the data set up into 100 groups. There are 3 quartiles that divide the data set up into 4 groups -- Q1, Q2, and Q 3. Quartiles are measures of location, just as percentiles, but each group contains about 25% of the data. How can we connect quartiles back to percentiles? Q P 1 25 Q P median Q 2 P 3 75 So, when we find quartiles, really we aren t finding anything new just thinking back to the corresponding percentile. Ex 5) In a particular class of thirty math students, they took an exam over Chapter 3. The exam scores are as follows: Find the quartiles for the data set. Cosner - Math Notes - 9

10 The interquartile range (IQR) is the difference between the upper ( Q 3 ) and lower ( Q 1 ) quartiles. Approximately half the data values fall within the interquartile range. IQR Q3 Q1 The semi-interquartile range is the IQR divided by 2. The midquartile is the sum of the upper and lower quartiles divide by 2. Semi-interquartile = midquartile = Q3 Q1 2 Q3 Q1 2 An outlier is a value that that falls far from what would be considered normal data values. We will define outliers in terms of the interquartile range (IQR). A data value will qualify as an outlier if either of the following conditions are met: o The data value is larger than Q3 1.5 IQR ; or o The data value is smaller than Q1 1.5 IQR. We use a diagram called a boxplot visually display the extreme values (minimum and maximum), the quartiles (lower, median, upper) and the IQR over a number line. We draw a box that shows the IQR with a line through the box at the median. Then we draw in whiskers that extend from the box out to the extreme values. The 5 number summary is just an ordered listing of the important values that are used in the box plot in parentheses: Ex 6) Find the 5 number summary and construct a box plot for the Math exam scores. (minimum, Q 1, Q 2, Q 3, maximum) Deciles are the nine values (denoted D 1, D 2,, D 9 ) along the scale that divide a data set into ten (approximately) equal parts, and quartiles are the three values (Q 1, Q 2, Q 3 ) that divide the data set into four (approximately) equal parts. Ex 7) The following are test scores (out of 100) for a particular math class. Find the sixth decile Cosner - Math Notes - 10

11 The Box Plot A box plot, or box-and-whisker plot, involves the median (a measure of central tendency), the range (a measure of dispersion), and the first and third quartiles (measures of position), all incorporated into a simple visual display. For a given set of data, a box plot, or box-and-whisker plot, consists of a rectangular box positioned above a numerical scale, extending from Q 1 to Q 3, with the value of Q 2 (the median) indicated within the box, and with whiskers (line segments) extending to the left and right from the box out to the minimum and maximum data items. Ex 8) Construct a box plot for the weekly study times data shown below Ex 9) Using the data set below to create a boxplot for the data set Minimum: Lower Quartile: Median: Upper Quartile: Maximum: IQR: Cosner - Math Notes - 11

12 13.5 The Normal Distribution 13.5 Book HW: 46,48, 52, 54, 60b A random variable that can take on only certain fixed values is called a discrete random variable. A variable whose values are not restricted in this way is a continuous random variable. A normal curve is a symmetric, bell-shaped curve. Any random variable whose graph has this characteristic shape is said to have a normal distribution. On a normal curve, if the quantity shown on the horizontal axis is the number of standard deviations from the mean, rather than values of the random variable itself, then we call the curve the standard normal curve. Properties of Normal Curves The graph of a normal curve is bell-shaped and symmetric about a vertical line through its center. The mean, median, and mode of a normal curve are all equal and occur at the center of the distribution. Empirical (or ) Rule for Data with a Bell-Shaped Distribution This rule states that for data sets having a distribution that is approximately bell-shaped, the following properties apply. About % of all values fall within standard deviation of the mean. About % of all values fall within standard deviation of the mean. About % of all values fall within standard deviation of the mean. Ex 1) Suppose that 280 sociology students take an exam and that the distribution of their scores can be treated as normal. Find the number of scores falling within 2 standard deviations of the mean. Ex 2) IQ scores have a bell-shaped distribution with a mean of 100 and variance of 225. What percentage of IQ scores are between 70 and 130? Cosner - Math Notes - 12

13 A Table of Standard Normal Curve Areas To answer questions that involve regions other than 1, 2, or 3 standard deviations of the mean we can refer to one of our tools, the calculator. The general syntax of the normalcdf command is: normalcdf( start, end, mean, std_dev) 2 nd VARS (DISTR), 2: normalcdf( The TI-83/84 will return the percentage (as a decimal number from 0 to 1) of data points x in a normal distribution with the given mean and standard deviation which lie between x = start and x = end. The standard normal distribution is a normal probability distribution with 1) 0 2) 1 3) Total area under its density curve is equal to 1. Ex 3) Use the calculator to find the percent of all scores that lie between the mean and the following values. a) 1.5 standard deviation above the mean b) 2.62 standard deviations below the mean Ex 4) Find the total area indicated in the region in color below. 0 z z Cosner - Math Notes - 13

14 Ex 5) Find the following using the table by first drawing a sketch: a) P(z < 1.58) b) P(z < -1.45) c) P(-2.3 < z < -1.45) Interpreting Normal Curve Areas In a standard normal curve, the following three quantities are equivalent. 1) Percentage (of total items that lie in an interval) 2) Probability (of a randomly chosen item lying in an interval) 3) Area (under the normal curve along an interval) Ex 6) The volumes of soda in bottles from a small company are distributed normally with mean 12 ounces and standard deviation.15 ounce. If 1 bottle is randomly selected, what is the probability that it will have more than ounces? Cosner - Math Notes - 14

15 Ex 7) The volume of liquid in a bottle of eye drop medicine is normally distributed with a mean of 30.5 ml and standard deviation of 2.5 ml. Calculate the following probabilities of randomly selecting a bottle: a) with less than 32 ml of eye drop medicine. b) with the amount of eye drop medicine between 26.5 and 34 ml. Ex 8) Assuming a normal distribution, find the z-value meeting the condition that 39% of the area is to the right of z. Ex 9) The shrinkage in length of a certain brand of blue jeans is normally distributed with a mean of 1.35 inches and standard deviation of 0.25 inch. What percent of this brand of jeans will shrink between 1 and 2 inches? Cosner - Math Notes - 15

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

The Normal Distribution

The Normal Distribution Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution

More information

consider the number of math classes taken by math 150 students. how can we represent the results in one number?

consider the number of math classes taken by math 150 students. how can we represent the results in one number? ch 3: numerically summarizing data - center, spread, shape 3.1 measure of central tendency or, give me one number that represents all the data consider the number of math classes taken by math 150 students.

More information

3: Summary Statistics

3: Summary Statistics 3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes

More information

AP Statistics Solutions to Packet 2

AP Statistics Solutions to Packet 2 AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that

More information

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck! STP 231 EXAM #1 (Example) Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

8. THE NORMAL DISTRIBUTION

8. THE NORMAL DISTRIBUTION 8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course

More information

Ch. 3.1 # 3, 4, 7, 30, 31, 32

Ch. 3.1 # 3, 4, 7, 30, 31, 32 Math Elementary Statistics: A Brief Version, 5/e Bluman Ch. 3. # 3, 4,, 30, 3, 3 Find (a) the mean, (b) the median, (c) the mode, and (d) the midrange. 3) High Temperatures The reported high temperatures

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

Lesson 4 Measures of Central Tendency

Lesson 4 Measures of Central Tendency Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

More information

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html

More information

Variables. Exploratory Data Analysis

Variables. Exploratory Data Analysis Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve

More information

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.) Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center

More information

3.2 Measures of Spread

3.2 Measures of Spread 3.2 Measures of Spread In some data sets the observations are close together, while in others they are more spread out. In addition to measures of the center, it's often important to measure the spread

More information

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random

More information

THE BINOMIAL DISTRIBUTION & PROBABILITY

THE BINOMIAL DISTRIBUTION & PROBABILITY REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

Means, standard deviations and. and standard errors

Means, standard deviations and. and standard errors CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard

More information

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data A Few Sources for Data Examples Used Introduction to Environmental Statistics Professor Jessica Utts University of California, Irvine jutts@uci.edu 1. Statistical Methods in Water Resources by D.R. Helsel

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

Probability Distributions

Probability Distributions Learning Objectives Probability Distributions Section 1: How Can We Summarize Possible Outcomes and Their Probabilities? 1. Random variable 2. Probability distributions for discrete random variables 3.

More information

Section 1.3 Exercises (Solutions)

Section 1.3 Exercises (Solutions) Section 1.3 Exercises (s) 1.109, 1.110, 1.111, 1.114*, 1.115, 1.119*, 1.122, 1.125, 1.127*, 1.128*, 1.131*, 1.133*, 1.135*, 1.137*, 1.139*, 1.145*, 1.146-148. 1.109 Sketch some normal curves. (a) Sketch

More information

Interpreting Data in Normal Distributions

Interpreting Data in Normal Distributions Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,

More information

3.4 The Normal Distribution

3.4 The Normal Distribution 3.4 The Normal Distribution All of the probability distributions we have found so far have been for finite random variables. (We could use rectangles in a histogram.) A probability distribution for a continuous

More information

How To Write A Data Analysis

How To Write A Data Analysis Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

DesCartes (Combined) Subject: Mathematics Goal: Statistics and Probability

DesCartes (Combined) Subject: Mathematics Goal: Statistics and Probability DesCartes (Combined) Subject: Mathematics Goal: Statistics and Probability RIT Score Range: Below 171 Below 171 Data Analysis and Statistics Solves simple problems based on data from tables* Compares

More information

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. MATH 3/GRACEY PRACTICE EXAM/CHAPTERS 2-3 Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The frequency distribution

More information

Shape of Data Distributions

Shape of Data Distributions Lesson 13 Main Idea Describe a data distribution by its center, spread, and overall shape. Relate the choice of center and spread to the shape of the distribution. New Vocabulary distribution symmetric

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Final Exam Review MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. 1) A researcher for an airline interviews all of the passengers on five randomly

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

Topic 9 ~ Measures of Spread

Topic 9 ~ Measures of Spread AP Statistics Topic 9 ~ Measures of Spread Activity 9 : Baseball Lineups The table to the right contains data on the ages of the two teams involved in game of the 200 National League Division Series. Is

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Data Exploration Data Visualization

Data Exploration Data Visualization Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select

More information

Key Concept. Density Curve

Key Concept. Density Curve MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 6 Normal Probability Distributions 6 1 Review and Preview 6 2 The Standard Normal Distribution 6 3 Applications of Normal

More information

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous Chapter 2 Overview Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Classify as categorical or qualitative data. 1) A survey of autos parked in

More information

Box-and-Whisker Plots

Box-and-Whisker Plots Learning Standards HSS-ID.A. HSS-ID.A.3 3 9 23 62 3 COMMON CORE.2 Numbers of First Cousins 0 3 9 3 45 24 8 0 3 3 6 8 32 8 0 5 4 Box-and-Whisker Plots Essential Question How can you use a box-and-whisker

More information

AP * Statistics Review. Descriptive Statistics

AP * Statistics Review. Descriptive Statistics AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

More information

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1. Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. STATISTICS/GRACEY PRACTICE TEST/EXAM 2 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Identify the given random variable as being discrete or continuous.

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles. Math 1530-017 Exam 1 February 19, 2009 Name Student Number E There are five possible responses to each of the following multiple choice questions. There is only on BEST answer. Be sure to read all possible

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem Tony Pourmohamad Department of Mathematics De Anza College Spring 2015 Objectives By the end of this set of slides, you

More information

AP STATISTICS REVIEW (YMS Chapters 1-8)

AP STATISTICS REVIEW (YMS Chapters 1-8) AP STATISTICS REVIEW (YMS Chapters 1-8) Exploring Data (Chapter 1) Categorical Data nominal scale, names e.g. male/female or eye color or breeds of dogs Quantitative Data rational scale (can +,,, with

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

2. Filling Data Gaps, Data validation & Descriptive Statistics

2. Filling Data Gaps, Data validation & Descriptive Statistics 2. Filling Data Gaps, Data validation & Descriptive Statistics Dr. Prasad Modak Background Data collected from field may suffer from these problems Data may contain gaps ( = no readings during this period)

More information

Census@School Data Handling and Probability (Grades 10, 11 and 12)

Census@School Data Handling and Probability (Grades 10, 11 and 12) Census@School Data Handling and Probability (Grades 10, 11 and 12) Statistics South Africa, 2013 Pali Lehohla, Statistician-General Census@School Data Handling and Probability (Grades 10, 11 and 12) /

More information

Exploratory Data Analysis. Psychology 3256

Exploratory Data Analysis. Psychology 3256 Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find

More information

Introduction; Descriptive & Univariate Statistics

Introduction; Descriptive & Univariate Statistics Introduction; Descriptive & Univariate Statistics I. KEY COCEPTS A. Population. Definitions:. The entire set of members in a group. EXAMPLES: All U.S. citizens; all otre Dame Students. 2. All values of

More information

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence

More information

Unit 7: Normal Curves

Unit 7: Normal Curves Unit 7: Normal Curves Summary of Video Histograms of completely unrelated data often exhibit similar shapes. To focus on the overall shape of a distribution and to avoid being distracted by the irregularities

More information

DesCartes (Combined) Subject: Mathematics Goal: Data Analysis, Statistics, and Probability

DesCartes (Combined) Subject: Mathematics Goal: Data Analysis, Statistics, and Probability DesCartes (Combined) Subject: Mathematics Goal: Data Analysis, Statistics, and Probability RIT Score Range: Below 171 Below 171 171-180 Data Analysis and Statistics Data Analysis and Statistics Solves

More information

Bar Graphs and Dot Plots

Bar Graphs and Dot Plots CONDENSED L E S S O N 1.1 Bar Graphs and Dot Plots In this lesson you will interpret and create a variety of graphs find some summary values for a data set draw conclusions about a data set based on graphs

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

First Midterm Exam (MATH1070 Spring 2012)

First Midterm Exam (MATH1070 Spring 2012) First Midterm Exam (MATH1070 Spring 2012) Instructions: This is a one hour exam. You can use a notecard. Calculators are allowed, but other electronics are prohibited. 1. [40pts] Multiple Choice Problems

More information

7. Normal Distributions

7. Normal Distributions 7. Normal Distributions A. Introduction B. History C. Areas of Normal Distributions D. Standard Normal E. Exercises Most of the statistical analyses presented in this book are based on the bell-shaped

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple. Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of

More information

Mind on Statistics. Chapter 2

Mind on Statistics. Chapter 2 Mind on Statistics Chapter 2 Sections 2.1 2.3 1. Tallies and cross-tabulations are used to summarize which of these variable types? A. Quantitative B. Mathematical C. Continuous D. Categorical 2. The table

More information

Mean, Median, and Mode

Mean, Median, and Mode DELTA MATH SCIENCE PARTNERSHIP INITIATIVE M 3 Summer Institutes (Math, Middle School, MS Common Core) Mean, Median, and Mode Hook Problem: To compare two shipments, five packages from each shipment were

More information

You flip a fair coin four times, what is the probability that you obtain three heads.

You flip a fair coin four times, what is the probability that you obtain three heads. Handout 4: Binomial Distribution Reading Assignment: Chapter 5 In the previous handout, we looked at continuous random variables and calculating probabilities and percentiles for those type of variables.

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Chapter 1 Review 1. As part of survey of college students a researcher is interested in the variable class standing. She records a 1 if the student is a freshman, a 2 if the student

More information

Sampling and Descriptive Statistics

Sampling and Descriptive Statistics Sampling and Descriptive Statistics Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University Reference: 1. W. Navidi. Statistics for Engineering and Scientists.

More information

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers 1.3 Measuring Center & Spread, The Five Number Summary & Boxplots Describing Quantitative Data with Numbers 1.3 I can n Calculate and interpret measures of center (mean, median) in context. n Calculate

More information

Interpret Box-and-Whisker Plots. Make a box-and-whisker plot

Interpret Box-and-Whisker Plots. Make a box-and-whisker plot 13.8 Interpret Box-and-Whisker Plots Before You made stem-and-leaf plots and histograms. Now You will make and interpret box-and-whisker plots. Why? So you can compare sets of scientific data, as in Ex.

More information

Bellwork Students will review their study guide for their test. Box-and-Whisker Plots will be discussed after the test.

Bellwork Students will review their study guide for their test. Box-and-Whisker Plots will be discussed after the test. Course: 7 th Grade Math Student Objective (Obj. 5c) TSW graph and interpret data in a box-and-whisker plot. DETAIL LESSON PLAN Friday, March 23 / Monday, March 26 Lesson 1-10 Box-and-Whisker Plot (Textbook

More information

Chapter 3. The Normal Distribution

Chapter 3. The Normal Distribution Chapter 3. The Normal Distribution Topics covered in this chapter: Z-scores Normal Probabilities Normal Percentiles Z-scores Example 3.6: The standard normal table The Problem: What proportion of observations

More information

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name: Glo bal Leadership M BA BUSINESS STATISTICS FINAL EXAM Name: INSTRUCTIONS 1. Do not open this exam until instructed to do so. 2. Be sure to fill in your name before starting the exam. 3. You have two hours

More information

Common Tools for Displaying and Communicating Data for Process Improvement

Common Tools for Displaying and Communicating Data for Process Improvement Common Tools for Displaying and Communicating Data for Process Improvement Packet includes: Tool Use Page # Box and Whisker Plot Check Sheet Control Chart Histogram Pareto Diagram Run Chart Scatter Plot

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Suppose following data have been collected (heights of 99 five-year-old boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9

More information

Summarizing and Displaying Categorical Data

Summarizing and Displaying Categorical Data Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency

More information

Algebra I Vocabulary Cards

Algebra I Vocabulary Cards Algebra I Vocabulary Cards Table of Contents Expressions and Operations Natural Numbers Whole Numbers Integers Rational Numbers Irrational Numbers Real Numbers Absolute Value Order of Operations Expression

More information

Normal Probability Distribution

Normal Probability Distribution Normal Probability Distribution The Normal Distribution functions: #1: normalpdf pdf = Probability Density Function This function returns the probability of a single value of the random variable x. Use

More information

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1. Normal Distribution Definition A continuous random variable has a normal distribution if its probability density e -(y -µ Y ) 2 2 / 2 σ function can be written as for < y < as Y f ( y ) = 1 σ Y 2 π Notation:

More information

Stats on the TI 83 and TI 84 Calculator

Stats on the TI 83 and TI 84 Calculator Stats on the TI 83 and TI 84 Calculator Entering the sample values STAT button Left bracket { Right bracket } Store (STO) List L1 Comma Enter Example: Sample data are {5, 10, 15, 20} 1. Press 2 ND and

More information

TEACHER NOTES MATH NSPIRED

TEACHER NOTES MATH NSPIRED Math Objectives Students will understand that normal distributions can be used to approximate binomial distributions whenever both np and n(1 p) are sufficiently large. Students will understand that when

More information

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 OVERVIEW STATISTICS PANIK...THE THEORY AND METHODS OF COLLECTING, ORGANIZING, PRESENTING, ANALYZING, AND INTERPRETING DATA SETS SO AS TO DETERMINE THEIR ESSENTIAL

More information

Describing, Exploring, and Comparing Data

Describing, Exploring, and Comparing Data 24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

More information

Students summarize a data set using box plots, the median, and the interquartile range. Students use box plots to compare two data distributions.

Students summarize a data set using box plots, the median, and the interquartile range. Students use box plots to compare two data distributions. Student Outcomes Students summarize a data set using box plots, the median, and the interquartile range. Students use box plots to compare two data distributions. Lesson Notes The activities in this lesson

More information

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo)

COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY EXERCISE 8/5/2013. MEASURE OF CENTRAL TENDENCY: MODE (Mo) MEASURE OF CENTRAL TENDENCY: MODE (Mo) COMPARISON MEASURES OF CENTRAL TENDENCY & VARIABILITY Prepared by: Jess Roel Q. Pesole CENTRAL TENDENCY -what is average or typical in a distribution Commonly Measures: 1. Mode. Median 3. Mean quantified

More information

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management

More information

Scope and Sequence KA KB 1A 1B 2A 2B 3A 3B 4A 4B 5A 5B 6A 6B

Scope and Sequence KA KB 1A 1B 2A 2B 3A 3B 4A 4B 5A 5B 6A 6B Scope and Sequence Earlybird Kindergarten, Standards Edition Primary Mathematics, Standards Edition Copyright 2008 [SingaporeMath.com Inc.] The check mark indicates where the topic is first introduced

More information

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

A Correlation of. to the. South Carolina Data Analysis and Probability Standards A Correlation of to the South Carolina Data Analysis and Probability Standards INTRODUCTION This document demonstrates how Stats in Your World 2012 meets the indicators of the South Carolina Academic Standards

More information

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: What do the data look like? Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses

More information