PROPERTIES OF MEAN, MEDIAN


 Susanna Booth
 2 years ago
 Views:
Transcription
1 PROPERTIES OF MEAN, MEDIAN In the last class quantitative and numerical variables bar charts, histograms(in recitation) Mean, Median Suppose the data set is {30, 40, 60, 80, 90, 120} X = 70, median = 70 Suppose we add a large number 1000 to the list, then the new X = 202 median = 80 One extreme value changes the mean dramatically but the median does not change much 1 Median is Robust. A few outliers will have no or small effect on the median. Mean can change dramatically in the presence of a very large or very small value. 2 If x 1, x 2,..., x m have mean x and median m then ax 1 + b, ax 2 + b,..., ax n + b will have mean a x + b and median am + b. ( Mean respects change of scale) [.5cm] 3 Mean is mathematically more easier to work with than median. Mean or Median which is better? Depends on the context. If there are extreme values present in the data,then Median is better than Mean Mean is easier to compute. Easy to update if you have additional observations. Not so with median
2 MEASURES OF SPREAD data:x 1, x 2,...,..., x n (x 1 x), (x n x),..., (x n x) are deviations from the mean x n n n (x i x) = x i x 1 1 = n x n x = 0 The total of deviations above the mean is same as the total of deviations below the mean contrast with median: the number of observations above median same as number of observations below median 1 1. Range: Maximum  Minimum 2. IQR (Inter Quartile Range)= Q 3 Q 1 (Minimum,Q 1, Median, Q 3, Maximum: Five number Summary) 3. Standard Deviation PERCENTILES STANDARD DEVIATION pth percentile: The value below which (roughly) p% of the data points lie. When p=50 we get the median. The 25th percentile is called FIRST QUARTILE and denoted by Q 1 The 75th percentile is called THIRD QUARTILE and denoted by Q 3 Standard deviation is another, very useful, measure of the amount of deviation from the mean. These are useful in determining Outliers and can be represented in a Box plot.
3 STANDARD DEVIATION Computation STANDARD DEVIATION Computation Let x 1, x 2,..., x n be a data set with mean x, xvalues (x x) (x x) 2 x 1 (x 1 x) (x 1 x) 2 x 2 (x 2 x) (x 2 x) 2 x 3 (x 3 x) (x 3 x) x n (x n x) 0 (x n x) 2 n 1 (x i x 2 Population variance : σ 2 n i=1 = (x i x) 2 n Population standard deviation: σ = Population Variance Sample variance: s 2 n i=1 = (x i x) 2 n 1 Sample standard deviation: s = Sample Variance Why the square in (x x) 2? (xi x) = x i n x = 0. Some of the deviations from the mean are positive and some negative. When you sum they cancel out each other. So take square to make everything positive Why take the square root of the variance? This ensures that the measure of variability standard deviation is in the same unit as the data.
4 Example The following data are number of passengers on flights of Delta Air Lines between San Francisco and Seattle over 33 days in April and early May. 128,121,134,136,136,118,123,109,120,116,125,128,121,129,130,131, 127,119,114,134,110,136,134,125,128,123,128,133,132,136,134,129, 132 Find the range, variance and standard deviation of the data (assumed to be a sample) Maximum is 136, Minimum is 109 So the range is = 27 X = Properties of Variance 1. xvalues (x x) (x x) ( ) ( ) 2 = ( ) ( ) 2 = ( ) ( ) 2 = ( ) 2 = Sample variance = /32 = Sample s.d = = 7.6 Var (x 1, x 2,..., x n) 0 and is = 0 only when x 1 = x 2 =... = x n Var (x 1 + b, x 2 + b,..., x n + b) = Var(x 1, x 2,..., x n) Adding a constant does not change the variance Var (ax 1, ax 2,..., ax n) = a 2 Var(x 1, x 2,..., x n) Var(ax 1 + b, ax 2 + b,..., ax n + b) = a 2 Var(x 1, x 2,..., x n)
5 Properties of Standard Deviation HISTOGRAM Histogram of x1 Histogram of x2 S.D (x 1, x 2,..., x n) 0 and is = 0 only when x 1 = x 2 =... = x n S.D (x 1 + b, x 2 + b,..., x n + b) = S.D(x 1, x 2,..., x n) Density x1 Density x2 Adding a constant does not change the standard deviation S.D (ax 1, ax 2,..., ax n) = a S.D(x 1, x 2,..., x n) S.D (ax 1 + b, ax 2 + b,..., ax n + b) = a S.D(x 1, x 2,..., x n) Density 0.00 Histogram of x Density 0.0 Histogram of x x3 x5 Figure: Histograms What to look for in a Histogram PROPERTIES OF MEAN, MEDIAN Shape 1. Is it symmetric? skewed? 2. Does it have one mode? two modes? three modes? [ Peaks are called modes] Two modes suggests that the there are two subgroups present 3. Outliers? (will return to this later) 4. Gaps? 1. Describes how data are distributed 2. Measures of Shape Skew = Symmetry LeftSkewed Mean Median Symmetric Mean = Median RightSkewed Median Mean
6 Transformation to get to symmetry If the data is nearly symmetric, mean is a good measure of center For skewed data, median is a better measure Generally most statistical method use mean and symmetric bell shaped distn How do we justify this? NOT INCLUDED IN EXAM Sometimes you can transform the data and get a symmetric distn Transformation to get to symmetry Transformation to get to symmetry transformation.pdf Frequency Histogram of y y Frequency histogram of log y x
7 histogram2.pdf histogram of x by square.pdf histogram of x^2 Frequency Frequency y x What to look for in a Histogram Bimodal histogram 1. Is it symmetric? skewed? 2. Does it have one mode? two modes? three modes? [ Peaks are called modes] Two modes suggests that the there are two subgroups present 3. Outliers? (will return to this later) 4. Gaps? 0.6 * dnorm(x, 1, 1) * dnorm(x, 5, 1) Index
8 outliers outliers are data points that are far away from the main body of the histogram outliers need to be investigated further. Typically outliers can be explained. by education.png Figure: scatter plot age vs wage Figure: Histogram by education
9 my height is 5 7" It is a reasonable height in the general population In the population of basket ball players, this would be an outlier comparison.png Figure: outliers CHEBYSHEV s RULE Outliers are data points far away from the center of the data distribution far away depends on the center and on the amount of variation in the data mean and s.d determine which data points are outliers Atleast (1 1 k 2 ) part of the histogram lies within ks of the mean k=2: At least 75 % of the observations lie within 2 standard deviations of the mean k=3: at least 8/9, approx 90% of the observations lie within 3 standard deviations of the mean
10 If the histogram is bell shaped then prob 153, 144 Approximately 68 % of the observations lie within x s, x + s Approximately 95 % of the observations lie within x 2s, x + 2s Approximately 99.7 % of the observations lie within x 3s, x + 3s Problem If the range of a set of data is 20, find a rough approximation to the s.d of the data set 75% of the data falls within x 2s, x + 2s i.e within a range of x + 2s x 2s = 4s so 4s range = 20 so s 20 4 = 5 Numerical measures of relative standing Let µ be the mean of a data set when the data set is the population σ be the s.d of a data set when the data set is the population x be the mean of a data set when the data set is a sample σ be the sd of a data set when the data set is a sample For any value x, The Population zscore of x is The Sample zscore of x is z = x µ σ z = x X s The z score is a measure of how many s.d s is x away from the mean
11 x is k standard deviations away from the mean is same as x µ > kσ. i.e. If x µ > kσ then x µ σ > k i.e z > k. So x is larger than µ + kσ is equivalent to zvalue of x is larger than k. Similarly, x is smaller than µ kσ is equivalent to zvalue of x is smaller than k. So in terms of z  values At least 75 % of the observations have zvalues less than 2 at least 8/9, approx 90% of the observations have zvalues less than 3 Put differently, At most 10% of the observations have zvalues larger than 3 If the histogram is bell shaped then Since values that are far away from the mean have very large or very small (negative) z scores, we can use zscores to define outliers. Approximately 68 % of the observations have zvalues less than 1 Observations with zscores greater than 3 in absolute value are considered outliers. Approximately 95 % of the observations have zvalues less than 2 Approximately 99.7 % of the observations have zvalues less than 3
12 problems 139,140,161 Another way to define outliers is via interquartile range and box plots. Outliers 2 Minimum, Q 1, Median, Q 3, Maximum Plot1.pdf are called Five number Summary Q 2 Q 1 is called Inter Quartile Range [IQR] 1. Graphical display of data using 5number summary data below Q 1 1.5(IQR) and data aboveq (IQR) are called Outliers X smallest Q 1 Median Q 3 X largest These are displayed in a
13 Plot2.pdf Plot3.pdf 1. Draw a rectangle (box) with the ends (hinges) drawn at the lower and upper quartiles (Q L and Q U ). The median data is shown by a line or symbol (such as + ). 2. The points at distances 1.5(IQR) from each hinge define the inner fences of the data set. Line (whiskers) are drawn from each hinge to the most extreme measurements inside the inner fence. 3. The symbol (*) represents measurements falling beyond the inner fences. 4. Symbols that represent the median and extreme data points vary depending on software used. You may use your own symbols if you are constructing a box plot by hand. Plot4.pdf Shape & Plot5.pdf Detecting Outliers LeftSkewed Symmetric RightSkewed Q 1 Median Q 3 Q 1 Median Q 3 Q 1 Median Q 3 s: Observations falling between the inner and outer fences are deemed suspect outliers. Observations falling beyond the outer fence are deemed highly suspect outliers. zscores: Observations with zscores greater than 3 in absolute value are considered outliers. (For some highly skewed data sets, observations with zscores greater than 2 in absolute value may be outliers.)
Chapter 3 Descriptive Statistics: Numerical Measures. Learning objectives
Chapter 3 Descriptive Statistics: Numerical Measures Slide 1 Learning objectives 1. Single variable Part I (Basic) 1.1. How to calculate and use the measures of location 1.. How to calculate and use the
More informationChapter 3: Data Description Numerical Methods
Chapter 3: Data Description Numerical Methods Learning Objectives Upon successful completion of Chapter 3, you will be able to: Summarize data using measures of central tendency, such as the mean, median,
More informationSTATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
More information103 Measures of Central Tendency and Variation
103 Measures of Central Tendency and Variation So far, we have discussed some graphical methods of data description. Now, we will investigate how statements of central tendency and variation can be used.
More informationHistogram. Graphs, and measures of central tendency and spread. Alternative: density (or relative frequency ) plot /13/2004
Graphs, and measures of central tendency and spread 9.07 9/13/004 Histogram If discrete or categorical, bars don t touch. If continuous, can touch, should if there are lots of bins. Sum of bin heights
More information4. Introduction to Statistics
Statistics for Engineers 41 4. Introduction to Statistics Descriptive Statistics Types of data A variate or random variable is a quantity or attribute whose value may vary from one unit of investigation
More informationThe right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median
CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box
More information1 Measures for location and dispersion of a sample
Statistical Geophysics WS 2008/09 7..2008 Christian Heumann und Helmut Küchenhoff Measures for location and dispersion of a sample Measures for location and dispersion of a sample In the following: Variable
More informationF. Farrokhyar, MPhil, PhD, PDoc
Learning objectives Descriptive Statistics F. Farrokhyar, MPhil, PhD, PDoc To recognize different types of variables To learn how to appropriately explore your data How to display data using graphs How
More information2.0 Lesson Plan. Answer Questions. Summary Statistics. Histograms. The Normal Distribution. Using the Standard Normal Table
2.0 Lesson Plan Answer Questions 1 Summary Statistics Histograms The Normal Distribution Using the Standard Normal Table 2. Summary Statistics Given a collection of data, one needs to find representations
More informationSTATISTICS FOR PSYCH MATH REVIEW GUIDE
STATISTICS FOR PSYCH MATH REVIEW GUIDE ORDER OF OPERATIONS Although remembering the order of operations as BEDMAS may seem simple, it is definitely worth reviewing in a new context such as statistics formulae.
More information1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics)
1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics) As well as displaying data graphically we will often wish to summarise it numerically particularly if we wish to compare two or more data sets.
More information13.2 Measures of Central Tendency
13.2 Measures of Central Tendency Measures of Central Tendency For a given set of numbers, it may be desirable to have a single number to serve as a kind of representative value around which all the numbers
More informationData Exploration Data Visualization
Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select
More informationDescribing Data. We find the position of the central observation using the formula: position number =
HOSP 1207 (Business Stats) Learning Centre Describing Data This worksheet focuses on describing data through measuring its central tendency and variability. These measurements will give us an idea of what
More informationSTAT 155 Introductory Statistics. Lecture 5: Density Curves and Normal Distributions (I)
The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 5: Density Curves and Normal Distributions (I) 9/12/06 Lecture 5 1 A problem about Standard Deviation A variable
More informationMEASURES OF VARIATION
NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are
More information32 Measures of Central Tendency and Dispersion
32 Measures of Central Tendency and Dispersion In this section we discuss two important aspects of data which are its center and its spread. The mean, median, and the mode are measures of central tendency
More informationDESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.
DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,
More information3: Summary Statistics
3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes
More informationNumerical Summarization of Data OPRE 6301
Numerical Summarization of Data OPRE 6301 Motivation... In the previous session, we used graphical techniques to describe data. For example: While this histogram provides useful insight, other interesting
More informationData Analysis: Describing Data  Descriptive Statistics
WHAT IT IS Return to Table of ontents Descriptive statistics include the numbers, tables, charts, and graphs used to describe, organize, summarize, and present raw data. Descriptive statistics are most
More informationDescriptive Statistics
Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web
More informationLecture 1: Review and Exploratory Data Analysis (EDA)
Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course
More informationChapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
More informationCenter: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)
Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center
More informationIntroduction to Statistics for Psychology. Quantitative Methods for Human Sciences
Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html
More information1. 2. 3. 4. Find the mean and median. 5. 1, 2, 87 6. 3, 2, 1, 10. Bellwork 32315 Simplify each expression.
Bellwork 32315 Simplify each expression. 1. 2. 3. 4. Find the mean and median. 5. 1, 2, 87 6. 3, 2, 1, 10 1 Objectives Find measures of central tendency and measures of variation for statistical data.
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. C) (a) 2 (b) 1
Unit 2 Review Name Use the given frequency distribution to find the (a) class width. (b) class midpoints of the first class. (c) class boundaries of the first class. 1) Miles (per day) 12 9 34 22 56
More informationAP Statistics Solutions to Packet 2
AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 68 2.1 DENSITY CURVES (a) Sketch a density curve that
More information1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers
1.3 Measuring Center & Spread, The Five Number Summary & Boxplots Describing Quantitative Data with Numbers 1.3 I can n Calculate and interpret measures of center (mean, median) in context. n Calculate
More informationModule 4: Data Exploration
Module 4: Data Exploration Now that you have your data downloaded from the Streams Project database, the detective work can begin! Before computing any advanced statistics, we will first use descriptive
More informationSection 3.1 Measures of Central Tendency: Mode, Median, and Mean
Section 3.1 Measures of Central Tendency: Mode, Median, and Mean One number can be used to describe the entire sample or population. Such a number is called an average. There are many ways to compute averages,
More informationShape of Data Distributions
Lesson 13 Main Idea Describe a data distribution by its center, spread, and overall shape. Relate the choice of center and spread to the shape of the distribution. New Vocabulary distribution symmetric
More informationExercise 1.12 (Pg. 2223)
Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.
More informationMeans, standard deviations and. and standard errors
CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard
More informationCC Investigation 5: Histograms and Box Plots
Content Standards 6.SP.4, 6.SP.5.c CC Investigation 5: Histograms and Box Plots At a Glance PACING 3 days Mathematical Goals DOMAIN: Statistics and Probability Display numerical data in histograms and
More informationDescriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
More information2 Descriptive statistics with R
Biological data analysis, Tartu 2006/2007 1 2 Descriptive statistics with R Before starting with basic concepts of data analysis, one should be aware of different types of data and ways to organize data
More informationExploratory Data Analysis
Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction
More informationExploratory Data Analysis. Psychology 3256
Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find
More informationDescriptive Statistics. Understanding Data: Categorical Variables. Descriptive Statistics. Dataset: Shellfish Contamination
Descriptive Statistics Understanding Data: Dataset: Shellfish Contamination Location Year Species Species2 Method Metals Cadmium (mg kg  ) Chromium (mg kg  ) Copper (mg kg  ) Lead (mg kg  ) Mercury
More informationLesson 4 Measures of Central Tendency
Outline Measures of a distribution s shape modality and skewness the normal distribution Measures of central tendency mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central
More informationSummary of Formulas and Concepts. Descriptive Statistics (Ch. 14)
Summary of Formulas and Concepts Descriptive Statistics (Ch. 14) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume
More informationTreatment and analysis of data Applied statistics Lecture 3: Sampling and descriptive statistics
Treatment and analysis of data Applied statistics Lecture 3: Sampling and descriptive statistics Topics covered: Parameters and statistics Sample mean and sample standard deviation Order statistics and
More informationSection 2.4 Numerical Measures of Central Tendency
Section 2.4 Numerical Measures of Central Tendency 2.4.1 Definitions Mean: The Mean of a quantitative dataset is the sum of the observations in the dataset divided by the number of observations in the
More informationDescriptive Statistics
Chapter 2 Descriptive Statistics 2.1 Descriptive Statistics 1 2.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Display data graphically and interpret graphs:
More information2. Filling Data Gaps, Data validation & Descriptive Statistics
2. Filling Data Gaps, Data validation & Descriptive Statistics Dr. Prasad Modak Background Data collected from field may suffer from these problems Data may contain gaps ( = no readings during this period)
More informationAP * Statistics Review. Descriptive Statistics
AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production
More informationBNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I
BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential
More informationWeek 1. Exploratory Data Analysis
Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam
More informationSession 1.6 Measures of Central Tendency
Session 1.6 Measures of Central Tendency Measures of location (Indices of central tendency) These indices locate the center of the frequency distribution curve. The mode, median, and mean are three indices
More informationExploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
More informationCA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction
CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous
More informationDr. Peter Tröger Hasso Plattner Institute, University of Potsdam. Software Profiling Seminar, Statistics 101
Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam Software Profiling Seminar, 2013 Statistics 101 Descriptive Statistics Population Object Object Object Sample numerical description Object
More informationMCQ S OF MEASURES OF CENTRAL TENDENCY
MCQ S OF MEASURES OF CENTRAL TENDENCY MCQ No 3.1 Any measure indicating the centre of a set of data, arranged in an increasing or decreasing order of magnitude, is called a measure of: (a) Skewness (b)
More informationThe Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)
Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,
More information1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number
1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x  x) B. x 3 x C. 3x  x D. x  3x 2) Write the following as an algebraic expression
More informationMathematics. Probability and Statistics Curriculum Guide. Revised 2010
Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction
More informationMind on Statistics. Chapter 2
Mind on Statistics Chapter 2 Sections 2.1 2.3 1. Tallies and crosstabulations are used to summarize which of these variable types? A. Quantitative B. Mathematical C. Continuous D. Categorical 2. The table
More informationDescriptive Statistics
Descriptive Statistics Suppose following data have been collected (heights of 99 fiveyearold boys) 117.9 11.2 112.9 115.9 18. 14.6 17.1 117.9 111.8 16.3 111. 1.4 112.1 19.2 11. 15.4 99.4 11.1 13.3 16.9
More informationLecture 2: Descriptive Statistics and Exploratory Data Analysis
Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals
More information3.1 Measures of central tendency: mode, median, mean, midrange Dana Lee Ling (2012)
3.1 Measures of central tendency: mode, median, mean, midrange Dana Lee Ling (2012) Mode The mode is the value that occurs most frequently in the data. Spreadsheet programs such as Microsoft Excel or OpenOffice.org
More informationDescriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion
Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research
More informationThe Big 50 Revision Guidelines for S1
The Big 50 Revision Guidelines for S1 If you can understand all of these you ll do very well 1. Know what is meant by a statistical model and the Modelling cycle of continuous refinement 2. Understand
More informationTopic 9 ~ Measures of Spread
AP Statistics Topic 9 ~ Measures of Spread Activity 9 : Baseball Lineups The table to the right contains data on the ages of the two teams involved in game of the 200 National League Division Series. Is
More informationCollege of the Canyons Math 140 Exam 1 Amy Morrow. Name:
Name: Answer the following questions NEATLY. Show all necessary work directly on the exam. Scratch paper will be discarded unread. One point each part unless otherwise marked. 1. Owners of an exercise
More informationCHINHOYI UNIVERSITY OF TECHNOLOGY
CHINHOYI UNIVERSITY OF TECHNOLOGY SCHOOL OF NATURAL SCIENCES AND MATHEMATICS DEPARTMENT OF MATHEMATICS MEASURES OF CENTRAL TENDENCY AND DISPERSION INTRODUCTION From the previous unit, the Graphical displays
More information6.4 Normal Distribution
Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under
More informationVariables. Exploratory Data Analysis
Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is
More information2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.
Math 1530017 Exam 1 February 19, 2009 Name Student Number E There are five possible responses to each of the following multiple choice questions. There is only on BEST answer. Be sure to read all possible
More informationIntroduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data
A Few Sources for Data Examples Used Introduction to Environmental Statistics Professor Jessica Utts University of California, Irvine jutts@uci.edu 1. Statistical Methods in Water Resources by D.R. Helsel
More informationHomework 8 Solutions
Homework 8 Solutions Chapter 5D Review Questions. 6. What is an exponential scale? When is an exponential scale useful? An exponential scale is one in which each unit corresponds to a power of. In general,
More information! x sum of the entries
3.1 Measures of Central Tendency (Page 1 of 16) 3.1 Measures of Central Tendency Mean, Median and Mode! x sum of the entries a. mean, x = = n number of entries Example 1 Find the mean of 26, 18, 12, 31,
More informationMean = (sum of the values / the number of the value) if probabilities are equal
Population Mean Mean = (sum of the values / the number of the value) if probabilities are equal Compute the population mean Population/Sample mean: 1. Collect the data 2. sum all the values in the population/sample.
More informationEXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!
STP 231 EXAM #1 (Example) Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.
More informationCentral Tendency. n Measures of Central Tendency: n Mean. n Median. n Mode
Central Tendency Central Tendency n A single summary score that best describes the central location of an entire distribution of scores. n Measures of Central Tendency: n Mean n The sum of all scores divided
More informationMathematical goals. Starting points. Materials required. Time needed
Level S6 of challenge: B/C S6 Interpreting frequency graphs, cumulative cumulative frequency frequency graphs, graphs, box and box whisker and plots whisker plots Mathematical goals Starting points Materials
More informationGeostatistics Exploratory Analysis
Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa Master of Science in Geospatial Technologies Geostatistics Exploratory Analysis Carlos Alberto Felgueiras cfelgueiras@isegi.unl.pt
More informationMEAN 34 + 31 + 37 + 44 + 38 + 34 + 42 + 34 + 43 + 41 = 378 MEDIAN
MEASURES OF CENTRAL TENDENCY MEASURES OF CENTRAL TENDENCY The measures of central tendency are numbers that locate the center of a set of data. The three most common measures of center are mean, median
More informationHISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS
Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS
More informationTHE BINOMIAL DISTRIBUTION & PROBABILITY
REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution
More informationThis is Descriptive Statistics, chapter 2 from the book Beginning Statistics (index.html) (v. 1.0).
This is Descriptive Statistics, chapter from the book Beginning Statistics (index.html) (v..). This book is licensed under a Creative Commons byncsa. (http://creativecommons.org/licenses/byncsa/./)
More informationGraphical and Tabular. Summarization of Data OPRE 6301
Graphical and Tabular Summarization of Data OPRE 6301 Introduction and Recap... Descriptive statistics involves arranging, summarizing, and presenting a set of data in such a way that useful information
More informationStatistics 1040 Dr. Tom McGahagan DATA AND DESCRIPTIVE STATISTICS
Statistics 1040 Dr. Tom McGahagan DATA AND DESCRIPTIVE STATISTICS Data  information on a variable or group of variables, which may be either numeric (examples: income in dollars, weight in pounds) or
More informationComplement: 0.4 x 0.8 = =.6
Homework Chapter 5 Name: 1. Use the graph below 1 a) Why is the total area under this curve equal to 1? Rectangle; A = LW A = 1(1) = 1 b) What percent of the observations lie above 0.8? 1 .8 =.2; A =
More informationMULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.
Exam Name 1) A recent report stated ʺBased on a sample of 90 truck drivers, there is evidence to indicate that, on average, independent truck drivers earn more than company hired truck drivers.ʺ Does
More informationDiagrams and Graphs of Statistical Data
Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in
More informationStatistical Analysis Using Gnumeric
Statistical Analysis Using Gnumeric There are many software packages that will analyse data. For casual analysis, a spreadsheet may be an appropriate tool. Popular spreadsheets include Microsoft Excel,
More information3.2 Measures of Spread
3.2 Measures of Spread In some data sets the observations are close together, while in others they are more spread out. In addition to measures of the center, it's often important to measure the spread
More informationData handling and descriptive statistics in Proficiency Testing Microbiology
Data handling and descriptive statistics in Proficiency Testing Microbiology In relation to the standards ISO/IEC 1743 and ISO 1328 by PhD Microbiology division, Science department 1 Descriptive statistics
More information8. THE NORMAL DISTRIBUTION
8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,
More informationChapter 2  Graphical Summaries of Data
Chapter 2  Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense
More informationconsider the number of math classes taken by math 150 students. how can we represent the results in one number?
ch 3: numerically summarizing data  center, spread, shape 3.1 measure of central tendency or, give me one number that represents all the data consider the number of math classes taken by math 150 students.
More informationBASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi110 012 seema@iasri.res.in Genomics A genome is an organism s
More informationNorthumberland Knowledge
Northumberland Knowledge Know Guide How to Analyse Data  November 2012  This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about
More informationFirst Midterm Exam (MATH1070 Spring 2012)
First Midterm Exam (MATH1070 Spring 2012) Instructions: This is a one hour exam. You can use a notecard. Calculators are allowed, but other electronics are prohibited. 1. [40pts] Multiple Choice Problems
More informationMeasures of Central Tendency and Variability: Summarizing your Data for Others
Measures of Central Tendency and Variability: Summarizing your Data for Others 1 I. Measures of Central Tendency: Allow us to summarize an entire data set with a single value (the midpoint). 1. Mode :
More informationWhat are Data? The Research Question (Randomised Controlled Trials (RCTs)) The Research Question (Non RCTs)
What are Data? Quantitative Data o Sets of measurements of objective descriptions of physical and behavioural events; susceptible to statistical analysis Qualitative data o Descriptive, views, actions
More informationProbability and Statistics Vocabulary List (Definitions for Middle School Teachers)
Probability and Statistics Vocabulary List (Definitions for Middle School Teachers) B Bar graph a diagram representing the frequency distribution for nominal or discrete data. It consists of a sequence
More information