F. Farrokhyar, MPhil, PhD, PDoc

Size: px
Start display at page:

Download "F. Farrokhyar, MPhil, PhD, PDoc"

Transcription

1 Learning objectives Descriptive Statistics F. Farrokhyar, MPhil, PhD, PDoc To recognize different types of variables To learn how to appropriately explore your data How to display data using graphs How to display data with numbers and tables To learn about measures of central tendency To learn about the measures of variation Descriptive and Inferential statistics? Descriptive statistics help us with the presentation, organization, and summarization of data. Inferential statistics allow us to make inferences from a sample of individuals to a larger population. What is data? Data is a set of information or observation about a group of individuals or subjects. This information is organized in the form of variables. A variable is any characteristic of a person or a subject that can be measured or categorized. Its value varies from individual to individual. Type of variables Qualitative or attribute variable Nonnumeric gender (male, female), type of injury (blunt, fall, burn, etc) Quantitative variable Numeric Discrete variable can assume only whole numbers no. of accidents, no. of injuries, no. of positive nodes Continuous variable may take any value, within a defined range: weight, age, blood pressure, level of cholesterol Level of measurement There are four levels of measurement: Nominal Ordinal. Interval. Ratio 1

2 Level of measurement cont d Nominal variable: consists of named categories with no order among the categories. - binomial ---- gender, mortality - multinomial ---- type of injury, blood type Level of measurement cont d Interval variable: has equal distances between values with no meaningful zero value. -IQ test - Temperature (0 o C does not represent absence of temperature Ordinal variable: consists of ordered categories, where the differences between categories cannot be considered to be equal. - Tumour stage 1,, 3, 4 - Likert scale excellent, very good, good, fair, poor Ratio variable: has equal intervals between values and a meaningful zero point. The ratio between them makes sense. - height, weight, laboratory test values Level of measurement Variable type: Nominal Ordinal. Interval. ratio Assumptions: Named categories Same as nominal plus ordered categories Same as ordinal plus equal intervals Same as interval plus meaningful zero Type of variables Dependent variable Is the outcome of interest, which changes in response to some intervention or exposure. - mortality, survival, post-op pain, quality of life Independent variable Is the explanatory variable that explains the changes in the dependent variable - demographics (age, gender, height), risk factors (diabetes, BP) Is the intervention or exposure variable that causes the changes in the dependent variable. - drug, surgery, radiation, smoking Independent (Explanatory) variables: Age, Sex, Pre-op pain Severity Independent (Comparison) variable Dependent/outcome variables: Changes in pain, Complication Describing Categorical data Graphs Bar charts Pie charts

3 Bar charts Bar Charts Used to display nominal or ordinal data. It is a series of separated bars. Bars represent frequency (counts) or relative frequency (percent or proportion) of each category. Used to display data for more than one group. Bar Charts Pie charts Used for nominal and ordinal data. Used to display relative frequency distribution. The circle is divided proportionally using relative frequency of each category. A pie chart is useful for showing data for one group but it is useless for illustration of two or more groups. Pie Charts Describing Categorical data Numerically Frequencies (counts) Relative frequencies (%) 3

4 Cross-tabulation of categorical data Type of surgery Open Laparoscopic Total Severity mild 4 (7%) 3 (0%) 7 (3%) moderate severe 6 (40%) 5 (33%) 7 (47%) 5 (33%) 13 (43%) 10 (33%) Describing quantitative data Graphs Histograms The five-number summary Boxplot Sex male female 7 (47%) 8 (53%) 4 (7%) 11 (73%) 11 (37%) 19 (63%) Histogram Histograms Used for interval and ratio data. A histogram is a graph in which each bar (horizontal axis) represent a range of numbers called interval width. The vertical axis represents the frequency of each interval. There are no spaces between bars. The frequencies are represented by the bar height and area of each bar Histogram is useful for graphic illustration of one group. Box plot: 5 number summary 100 th Maximum Q3 Median (Q) Q1 Box Plots Used for interval and ratio data. Uses the five-number summary measures Median, Q1, Q3, minimum and maximum. It is useful in detecting outliers It is useful to illustrate the distribution of more than on group. 1 st Minimum 4

5 Box plot of change in pain score Scatter plot Used to display the relationship between two continuous variables. Describing quantitative data Numbers Measures of central tendency mode, median, mean Measures of spread range, interquartile range, variance, standard deviation Mode Measures of central tendency Mode is the most frequent value the highest peak Used for nominal, ordinal, interval and ratio data. Could be more than one mode. Example: pain score 1, 4, 6, 8, 5, 6, 3,, 15 1,, 3, 4, 5, 6, 6, 8, 15 Median Measures of central tendency Median is the midpoint of the values after arranging the observations in order of size, from smallest to largest. There is a unique median for each dataset Used for interval and ratio data. It may not be necessarily equal to one of the sample values. Properties: It is resistant (insensitive) toward extreme values. It is useful for summarising skewed data. Mean Measures of central tendency Mean is the sum of sample values divided by the number of sample values --- n. It is useful for interval and ratio data. n x i i 1 X = = n = = Example - 1,, 3, 4, 5, 6, 6, 8, 15 5

6 Properties of mean Measures of central tendency There is a unique mean for each dataset. All values are included in the computation. It is the only measure of central tendency where the sum of deviations of each value from the mean will always be zero. n ( X i - X ) i= 1 Normal curve Skewed curve The mean is sensitive toward extreme values. X Mean Median Mode Mean Median Mode Measures of Spread Range Interquartile range Variance Standard deviation Range Used mainly for interval or ratio data Range is the differences between the largest and smallest values in a dataset. Properties It uses only two values in its calculation. It is effected by extreme values. It is easy to understand. 1,, 3, 4, 5, 6, 6, 8, range = 14 Interquartile range Used mainly for interval and ratio data It is the distance between the third quartile (Q 3 ) and the first quartile (Q 1 ). Interquartile range = Q 3 Q 1 Interquartile range It is resistant (insensitive) to extreme values. It is useful for summarising skewed interval and ratio data. Arrange the observations from smallest to largest. Divide into 4 equal parts. Example, 1,, 3, 4, 5, 6, 6, 8, 15 1 st quartile (Q 1 ) = (+3)/ =.5 Median (Q ) = 5 3 rd quartile (Q 3 ) = (6+8) / = 7 Interquartile range = 7.5 = 4.5 6

7 Interquartile range Used to locate the outliers. What are outliers? Outliers are extreme data values that fall outside of distribution of the data set. 1.5 IQR Criterion for Outliers Interquartile range (IQR) is the distance between the first and third quartiles. IQR = Q 3 Q 1 From data Q 1 = 59 yrs, Q 3 = 70 yrs, IQR = = IQR = = 16.5 Q 1 IQR = = 4.5 Q 3 + IQR = = 86.5 From data: Min= 44 and Max = Box plot: 5 number summary th Outliers: 8 < 4.5 > 86.5 Q3 Median (Q) Q1 1 st 44 Variance Used for interval or ratio data Is the average of the squared deviations from the mean population variance σ n - ( x i x ) = i = 1 N sample variance n ( x i - x ) = n - 1 i = 1 Degrees of freedom measure the amount of information available in the data that can be to estimate σ. Here, the df is n-1 rather than n because we lose 1 df by estimating the sample mean. s Variance Properties All values are used in the calculation The units are not the same as data, they are the square of the original units Standard deviation is square root of variance sd = n (xi - x) n - 1 i=1 = 4.1 Example: 1,, 3, 4, 5, 6, 6, 8, 15 It is the average deviation from the mean in the same unit as the data. (1 5.5) + ( 5.5) + (3 5.5) (15 5.5) S = 9 1 = 17. 7

8 Uses of standard deviation Standard normal curve It is used for Empirical Rule. For any symmetrical distribution: About 68% of the observations will lie within 1 s.d. of the mean. About 95% of the observations will lie within s.d. of the mean. About 99.8% of the observations will lie within 3 s.d. of the mean. Summary of what we have learned. We report Mean with standard deviation Median with first and third quartiles Median with minimum and maximum Data type Graph Numerically Ratio and interval Histogram Box plot Scatter plot Mean with standard deviation Median with IQR, range Mode Ordinal data Bar chart Count and % Pie chart Median IQR, range mode Nominal Bar chart Pie chart Count and % mode 8

Content DESCRIPTIVE STATISTICS. Data & Statistic. Statistics. Example: DATA VS. STATISTIC VS. STATISTICS

Content DESCRIPTIVE STATISTICS. Data & Statistic. Statistics. Example: DATA VS. STATISTIC VS. STATISTICS Content DESCRIPTIVE STATISTICS Dr Najib Majdi bin Yaacob MD, MPH, DrPH (Epidemiology) USM Unit of Biostatistics & Research Methodology School of Medical Sciences Universiti Sains Malaysia. Introduction

More information

Chapter 2: Exploring Data with Graphs and Numerical Summaries. Graphical Measures- Graphs are used to describe the shape of a data set.

Chapter 2: Exploring Data with Graphs and Numerical Summaries. Graphical Measures- Graphs are used to describe the shape of a data set. Page 1 of 16 Chapter 2: Exploring Data with Graphs and Numerical Summaries Graphical Measures- Graphs are used to describe the shape of a data set. Section 1: Types of Variables In general, variable can

More information

Chapter 3: Central Tendency

Chapter 3: Central Tendency Chapter 3: Central Tendency Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the distribution and represents

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

Data Mining Part 2. Data Understanding and Preparation 2.1 Data Understanding Spring 2010

Data Mining Part 2. Data Understanding and Preparation 2.1 Data Understanding Spring 2010 Data Mining Part 2. and Preparation 2.1 Spring 2010 Instructor: Dr. Masoud Yaghini Introduction Outline Introduction Measuring the Central Tendency Measuring the Dispersion of Data Graphic Displays References

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

Desciptive Statistics Qualitative data Quantitative data Graphical methods Numerical methods

Desciptive Statistics Qualitative data Quantitative data Graphical methods Numerical methods Desciptive Statistics Qualitative data Quantitative data Graphical methods Numerical methods Qualitative data Data are classified in categories Non numerical (although may be numerically codified) Elements

More information

Chapter 3: Data Description Numerical Methods

Chapter 3: Data Description Numerical Methods Chapter 3: Data Description Numerical Methods Learning Objectives Upon successful completion of Chapter 3, you will be able to: Summarize data using measures of central tendency, such as the mean, median,

More information

We will use the following data sets to illustrate measures of center. DATA SET 1 The following are test scores from a class of 20 students:

We will use the following data sets to illustrate measures of center. DATA SET 1 The following are test scores from a class of 20 students: MODE The mode of the sample is the value of the variable having the greatest frequency. Example: Obtain the mode for Data Set 1 77 For a grouped frequency distribution, the modal class is the class having

More information

Univariate Descriptive Statistics

Univariate Descriptive Statistics Univariate Descriptive Statistics Displays: pie charts, bar graphs, box plots, histograms, density estimates, dot plots, stemleaf plots, tables, lists. Example: sea urchin sizes Boxplot Histogram Urchin

More information

Descriptive Statistics. Frequency Distributions and Their Graphs 2.1. Frequency Distributions. Chapter 2

Descriptive Statistics. Frequency Distributions and Their Graphs 2.1. Frequency Distributions. Chapter 2 Chapter Descriptive Statistics.1 Frequency Distributions and Their Graphs Frequency Distributions A frequency distribution is a table that shows classes or intervals of data with a count of the number

More information

A frequency distribution is a table used to describe a data set. A frequency table lists intervals or ranges of data values called data classes

A frequency distribution is a table used to describe a data set. A frequency table lists intervals or ranges of data values called data classes A frequency distribution is a table used to describe a data set. A frequency table lists intervals or ranges of data values called data classes together with the number of data values from the set that

More information

How to interpret scientific & statistical graphs

How to interpret scientific & statistical graphs How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics theresa.scott@vanderbilt.edu http://biostat.mc.vanderbilt.edu/theresascott 1 A brief introduction Graphics:

More information

Numerical Summaries. Chapter 2. Mean or Average. Median (M) Basic Practice of Statistics - 3rd Edition

Numerical Summaries. Chapter 2. Mean or Average. Median (M) Basic Practice of Statistics - 3rd Edition Numerical Summaries Chapter 2 Describing Distributions with Numbers Center of the data mean median Variation range quartiles (interquartile range) variance standard deviation BPS - 5th Ed. Chapter 2 1

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

Variables. Exploratory Data Analysis

Variables. Exploratory Data Analysis Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is

More information

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html

More information

Descriptive Statistics. Understanding Data: Categorical Variables. Descriptive Statistics. Dataset: Shellfish Contamination

Descriptive Statistics. Understanding Data: Categorical Variables. Descriptive Statistics. Dataset: Shellfish Contamination Descriptive Statistics Understanding Data: Dataset: Shellfish Contamination Location Year Species Species2 Method Metals Cadmium (mg kg - ) Chromium (mg kg - ) Copper (mg kg - ) Lead (mg kg - ) Mercury

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

Chapter 1: Looking at Data Distributions. Dr. Nahid Sultana

Chapter 1: Looking at Data Distributions. Dr. Nahid Sultana Chapter 1: Looking at Data Distributions Dr. Nahid Sultana Chapter 1: Looking at Data Distributions 1.1 Displaying Distributions with Graphs 1.2 Describing Distributions with Numbers 1.3 Density Curves

More information

Chapter 2. Objectives. Tabulate Qualitative Data. Frequency Table. Descriptive Statistics: Organizing, Displaying and Summarizing Data.

Chapter 2. Objectives. Tabulate Qualitative Data. Frequency Table. Descriptive Statistics: Organizing, Displaying and Summarizing Data. Objectives Chapter Descriptive Statistics: Organizing, Displaying and Summarizing Data Student should be able to Organize data Tabulate data into frequency/relative frequency tables Display data graphically

More information

Frequency distributions, central tendency & variability. Displaying data

Frequency distributions, central tendency & variability. Displaying data Frequency distributions, central tendency & variability Displaying data Software SPSS Excel/Numbers/Google sheets Social Science Statistics website (socscistatistics.com) Creating and SPSS file Open the

More information

Statistics Chapter 3 Averages and Variations

Statistics Chapter 3 Averages and Variations Statistics Chapter 3 Averages and Variations Measures of Central Tendency Average a measure of the center value or central tendency of a distribution of values. Three types of average: Mode Median Mean

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

vs. relative cumulative frequency

vs. relative cumulative frequency Variable - what we are measuring Quantitative - numerical where mathematical operations make sense. These have UNITS Categorical - puts individuals into categories Numbers don't always mean Quantitative...

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

2. Describing Data. We consider 1. Graphical methods 2. Numerical methods 1 / 56

2. Describing Data. We consider 1. Graphical methods 2. Numerical methods 1 / 56 2. Describing Data We consider 1. Graphical methods 2. Numerical methods 1 / 56 General Use of Graphical and Numerical Methods Graphical methods can be used to visually and qualitatively present data and

More information

2 Descriptive statistics with R

2 Descriptive statistics with R Biological data analysis, Tartu 2006/2007 1 2 Descriptive statistics with R Before starting with basic concepts of data analysis, one should be aware of different types of data and ways to organize data

More information

10-3 Measures of Central Tendency and Variation

10-3 Measures of Central Tendency and Variation 10-3 Measures of Central Tendency and Variation So far, we have discussed some graphical methods of data description. Now, we will investigate how statements of central tendency and variation can be used.

More information

Numerical Measures of Central Tendency

Numerical Measures of Central Tendency Numerical Measures of Central Tendency Often, it is useful to have special numbers which summarize characteristics of a data set These numbers are called descriptive statistics or summary statistics. A

More information

Northumberland Knowledge

Northumberland Knowledge Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

IQR Rule for Outliers

IQR Rule for Outliers 1. Arrange data in order. IQR Rule for Outliers 2. Calculate first quartile (Q1), third quartile (Q3) and the interquartile range (IQR=Q3-Q1). CO2 emissions example: Q1=0.9, Q3=6.05, IQR=5.15. 3. Compute

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

Statistical Analysis I

Statistical Analysis I CTSI BERD Research Methods Seminar Series Statistical Analysis I Lan Kong, PhD Associate Professor Department of Public Health Sciences December 22, 2014 Biostatistics, Epidemiology, Research Design(BERD)

More information

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers 1.3 Measuring Center & Spread, The Five Number Summary & Boxplots Describing Quantitative Data with Numbers 1.3 I can n Calculate and interpret measures of center (mean, median) in context. n Calculate

More information

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.) Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center

More information

Foundation of Quantitative Data Analysis

Foundation of Quantitative Data Analysis Foundation of Quantitative Data Analysis Part 1: Data manipulation and descriptive statistics with SPSS/Excel HSRS #10 - October 17, 2013 Reference : A. Aczel, Complete Business Statistics. Chapters 1

More information

What are Data? The Research Question (Randomised Controlled Trials (RCTs)) The Research Question (Non RCTs)

What are Data? The Research Question (Randomised Controlled Trials (RCTs)) The Research Question (Non RCTs) What are Data? Quantitative Data o Sets of measurements of objective descriptions of physical and behavioural events; susceptible to statistical analysis Qualitative data o Descriptive, views, actions

More information

Central Tendency. n Measures of Central Tendency: n Mean. n Median. n Mode

Central Tendency. n Measures of Central Tendency: n Mean. n Median. n Mode Central Tendency Central Tendency n A single summary score that best describes the central location of an entire distribution of scores. n Measures of Central Tendency: n Mean n The sum of all scores divided

More information

Basics and Beyond: Displaying Your Data. Mario Davidson, PhD Vanderbilt University School of Medicine Department of Biostatistics Instructor

Basics and Beyond: Displaying Your Data. Mario Davidson, PhD Vanderbilt University School of Medicine Department of Biostatistics Instructor Basics and Beyond: Displaying Your Data Mario Davidson, PhD Vanderbilt University School of Medicine Department of Biostatistics Instructor Objectives 1.Understand the types of data and levels of measurement

More information

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

More information

Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam. Software Profiling Seminar, Statistics 101

Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam. Software Profiling Seminar, Statistics 101 Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam Software Profiling Seminar, 2013 Statistics 101 Descriptive Statistics Population Object Object Object Sample numerical description Object

More information

4. Introduction to Statistics

4. Introduction to Statistics Statistics for Engineers 4-1 4. Introduction to Statistics Descriptive Statistics Types of data A variate or random variable is a quantity or attribute whose value may vary from one unit of investigation

More information

GCSE HIGHER Statistics Key Facts

GCSE HIGHER Statistics Key Facts GCSE HIGHER Statistics Key Facts Collecting Data When writing questions for questionnaires, always ensure that: 1. the question is worded so that it will allow the recipient to give you the information

More information

Statistics revision. Dr. Inna Namestnikova. Statistics revision p. 1/8

Statistics revision. Dr. Inna Namestnikova. Statistics revision p. 1/8 Statistics revision Dr. Inna Namestnikova inna.namestnikova@brunel.ac.uk Statistics revision p. 1/8 Introduction Statistics is the science of collecting, analyzing and drawing conclusions from data. Statistics

More information

What is Statistics? Statistics is about Collecting data Organizing data Analyzing data Presenting data

What is Statistics? Statistics is about Collecting data Organizing data Analyzing data Presenting data Introduction What is Statistics? Statistics is about Collecting data Organizing data Analyzing data Presenting data What is Statistics? Statistics is divided into two areas: descriptive statistics and

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

Summarizing and Displaying Categorical Data

Summarizing and Displaying Categorical Data Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

Chapter 2 Summarizing and Graphing Data

Chapter 2 Summarizing and Graphing Data Chapter 2 Summarizing and Graphing Data 2-1 Review and Preview 2-2 Frequency Distributions 2-3 Histograms 2-4 Graphs that Enlighten and Graphs that Deceive Preview Characteristics of Data 1. Center: A

More information

Mind on Statistics. Chapter 2

Mind on Statistics. Chapter 2 Mind on Statistics Chapter 2 Sections 2.1 2.3 1. Tallies and cross-tabulations are used to summarize which of these variable types? A. Quantitative B. Mathematical C. Continuous D. Categorical 2. The table

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Data Exploration Data Visualization

Data Exploration Data Visualization Data Exploration Data Visualization What is data exploration? A preliminary exploration of the data to better understand its characteristics. Key motivations of data exploration include Helping to select

More information

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1

DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 DESCRIPTIVE STATISTICS - CHAPTERS 1 & 2 1 OVERVIEW STATISTICS PANIK...THE THEORY AND METHODS OF COLLECTING, ORGANIZING, PRESENTING, ANALYZING, AND INTERPRETING DATA SETS SO AS TO DETERMINE THEIR ESSENTIAL

More information

1 Measures for location and dispersion of a sample

1 Measures for location and dispersion of a sample Statistical Geophysics WS 2008/09 7..2008 Christian Heumann und Helmut Küchenhoff Measures for location and dispersion of a sample Measures for location and dispersion of a sample In the following: Variable

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. C) (a) 3 (b) 51

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. C) (a) 3 (b) 51 Chapter 2- Problems to look at Use the given frequency distribution to find the (a) class width. (b) class midpoints of the first class. (c) class boundaries of the first class. 1) Height (in inches) 1)

More information

Lecture I. Definition 1. Statistics is the science of collecting, organizing, summarizing and analyzing the information in order to draw conclusions.

Lecture I. Definition 1. Statistics is the science of collecting, organizing, summarizing and analyzing the information in order to draw conclusions. Lecture 1 1 Lecture I Definition 1. Statistics is the science of collecting, organizing, summarizing and analyzing the information in order to draw conclusions. It is a process consisting of 3 parts. Lecture

More information

Chapter 3 Descriptive Statistics: Numerical Measures. Learning objectives

Chapter 3 Descriptive Statistics: Numerical Measures. Learning objectives Chapter 3 Descriptive Statistics: Numerical Measures Slide 1 Learning objectives 1. Single variable Part I (Basic) 1.1. How to calculate and use the measures of location 1.. How to calculate and use the

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

Visual Display of Data in Stata

Visual Display of Data in Stata Lab 2 Visual Display of Data in Stata In this lab we will try to understand data not only through numerical summaries, but also through graphical summaries. The data set consists of a number of variables

More information

Mathematics. Probability and Statistics Curriculum Guide. Revised 2010

Mathematics. Probability and Statistics Curriculum Guide. Revised 2010 Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

Exam # 1 STAT The number of people from the state of Alaska الاسكا) (ولاية who voted for a Republican

Exam # 1 STAT The number of people from the state of Alaska الاسكا) (ولاية who voted for a Republican King Abdulaziz University Faculty of Sciences Statistics Department Name: ID No: Exam # 1 STAT 11 First Term 149-143H Section: 6 You have 6 questions in 7 pages. You have 1 minutes to solve the exam. Please

More information

STAT 155 Introductory Statistics. Lecture 5: Density Curves and Normal Distributions (I)

STAT 155 Introductory Statistics. Lecture 5: Density Curves and Normal Distributions (I) The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 5: Density Curves and Normal Distributions (I) 9/12/06 Lecture 5 1 A problem about Standard Deviation A variable

More information

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data A Few Sources for Data Examples Used Introduction to Environmental Statistics Professor Jessica Utts University of California, Irvine jutts@uci.edu 1. Statistical Methods in Water Resources by D.R. Helsel

More information

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo

Describe what is meant by a placebo Contrast the double-blind procedure with the single-blind procedure Review the structure for organizing a memo Readings: Ha and Ha Textbook - Chapters 1 8 Appendix D & E (online) Plous - Chapters 10, 11, 12 and 14 Chapter 10: The Representativeness Heuristic Chapter 11: The Availability Heuristic Chapter 12: Probability

More information

Histogram. Graphs, and measures of central tendency and spread. Alternative: density (or relative frequency ) plot /13/2004

Histogram. Graphs, and measures of central tendency and spread. Alternative: density (or relative frequency ) plot /13/2004 Graphs, and measures of central tendency and spread 9.07 9/13/004 Histogram If discrete or categorical, bars don t touch. If continuous, can touch, should if there are lots of bins. Sum of bin heights

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

More information

Introduction to Descriptive Statistics

Introduction to Descriptive Statistics Mathematics Learning Centre Introduction to Descriptive Statistics Jackie Nicholas c 1999 University of Sydney Acknowledgements Parts of this booklet were previously published in a booklet of the same

More information

GCSE Statistics Revision notes

GCSE Statistics Revision notes GCSE Statistics Revision notes Collecting data Sample This is when data is collected from part of the population. There are different methods for sampling Random sampling, Stratified sampling, Systematic

More information

Data Analysis: Describing Data - Descriptive Statistics

Data Analysis: Describing Data - Descriptive Statistics WHAT IT IS Return to Table of ontents Descriptive statistics include the numbers, tables, charts, and graphs used to describe, organize, summarize, and present raw data. Descriptive statistics are most

More information

SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)

SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Chapter 1 Review 1. As part of survey of college students a researcher is interested in the variable class standing. She records a 1 if the student is a freshman, a 2 if the student

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

Research Variables. Measurement. Scales of Measurement. Chapter 4: Data & the Nature of Measurement

Research Variables. Measurement. Scales of Measurement. Chapter 4: Data & the Nature of Measurement Chapter 4: Data & the Nature of Graziano, Raulin. Research Methods, a Process of Inquiry Presented by Dustin Adams Research Variables Variable Any characteristic that can take more than one form or value.

More information

Describing Data. Carolyn J. Anderson EdPsych 580 Fall Describing Data p. 1/42

Describing Data. Carolyn J. Anderson EdPsych 580 Fall Describing Data p. 1/42 Describing Data Carolyn J. Anderson EdPsych 580 Fall 2005 Describing Data p. 1/42 Describing Data Numerical Descriptions Single Variable Relationship Graphical displays Single variable. Relationships in

More information

AP * Statistics Review. Descriptive Statistics

AP * Statistics Review. Descriptive Statistics AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

More information

Comments 2 For Discussion Sheet 2 and Worksheet 2 Frequency Distributions and Histograms

Comments 2 For Discussion Sheet 2 and Worksheet 2 Frequency Distributions and Histograms Comments 2 For Discussion Sheet 2 and Worksheet 2 Frequency Distributions and Histograms Discussion Sheet 2 We have studied graphs (charts) used to represent categorical data. We now want to look at a

More information

Basics of Statistics

Basics of Statistics Basics of Statistics Jarkko Isotalo 30 20 10 Std. Dev = 486.32 Mean = 3553.8 0 N = 120.00 2400.0 2800.0 3200.0 3600.0 4000.0 4400.0 4800.0 2600.0 3000.0 3400.0 3800.0 4200.0 4600.0 5000.0 Birthweights

More information

Describing, Exploring, and Comparing Data

Describing, Exploring, and Comparing Data 24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

More information

Chapter 2 - Graphical Summaries of Data

Chapter 2 - Graphical Summaries of Data Chapter 2 - Graphical Summaries of Data Data recorded in the sequence in which they are collected and before they are processed or ranked are called raw data. Raw data is often difficult to make sense

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles. Math 1530-017 Exam 1 February 19, 2009 Name Student Number E There are five possible responses to each of the following multiple choice questions. There is only on BEST answer. Be sure to read all possible

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

III. GRAPHICAL METHODS

III. GRAPHICAL METHODS Pie Charts and Bar Charts: III. GRAPHICAL METHODS Pie charts and bar charts are used for depicting frequencies or relative frequencies. We compare examples of each using the same data. Sources: AT&T (1961)

More information

CHINHOYI UNIVERSITY OF TECHNOLOGY

CHINHOYI UNIVERSITY OF TECHNOLOGY CHINHOYI UNIVERSITY OF TECHNOLOGY SCHOOL OF NATURAL SCIENCES AND MATHEMATICS DEPARTMENT OF MATHEMATICS MEASURES OF CENTRAL TENDENCY AND DISPERSION INTRODUCTION From the previous unit, the Graphical displays

More information

Technology Step-by-Step Using StatCrunch

Technology Step-by-Step Using StatCrunch Technology Step-by-Step Using StatCrunch Section 1.3 Simple Random Sampling 1. Select Data, highlight Simulate Data, then highlight Discrete Uniform. 2. Fill in the following window with the appropriate

More information

Chapter 2: Frequency Distributions and Graphs (or making pretty tables and pretty pictures)

Chapter 2: Frequency Distributions and Graphs (or making pretty tables and pretty pictures) Chapter 2: Frequency Distributions and Graphs (or making pretty tables and pretty pictures) Example: Titanic passenger data is available for 1310 individuals for 14 variables, though not all variables

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

Quantitative Data Analysis: Choosing a statistical test Prepared by the Office of Planning, Assessment, Research and Quality

Quantitative Data Analysis: Choosing a statistical test Prepared by the Office of Planning, Assessment, Research and Quality Quantitative Data Analysis: Choosing a statistical test Prepared by the Office of Planning, Assessment, Research and Quality 1 To help choose which type of quantitative data analysis to use either before

More information

1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics)

1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics) 1.5 NUMERICAL REPRESENTATION OF DATA (Sample Statistics) As well as displaying data graphically we will often wish to summarise it numerically particularly if we wish to compare two or more data sets.

More information

Statistics GCSE Higher Revision Sheet

Statistics GCSE Higher Revision Sheet Statistics GCSE Higher Revision Sheet This document attempts to sum up the contents of the Higher Tier Statistics GCSE. There is one exam, two hours long. A calculator is allowed. It is worth 75% of the

More information

Table 2-1. Sucrose concentration (% fresh wt.) of 100 sugar beet roots. Beet No. % Sucrose. Beet No.

Table 2-1. Sucrose concentration (% fresh wt.) of 100 sugar beet roots. Beet No. % Sucrose. Beet No. Chapter 2. DATA EXPLORATION AND SUMMARIZATION 2.1 Frequency Distributions Commonly, people refer to a population as the number of individuals in a city or county, for example, all the people in California.

More information

Basic Biostatistics for Clinical Research. Ramses F Sadek, PhD GRU Cancer Center

Basic Biostatistics for Clinical Research. Ramses F Sadek, PhD GRU Cancer Center Basic Biostatistics for Clinical Research Ramses F Sadek, PhD GRU Cancer Center 1 1. Basic Concepts 2. Data & Their Presentation Part One 2 1. Basic Concepts Statistics Biostatistics Populations and samples

More information

M 225 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT!

M 225 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! M 225 Test 1 A Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points 1-14 14 15 3 16 5 17 4 18 4 19 11 20 9 21 8 22 16 Total 75 1 Multiple choice questions (1 point each) 1. Look

More information

Variables and Data A variable contains data about anything we measure. For example; age or gender of the participants or their score on a test.

Variables and Data A variable contains data about anything we measure. For example; age or gender of the participants or their score on a test. The Analysis of Research Data The design of any project will determine what sort of statistical tests you should perform on your data and how successful the data analysis will be. For example if you decide

More information

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement

Measurement & Data Analysis. On the importance of math & measurement. Steps Involved in Doing Scientific Research. Measurement Measurement & Data Analysis Overview of Measurement. Variability & Measurement Error.. Descriptive vs. Inferential Statistics. Descriptive Statistics. Distributions. Standardized Scores. Graphing Data.

More information