TYPES OF DATA TYPES OF VARIABLES

Size: px
Start display at page:

Download "TYPES OF DATA TYPES OF VARIABLES"

Transcription

1 TYPES OF DATA Univariate data Examines the distribution features of one variable. Bivariate data Explores the relationship between two variables. Univariate and bivariate analysis will be revised separately. TYPES OF VARIABLES NUMERICAL VARIABLES Numerical variables represent quantities. They have numerical values. They are measured or counted. Numerical variables can either be continuous or discrete. A continuous variable can take any value in a given range. They are usually measured. Examples: Height, weight, number of litres of fuel. A discrete variable can take on only certain distinct values in a given range. They are often counted. i.e. 0, 1, 2, 3, etc. Examples: Number of siblings, number of goals in netball. CATEGORICAL VARIABLES Categorical variables represent qualities. The answer to the statistical question is a word rather than a number. Examples: Eye colour, favourite brand of cereal, type of car driven Categorical variables can be separated into two types, nominal and ordinal, but while this sometimes helps with understanding, the Further Maths Study design only refers to Categorical variables. The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 4

2 Nominal categorical variables have no natural order, such as eye colour (blue, brown, green ). Ordinal categorical variables have a natural order such as salary (high, medium or low). Sometimes numerical values or scores can be assigned to can ordinal data, for example such as tidies up his/her room (1 = regularly, 2 = sometimes, 3 = rarely, 4 = never). However, these numerical values are artificial and therefore such variables are still considered categorical. SUMMARY The different types of variables collected can be summarised as follows: Warning! It is not the variable name itself that determines whether the data is numerical or not. It is the way the data was recorded. Length, for example, could be recorded in m and hence is numerical BUT if length was recorded as long, average or short it is categorical. QUESTION 1 The level of water usage of 200 homes was rated in a survey as low, medium or high and the size of the houses as small, standard or large. The results of the survey are displayed in the table below. Water Usage House Size Large Standard Small Low Medium High The variables, level of water usage and size of house, as recorded in this survey, are: A B C D E Both numerical variables. Both categorical variables. Neither numerical nor categorical variables. Numerical and categorical variables respectively. Categorical and numerical variables respectively. The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 5

3 QUESTION 2 Which one of the following is an example of continuous numerical data? A B C D E Number of runs made by a cricket player. Speed of a car captured by a speed camera. Your favourite secondary school year level. Shoe sizes. Labor / Liberal preference of 100 people surveyed. QUESTION 3 30 shoppers using the express lane were surveyed to find the number of items being purchased. The results were as follows The type of data is: A B C D E Numerical discrete. Numerical continuous. Categorical ordinal. Categorical nominal. Categorical discrete. UNIVARIATE ANALYSIS The prime objectives of Univariate analysis are to determine: Type of distribution (i.e. shape and if any outliers) Central tendency Spread ORGANISING DATA Data needs to be organised so that further analysis can be completed promptly and accurately. The data is organised either as it is collected or afterwards. The techniques used in Further Maths are: Frequency Tables Dot plots Bar charts Histograms Stem and leaf plots Boxplots The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 6

4 In organising the data there will be the need to group the data. Categorical data is self-evident as each category would be labelled according to the category and is unique from the others. Numerical continuous data will often be grouped. No hard and fast rules apply, but generally the data should have between five and no more than fifteen groups. The size of the interval is usually in 1 s, 2 s, 5 s or 10 s with others as the need arise. To judge this correctly look at the highest and lowest value and divide by a suitable grouping interval to find the number of intervals so that there is no less than five groups or no more than 15 groups. Often the size of the interval is relevant in the big picture. Numerical discrete data can be ungrouped if there are less than 15 different scores. If there are more, then group the data in multiples of 2, 5, 10 so there is a minimum of 5 groups and no more than 15 groups. QUESTION 4 The following is a survey of 30 students recording the number of people in their family living at home. Summarise in a frequency table. Solution Identify it is numerical discrete. 2. There are only 7 different scores so leave as ungrouped. 3. Use a frequency table of two (three) columns. The first column is for the possible scores (x) and can be labelled as Number of members in a family. The second column is for the frequency (f) and can be labelled Number of students or families. Optional 3 rd column used as a tally for very large sets of data. 4. Add up the frequency (Σf ) to confirm scores from 30 students has been recorded. Number of people in the family (x) TALLY Number of Families (f) Σf = 30 The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 7

5 Stem plots (stem and leaf plot): VISUAL DISPLAYS Stem & leaf plot or stemplot is a most useful technique for summarising numerical data. It has two main columns where the stem holds the group value while the leaf holds the final digit in the recorded data. The following stem & leaf plot of 20 students CD collection size. Features of stem & leaf plot display 1. Title CD collection size of 20 students 2. Column heading Stem Leaf 3. Data values summarised 0 4 keep in neat columns the final digits Legend 2 5 = 25 CDs From the stemplot shown the scores collected were 4, 14, 16, 22, 23, 25, 28, 30, 34, 34, 36, 37, 37, 39, 41, 43, 45, 49, 54, 56. The benefits of using a stemplot are all the scores actual values are not lost like they would be in a grouped frequency table. Stemplots visually display the shape of the distribution. Each group is represented as a stem. The stem represents the first digit or set of digits of the available data (called leading digits). Example: A stem of 0 can be used to represent data whose first digit lies within the interval 0 9. A stem of 1 can be used to represent data whose first digit lies within the interval The leaf represents the last digit (called trailing digits) in the available data. If more than one piece of data carries the same value, you must represent each piece of datum in the stem and leaves. Stem and leaf plots can also be written for very large or very small numbers, using the key to overcome the need for decimal points or many zeros. An ordered stem and leaf plot is more useful than an unordered one, as position of values in distributions can be easily found. The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 8

6 QUESTION 5 The golf scores for 30 golfers were recorded as follows. Summarise as an ordered stemplot grouped in 5 s. Comment on the distribution of scores Golf scores for 30 golf players Stem Leaf = 74 strokes QUESTION 6 For the stem & leaf plot shown: Stem Leaf = 7.6 Which of the following data sets match the above stem & leaf plot? A B C D E The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 9

7 Bar Graphs: For categorical data. Length of each bar represents frequency, relative frequency or percentage frequency. Width of bars and spaces between bars must be kept uniform. Note: Bars usually don t touch one another. Segmented bar charts: For categorical data. A segmented bar chart is very similar to a bar chart except the bars are stacked on one another to give a single bar. The lengths of the segments correspond to the data set frequency. Usually the percentage frequency is shown, rather than the frequency count. This is then called a percentaged segmented bar chart. The height of the bar gives the total frequency or 100% if percentage is used. Dot Plots: Dot plots are graphical displays of discrete numerical or categorical data sets. A single dot represents each data point. They show the shape of the distribution. They are often used for small data sets or to compare more than one data set. A dot plot is constructed by drawing a scale and then evenly placing dots above the appropriate values to represent the data points. There is no vertical axis. The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 10

8 QUESTION 7 At a major family reunion of 200 of the Ryan clan, the age groupings were recorded for all the people who attended. For the following data set summarised in a frequency table display as a bar chart (percentaged). Age group Number of Babies 20 Toddlers 24 Children 34 Teenagers 36 Adults 50 Pensioners 36 Barchart: 1. Give the barchart a suitable title. 2. Label the horizontal axis with the names of the categories of age groups. Leave a space between each category. 3. Label the vertical axis as frequency (percentage) or Percentage of family members. 4. Size each column to the calculated percentage from table. Segmented Bar chart: 1. Give the segmented barchart a suitable title. 2. Label the horizontal axis with variable age groups. 3. Label the vertical axis as frequency (percentage) or percentage of family members. 4. Stack each column on top of each other. The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 11

9 QUESTION 8 For the following data set display as (i) bar chart (ii) dot plot: Type of sport played by 32 students on the weekend. basketball football basketball soccer tennis netball basketball football football tennis netball netball football soccer netball karate football football athletics basketball netball tennis football soccer football soccer basketball athletics soccer tennis soccer hockey For Barcharts: Follow the instructions provided. 1. Identify all the different categories. 2. Give the barchart a suitable title. 3. Label the horizontal axis with the names of the categories of sport. Leave a space between each category. 4. Label the vertical axis as frequency or number of students. 5. Count the number of each sport and make the bar the height of that frequency. For Dot plots: 1. Identify all the different categories. 2. Give the dot plot a suitable title. 3. Label the horizontal axis with the names of the categories of sport. Leave a space between each category. 4. Use a single dot represents each data point. QUESTION 9 The segmented bar chart shows the distribution of degree of text-messaging use by a random sample of school students. Degree of text messaging For these students, the percentage in year levels 9 10 who are occasional text messagers is closest to: A 25% B 32% C 42% D 50% E 75% The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 12

10 HISTOGRAMS For numerical data, either discrete or continuous data, grouped data. The height of the bar gives the frequency (count or percentage). The end of one rectangle must be the beginning of the next rectangle i.e. no gaps. We can also plot percentage frequencies or relative frequencies on the vertical axis for both grouped and ungrouped data. If the data is individual discrete values the bars are directly above the appropriate values. If the data is continuous class intervals the bars start at the bottom end of the class interval and extend to the upper end of the class interval. QUESTION 10 The following is a survey of 30 students of the number of people in their family living at home Display as histogram. Solution FOR HISTOGRAMS 1. Identify it is numerical discrete data with a range of less than 15. Label the x-axis as Number of members in a family, leave a space at the beginning and scale the x- axis from lowest to highest score. Label the score in the middle of each column. 2. Label the y-axis as frequency (or number of families). 3. Draw in the first column in the middle of the score on the x-axis and a height to the value of its frequency taken from y-axis. 4. Repeat for all other scores s 8 ilie 7 m fa 6 f o r 5 e b 4 m u 3 N Histogram of Members in Families Number of members in a family The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 13

11 USING A TI 84 GRAPHIC CALCULATOR Entering data 1. Press STAT and select 1:Edit. 2. Enter raw scores into list L 1. Setting Up Display-Histogram 3. Press 2 nd STATPLOT and select 1:Plot1. 4. Select ON and cursor down and select histogram icon. 5. Cursor down and set score list to L 1 (Xlist:L 1 ) and frequency to 1 (Freq:1). Display Histogram. 6. Press ZOOM and select 9:ZoomStat. Press WINDOW to fine tune the x (scores) and y (frequency) axis values and interval size (xscl). Press GRAPH and trace and cursor left and right to investigate the value and frequency of each column. USING THE CASIO CLASSPAD CALCULATOR In Stat menu, enter data into List 1 SetGraph Settings set screen as shown Set Set the interval, here we want to start at 10 and go up in class intervals of 10 OK The graph will appear as shown in the bottom half of the screen. Resize screen if necessary, Analysis trace will allow you to find heights of peaks using the 4 way key to move across. The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 14

12 QUESTION 11 Distances students travel to school by 300 students are summarised in the table below: Distance km Represent this information as Histogram. Solution Number of students 0 - < < < < < < 12 8 The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 15

13 SHAPES OF DISTRIBUTIONS The shape or skewness describes how symmetrical the distribution or results are. Most distributions are skewed. Perfect symmetry: The mean, mode and median are similar. Frequency polygon will often be a bell shaped curve. Note that data can be symmetric without being bell shaped (normal). Negative skew: Tail of curve stretches to the left. Most of the data is concentrated around the higher values. Mode, mean and median will not coincide (not the same values). The mean will usually be lower than the median. Mode, mean and median are located to the right (in the main body of curve). Positive skew: Tail of curve stretches to the right. Most of the data is concentrated around the lower values. Mode, mean and median will not coincide (not the same values). The mean will usually be higher than the median. Mode, mean and median are located to the left (in the main body of curve). QUESTION 12 For the frequency table shown the shape of the distribution is most likely to be: Score Frequency A B C D E Positively skewed Negatively skewed Skewed to the left Bimodal Bell-shaped The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 16

14 PARAMETERS IN STATISTICS Measures of central tendency are given by the mode, median and mean. These measures can be applied to both grouped and ungrouped data. The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 17

15 MODE Value which occurs most frequently or most popular value. Data sets can have 2 or more modes or modal classes. As the mode is simply the score (or interval) with the highest frequency, it is not always a measure of centre. However, when the number of scores is very large and the distribution is symmetric, the mode often does occur near the central value. MEDIAN The median represents the central value or middle value of the given data set. The median can be viewed as the value below which 50% of the results lie. The median is not affected by extreme values, so it is a good measure of centre when outliers are present or when the distribution is skewed. n 1 The median is located at the th position in the data set. 2 If n is odd, the median is the middle data set. If n is even, the median is the average of the two middle data points. MEAN The mean represents the average value of the given data set. It is found by adding together all the values of the data and dividing by the number of values. For Ungrouped Data: x x n i.e. Sum of the values Number of values When there is a large amount of data, group data together in a frequency table. For Grouped Data: x n i x i f f i When values are given in class intervals: Mean:- Sum of ( midpoint of class frequency) Sum of frequencies The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 18

16 For continuous variables: Assume that all the frequencies in a class interval have the same value as the midpoint of that interval. i.e. x = midpoint of interval f frequency of that class interval Total for a class = Midpoint of class Frequency For discrete variables: Midpoint of interval is calculated by taking the mean of the first and last value of each interval. The mean is not necessarily equal to one of the given values in the data set. The mean is affected by extreme values and by skewing, unless there are extremes at both ends that balance out. In this way the mean takes into account all of the given data. QUESTION 13 For the following raw data: Calculate the: (a) Median (b) Mean (c) Mode Solution (a) Median REORDER from LOWEST to HIGHEST n 1 Median score th score th score 2 that is between 5 th and 6 th score which is between 5 and 5 so Median = 5. (b) (c) Mean x sumof all scores Mean = x n number of scores x Mode Look for a score that occurs the most often in the list. The Mode is 4 as it occurred 3 times The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 19

17 ALTERNATIVE METHOD USING GRAPHIC CALCULATOR 1. Press STAT and select 1:EDIT. 2. Enter the scores in the L 1 table. 3. Press STAT and select CALC 4. Select 1:1-Var Stats and press ENTER. 5. The mean is given as x 6. The number of scores is given as n 7. Scroll down the screen for the median, given as Med = Note: Some graphic calculators do not give the mode. Examine raw data for most frequent score. Mean = 5.5. Number of scores is 10. Median is 5. Mode is 4 (it occurred 3 times). QUESTION 14 The survey of 30 people and the number of cars in their household was summarised in a frequency table as shown. Calculate the: (a) (b) (c) Median Mean Mode Solution Number of Number of cars households Σ f 30 The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 20

18 METHOD USING GRAPHIC CALCULATOR 1. Press STAT and select 1:EDIT. 2. Enter the scores in the L 1 table and the frequencies in L Press STAT and select CALC 4. Select 1:1-Var Stats and press ENTER 5. Press 2 nd L 1 followed by COMMA and the 2 nd L Press ENTER The mean is given as x The number of scores is given as n Mean = Number of scores is 30. Scroll down the screen for the median, given as Med = The median is 2.5 <screen47> The mode is 2 (it occurred 12 times). QUESTION 15 The results of a mathematics test marked out of 60 are represented by this stem-and-leaf plot. The mean, median and mode, respectively, are: A 60, 42, 49 B 40, 45, 60 C 46, 60, 40 D 42, 45, 60 E 60, 45, 42 Stem Leaf Key 5 4 means 54 The School For Excellence 2011 The Essentials Further Mathematics Core Materials Page 21

Chapter 2: Frequency Distributions and Graphs

Chapter 2: Frequency Distributions and Graphs Chapter 2: Frequency Distributions and Graphs Learning Objectives Upon completion of Chapter 2, you will be able to: Organize the data into a table or chart (called a frequency distribution) Construct

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)

More information

Statistics Revision Sheet Question 6 of Paper 2

Statistics Revision Sheet Question 6 of Paper 2 Statistics Revision Sheet Question 6 of Paper The Statistics question is concerned mainly with the following terms. The Mean and the Median and are two ways of measuring the average. sumof values no. of

More information

Summarizing and Displaying Categorical Data

Summarizing and Displaying Categorical Data Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency

More information

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

More information

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

More information

AP * Statistics Review. Descriptive Statistics

AP * Statistics Review. Descriptive Statistics AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

What Does the Normal Distribution Sound Like?

What Does the Normal Distribution Sound Like? What Does the Normal Distribution Sound Like? Ananda Jayawardhana Pittsburg State University ananda@pittstate.edu Published: June 2013 Overview of Lesson In this activity, students conduct an investigation

More information

2 Describing, Exploring, and

2 Describing, Exploring, and 2 Describing, Exploring, and Comparing Data This chapter introduces the graphical plotting and summary statistics capabilities of the TI- 83 Plus. First row keys like \ R (67$73/276 are used to obtain

More information

Exploratory Data Analysis

Exploratory Data Analysis Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

More information

Lesson 4 Measures of Central Tendency

Lesson 4 Measures of Central Tendency Outline Measures of a distribution s shape -modality and skewness -the normal distribution Measures of central tendency -mean, median, and mode Skewness and Central Tendency Lesson 4 Measures of Central

More information

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures Introductory Statistics Lectures Visualizing Data Descriptive Statistics I Department of Mathematics Pima Community College Redistribution of this material is prohibited without written permission of the

More information

Sta 309 (Statistics And Probability for Engineers)

Sta 309 (Statistics And Probability for Engineers) Instructor: Prof. Mike Nasab Sta 309 (Statistics And Probability for Engineers) Chapter 2 Organizing and Summarizing Data Raw Data: When data are collected in original form, they are called raw data. The

More information

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

More information

AP Statistics Solutions to Packet 2

AP Statistics Solutions to Packet 2 AP Statistics Solutions to Packet 2 The Normal Distributions Density Curves and the Normal Distribution Standard Normal Calculations HW #9 1, 2, 4, 6-8 2.1 DENSITY CURVES (a) Sketch a density curve that

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

Describing, Exploring, and Comparing Data

Describing, Exploring, and Comparing Data 24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

More information

Statistics Chapter 2

Statistics Chapter 2 Statistics Chapter 2 Frequency Tables A frequency table organizes quantitative data. partitions data into classes (intervals). shows how many data values are in each class. Test Score Number of Students

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

How To Write A Data Analysis

How To Write A Data Analysis Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple. Graphical Representations of Data, Mean, Median and Standard Deviation In this class we will consider graphical representations of the distribution of a set of data. The goal is to identify the range of

More information

Descriptive Statistics

Descriptive Statistics Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web

More information

II. DISTRIBUTIONS distribution normal distribution. standard scores

II. DISTRIBUTIONS distribution normal distribution. standard scores Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

More information

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175) Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,

More information

Using SPSS, Chapter 2: Descriptive Statistics

Using SPSS, Chapter 2: Descriptive Statistics 1 Using SPSS, Chapter 2: Descriptive Statistics Chapters 2.1 & 2.2 Descriptive Statistics 2 Mean, Standard Deviation, Variance, Range, Minimum, Maximum 2 Mean, Median, Mode, Standard Deviation, Variance,

More information

Module 2: Introduction to Quantitative Data Analysis

Module 2: Introduction to Quantitative Data Analysis Module 2: Introduction to Quantitative Data Analysis Contents Antony Fielding 1 University of Birmingham & Centre for Multilevel Modelling Rebecca Pillinger Centre for Multilevel Modelling Introduction...

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS

SECTION 2-1: OVERVIEW SECTION 2-2: FREQUENCY DISTRIBUTIONS SECTION 2-1: OVERVIEW Chapter 2 Describing, Exploring and Comparing Data 19 In this chapter, we will use the capabilities of Excel to help us look more carefully at sets of data. We can do this by re-organizing

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

A Picture Really Is Worth a Thousand Words

A Picture Really Is Worth a Thousand Words 4 A Picture Really Is Worth a Thousand Words Difficulty Scale (pretty easy, but not a cinch) What you ll learn about in this chapter Why a picture is really worth a thousand words How to create a histogram

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

Bar Charts, Histograms, Line Graphs & Pie Charts

Bar Charts, Histograms, Line Graphs & Pie Charts Bar Charts and Histograms Bar charts and histograms are commonly used to represent data since they allow quick assimilation and immediate comparison of information. Normally the bars are vertical, but

More information

Variables. Exploratory Data Analysis

Variables. Exploratory Data Analysis Exploratory Data Analysis Exploratory Data Analysis involves both graphical displays of data and numerical summaries of data. A common situation is for a data set to be represented as a matrix. There is

More information

Lab 1: The metric system measurement of length and weight

Lab 1: The metric system measurement of length and weight Lab 1: The metric system measurement of length and weight Introduction The scientific community and the majority of nations throughout the world use the metric system to record quantities such as length,

More information

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 110 012 seema@iasri.res.in 1. Descriptive Statistics Statistics

More information

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. MATH 3/GRACEY PRACTICE EXAM/CHAPTERS 2-3 Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) The frequency distribution

More information

GeoGebra Statistics and Probability

GeoGebra Statistics and Probability GeoGebra Statistics and Probability Project Maths Development Team 2013 www.projectmaths.ie Page 1 of 24 Index Activity Topic Page 1 Introduction GeoGebra Statistics 3 2 To calculate the Sum, Mean, Count,

More information

Week 1. Exploratory Data Analysis

Week 1. Exploratory Data Analysis Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam

More information

Bar Graphs and Dot Plots

Bar Graphs and Dot Plots CONDENSED L E S S O N 1.1 Bar Graphs and Dot Plots In this lesson you will interpret and create a variety of graphs find some summary values for a data set draw conclusions about a data set based on graphs

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

a. mean b. interquartile range c. range d. median

a. mean b. interquartile range c. range d. median 3. Since 4. The HOMEWORK 3 Due: Feb.3 1. A set of data are put in numerical order, and a statistic is calculated that divides the data set into two equal parts with one part below it and the other part

More information

MODUL 8 MATEMATIK SPM ENRICHMENT TOPIC : STATISTICS TIME : 2 HOURS

MODUL 8 MATEMATIK SPM ENRICHMENT TOPIC : STATISTICS TIME : 2 HOURS MODUL 8 MATEMATIK SPM ENRICHMENT TOPIC : STATISTICS TIME : 2 HOURS 1. The data in Diagram 1 shows the body masses, in kg, of 40 children in a kindergarten. 16 24 34 26 30 40 35 30 26 33 18 20 29 31 30

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

Consolidation of Grade 3 EQAO Questions Data Management & Probability

Consolidation of Grade 3 EQAO Questions Data Management & Probability Consolidation of Grade 3 EQAO Questions Data Management & Probability Compiled by Devika William-Yu (SE2 Math Coach) GRADE THREE EQAO QUESTIONS: Data Management and Probability Overall Expectations DV1

More information

Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab

Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab 1 Valor Christian High School Mrs. Bogar Biology Graphing Fun with a Paper Towel Lab I m sure you ve wondered about the absorbency of paper towel brands as you ve quickly tried to mop up spilled soda from

More information

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Lecture 2: Descriptive Statistics and Exploratory Data Analysis Lecture 2: Descriptive Statistics and Exploratory Data Analysis Further Thoughts on Experimental Design 16 Individuals (8 each from two populations) with replicates Pop 1 Pop 2 Randomly sample 4 individuals

More information

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html

More information

Statistics. Measurement. Scales of Measurement 7/18/2012

Statistics. Measurement. Scales of Measurement 7/18/2012 Statistics Measurement Measurement is defined as a set of rules for assigning numbers to represent objects, traits, attributes, or behaviors A variableis something that varies (eye color), a constant does

More information

Census@School Data Handling and Probability (Grades 10, 11 and 12)

Census@School Data Handling and Probability (Grades 10, 11 and 12) Census@School Data Handling and Probability (Grades 10, 11 and 12) Statistics South Africa, 2013 Pali Lehohla, Statistician-General Census@School Data Handling and Probability (Grades 10, 11 and 12) /

More information

Introduction to Quantitative Methods

Introduction to Quantitative Methods Introduction to Quantitative Methods October 15, 2009 Contents 1 Definition of Key Terms 2 2 Descriptive Statistics 3 2.1 Frequency Tables......................... 4 2.2 Measures of Central Tendencies.................

More information

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous Chapter 2 Overview Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Classify as categorical or qualitative data. 1) A survey of autos parked in

More information

Box-and-Whisker Plots

Box-and-Whisker Plots Learning Standards HSS-ID.A. HSS-ID.A.3 3 9 23 62 3 COMMON CORE.2 Numbers of First Cousins 0 3 9 3 45 24 8 0 3 3 6 8 32 8 0 5 4 Box-and-Whisker Plots Essential Question How can you use a box-and-whisker

More information

Chapter 1: Exploring Data

Chapter 1: Exploring Data Chapter 1: Exploring Data Chapter 1 Review 1. As part of survey of college students a researcher is interested in the variable class standing. She records a 1 if the student is a freshman, a 2 if the student

More information

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

Unit 9 Describing Relationships in Scatter Plots and Line Graphs Unit 9 Describing Relationships in Scatter Plots and Line Graphs Objectives: To construct and interpret a scatter plot or line graph for two quantitative variables To recognize linear relationships, non-linear

More information

Lesson 2: Constructing Line Graphs and Bar Graphs

Lesson 2: Constructing Line Graphs and Bar Graphs Lesson 2: Constructing Line Graphs and Bar Graphs Selected Content Standards Benchmarks Assessed: D.1 Designing and conducting statistical experiments that involve the collection, representation, and analysis

More information

Creating Charts in Microsoft Excel A supplement to Chapter 5 of Quantitative Approaches in Business Studies

Creating Charts in Microsoft Excel A supplement to Chapter 5 of Quantitative Approaches in Business Studies Creating Charts in Microsoft Excel A supplement to Chapter 5 of Quantitative Approaches in Business Studies Components of a Chart 1 Chart types 2 Data tables 4 The Chart Wizard 5 Column Charts 7 Line charts

More information

Graph it! Grade Six. Estimated Duration: Three hours

Graph it! Grade Six. Estimated Duration: Three hours Ohio Standards Connection Data Analysis and Probability Benchmark A Read, Create and use line graphs, histograms, circle graphs, box-and whisker plots, stem-and-leaf plots, and other representations when

More information

AMS 7L LAB #2 Spring, 2009. Exploratory Data Analysis

AMS 7L LAB #2 Spring, 2009. Exploratory Data Analysis AMS 7L LAB #2 Spring, 2009 Exploratory Data Analysis Name: Lab Section: Instructions: The TAs/lab assistants are available to help you if you have any questions about this lab exercise. If you have any

More information

Academic Support Center. Using the TI-83/84+ Graphing Calculator PART II

Academic Support Center. Using the TI-83/84+ Graphing Calculator PART II Academic Support Center Using the TI-83/84+ Graphing Calculator PART II Designed and Prepared by The Academic Support Center Revised June 2012 1 Using the Graphing Calculator (TI-83+ or TI-84+) Table of

More information

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area under the curve must equal 1. 2. Every point on the curve

More information

Exploratory Data Analysis. Psychology 3256

Exploratory Data Analysis. Psychology 3256 Exploratory Data Analysis Psychology 3256 1 Introduction If you are going to find out anything about a data set you must first understand the data Basically getting a feel for you numbers Easier to find

More information

CHAPTER 14 STATISTICS. 14.1 Introduction

CHAPTER 14 STATISTICS. 14.1 Introduction 238 MATHEMATICS STATISTICS CHAPTER 14 14.1 Introduction Everyday we come across a lot of information in the form of facts, numerical figures, tables, graphs, etc. These are provided by newspapers, televisions,

More information

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives. The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution

More information

Interpreting Data in Normal Distributions

Interpreting Data in Normal Distributions Interpreting Data in Normal Distributions This curve is kind of a big deal. It shows the distribution of a set of test scores, the results of rolling a die a million times, the heights of people on Earth,

More information

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data A Few Sources for Data Examples Used Introduction to Environmental Statistics Professor Jessica Utts University of California, Irvine jutts@uci.edu 1. Statistical Methods in Water Resources by D.R. Helsel

More information

Practice#1(chapter1,2) Name

Practice#1(chapter1,2) Name Practice#1(chapter1,2) Name Solve the problem. 1) The average age of the students in a statistics class is 22 years. Does this statement describe descriptive or inferential statistics? A) inferential statistics

More information

Lecture 1: Review and Exploratory Data Analysis (EDA)

Lecture 1: Review and Exploratory Data Analysis (EDA) Lecture 1: Review and Exploratory Data Analysis (EDA) Sandy Eckel seckel@jhsph.edu Department of Biostatistics, The Johns Hopkins University, Baltimore USA 21 April 2008 1 / 40 Course Information I Course

More information

Describing Data: Measures of Central Tendency and Dispersion

Describing Data: Measures of Central Tendency and Dispersion 100 Part 2 / Basic Tools of Research: Sampling, Measurement, Distributions, and Descriptive Statistics Chapter 8 Describing Data: Measures of Central Tendency and Dispersion In the previous chapter we

More information

The Normal Distribution

The Normal Distribution Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution

More information

THE BINOMIAL DISTRIBUTION & PROBABILITY

THE BINOMIAL DISTRIBUTION & PROBABILITY REVISION SHEET STATISTICS 1 (MEI) THE BINOMIAL DISTRIBUTION & PROBABILITY The main ideas in this chapter are Probabilities based on selecting or arranging objects Probabilities based on the binomial distribution

More information

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Descriptive Statistics Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion Statistics as a Tool for LIS Research Importance of statistics in research

More information

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers 1.3 Measuring Center & Spread, The Five Number Summary & Boxplots Describing Quantitative Data with Numbers 1.3 I can n Calculate and interpret measures of center (mean, median) in context. n Calculate

More information

Using Excel for descriptive statistics

Using Excel for descriptive statistics FACT SHEET Using Excel for descriptive statistics Introduction Biologists no longer routinely plot graphs by hand or rely on calculators to carry out difficult and tedious statistical calculations. These

More information

Analyzing Experimental Data

Analyzing Experimental Data Analyzing Experimental Data The information in this chapter is a short summary of some topics that are covered in depth in the book Students and Research written by Cothron, Giese, and Rezba. See the end

More information

3: Summary Statistics

3: Summary Statistics 3: Summary Statistics Notation Let s start by introducing some notation. Consider the following small data set: 4 5 30 50 8 7 4 5 The symbol n represents the sample size (n = 0). The capital letter X denotes

More information

Midterm Review Problems

Midterm Review Problems Midterm Review Problems October 19, 2013 1. Consider the following research title: Cooperation among nursery school children under two types of instruction. In this study, what is the independent variable?

More information

Common Tools for Displaying and Communicating Data for Process Improvement

Common Tools for Displaying and Communicating Data for Process Improvement Common Tools for Displaying and Communicating Data for Process Improvement Packet includes: Tool Use Page # Box and Whisker Plot Check Sheet Control Chart Histogram Pareto Diagram Run Chart Scatter Plot

More information

Lecture 2. Summarizing the Sample

Lecture 2. Summarizing the Sample Lecture 2 Summarizing the Sample WARNING: Today s lecture may bore some of you It s (sort of) not my fault I m required to teach you about what we re going to cover today. I ll try to make it as exciting

More information

determining relationships among the explanatory variables, and

determining relationships among the explanatory variables, and Chapter 4 Exploratory Data Analysis A first look at the data. As mentioned in Chapter 1, exploratory data analysis or EDA is a critical first step in analyzing the data from an experiment. Here are the

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Licensed to: CengageBrain User

Licensed to: CengageBrain User This is an electronic version of the print textbook. Due to electronic rights restrictions, some third party content may be suppressed. Editorial review has deemed that any suppressed content does not

More information

Data exploration with Microsoft Excel: univariate analysis

Data exploration with Microsoft Excel: univariate analysis Data exploration with Microsoft Excel: univariate analysis Contents 1 Introduction... 1 2 Exploring a variable s frequency distribution... 2 3 Calculating measures of central tendency... 16 4 Calculating

More information

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques

More information

Frequency Distributions

Frequency Distributions Descriptive Statistics Dr. Tom Pierce Department of Psychology Radford University Descriptive statistics comprise a collection of techniques for better understanding what the people in a group look like

More information

Descriptive statistics parameters: Measures of centrality

Descriptive statistics parameters: Measures of centrality Descriptive statistics parameters: Measures of centrality Contents Definitions... 3 Classification of descriptive statistics parameters... 4 More about central tendency estimators... 5 Relationship between

More information

GeoGebra. 10 lessons. Gerrit Stols

GeoGebra. 10 lessons. Gerrit Stols GeoGebra in 10 lessons Gerrit Stols Acknowledgements GeoGebra is dynamic mathematics open source (free) software for learning and teaching mathematics in schools. It was developed by Markus Hohenwarter

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

Northumberland Knowledge

Northumberland Knowledge Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

More information

Box-and-Whisker Plots

Box-and-Whisker Plots Mathematics Box-and-Whisker Plots About this Lesson This is a foundational lesson for box-and-whisker plots (boxplots), a graphical tool used throughout statistics for displaying data. During the lesson,

More information

Mind on Statistics. Chapter 2

Mind on Statistics. Chapter 2 Mind on Statistics Chapter 2 Sections 2.1 2.3 1. Tallies and cross-tabulations are used to summarize which of these variable types? A. Quantitative B. Mathematical C. Continuous D. Categorical 2. The table

More information

Microsoft Excel 2010 Charts and Graphs

Microsoft Excel 2010 Charts and Graphs Microsoft Excel 2010 Charts and Graphs Email: training@health.ufl.edu Web Page: http://training.health.ufl.edu Microsoft Excel 2010: Charts and Graphs 2.0 hours Topics include data groupings; creating

More information

Expression. Variable Equation Polynomial Monomial Add. Area. Volume Surface Space Length Width. Probability. Chance Random Likely Possibility Odds

Expression. Variable Equation Polynomial Monomial Add. Area. Volume Surface Space Length Width. Probability. Chance Random Likely Possibility Odds Isosceles Triangle Congruent Leg Side Expression Equation Polynomial Monomial Radical Square Root Check Times Itself Function Relation One Domain Range Area Volume Surface Space Length Width Quantitative

More information

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: "What do the data look like?"

4.1 Exploratory Analysis: Once the data is collected and entered, the first question is: What do the data look like? Data Analysis Plan The appropriate methods of data analysis are determined by your data types and variables of interest, the actual distribution of the variables, and the number of cases. Different analyses

More information

CHARTS AND GRAPHS INTRODUCTION USING SPSS TO DRAW GRAPHS SPSS GRAPH OPTIONS CAG08

CHARTS AND GRAPHS INTRODUCTION USING SPSS TO DRAW GRAPHS SPSS GRAPH OPTIONS CAG08 CHARTS AND GRAPHS INTRODUCTION SPSS and Excel each contain a number of options for producing what are sometimes known as business graphics - i.e. statistical charts and diagrams. This handout explores

More information

How Does My TI-84 Do That

How Does My TI-84 Do That How Does My TI-84 Do That A guide to using the TI-84 for statistics Austin Peay State University Clarksville, Tennessee How Does My TI-84 Do That A guide to using the TI-84 for statistics Table of Contents

More information

USING A TI-83 OR TI-84 SERIES GRAPHING CALCULATOR IN AN INTRODUCTORY STATISTICS CLASS

USING A TI-83 OR TI-84 SERIES GRAPHING CALCULATOR IN AN INTRODUCTORY STATISTICS CLASS USING A TI-83 OR TI-84 SERIES GRAPHING CALCULATOR IN AN INTRODUCTORY STATISTICS CLASS W. SCOTT STREET, IV DEPARTMENT OF STATISTICAL SCIENCES & OPERATIONS RESEARCH VIRGINIA COMMONWEALTH UNIVERSITY Table

More information