Diagrams and Graphs of Statistical Data



Similar documents
Summarizing and Displaying Categorical Data

Exercise 1.12 (Pg )

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

Data Exploration Data Visualization

Exploratory data analysis (Chapter 2) Fall 2011


Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th

Common Core Unit Summary Grades 6 to 8

Exploratory Data Analysis

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

MTH 140 Statistics Videos

Demographics of Atlanta, Georgia:

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Intro to Statistics 8 Curriculum

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

How To Write A Data Analysis

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

List of Examples. Examples 319

DESCRIPTIVE STATISTICS AND EXPLORATORY DATA ANALYSIS

Week 1. Exploratory Data Analysis

Variables. Exploratory Data Analysis

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

Statistics Chapter 2

COMMON CORE STATE STANDARDS FOR

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller

Unit 9 Describing Relationships in Scatter Plots and Line Graphs

The Comparisons. Grade Levels Comparisons. Focal PSSM K-8. Points PSSM CCSS 9-12 PSSM CCSS. Color Coding Legend. Not Identified in the Grade Band

South Carolina College- and Career-Ready (SCCCR) Probability and Statistics

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

R Graphics Cookbook. Chang O'REILLY. Winston. Tokyo. Beijing Cambridge. Farnham Koln Sebastopol

Descriptive statistics parameters: Measures of centrality

with functions, expressions and equations which follow in units 3 and 4.

Data Visualization Techniques

Module 2: Introduction to Quantitative Data Analysis

Common Tools for Displaying and Communicating Data for Process Improvement

DATA INTERPRETATION AND STATISTICS

Lecture 1: Review and Exploratory Data Analysis (EDA)

Interpreting Data in Normal Distributions

Pennsylvania System of School Assessment

Geostatistics Exploratory Analysis

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

Data Visualization Techniques

Data exploration with Microsoft Excel: analysing more than one variable

Chapter 2: Frequency Distributions and Graphs

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

What Does the Normal Distribution Sound Like?

Algebra 1 Course Information

Glencoe. correlated to SOUTH CAROLINA MATH CURRICULUM STANDARDS GRADE 6 3-3, , , 4-9

AP * Statistics Review. Descriptive Statistics

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

GRADES 7, 8, AND 9 BIG IDEAS

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

business statistics using Excel OXFORD UNIVERSITY PRESS Glyn Davis & Branko Pecar

Descriptive Statistics

Manhattan Center for Science and Math High School Mathematics Department Curriculum

SPSS Manual for Introductory Applied Statistics: A Variable Approach

a) Find the five point summary for the home runs of the National League teams. b) What is the mean number of home runs by the American League teams?

What is the purpose of this document? What is in the document? How do I send Feedback?

CRLS Mathematics Department Algebra I Curriculum Map/Pacing Guide

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Exploratory Data Analysis. Psychology 3256

Describing, Exploring, and Comparing Data

430 Statistics and Financial Mathematics for Business

Algebra Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Iris Sample Data Set. Basic Visualization Techniques: Charts, Graphs and Maps. Summary Statistics. Frequency and Mode

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Exploratory Data Analysis

Lesson 4 Measures of Central Tendency

AP STATISTICS REVIEW (YMS Chapters 1-8)

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

2. Simple Linear Regression

RUTHERFORD HIGH SCHOOL Rutherford, New Jersey COURSE OUTLINE STATISTICS AND PROBABILITY

For example, estimate the population of the United States as 3 times 10⁸ and the

Lecture 2. Summarizing the Sample

Prentice Hall Mathematics Courses 1-3 Common Core Edition 2013

Statistics. Measurement. Scales of Measurement 7/18/2012

Shape of Data Distributions

determining relationships among the explanatory variables, and

UNIT 1: COLLECTING DATA

Data representation and analysis in Excel

Relationships Between Two Variables: Scatterplots and Correlation

EXPLORING SPATIAL PATTERNS IN YOUR DATA

Variable: characteristic that varies from one individual to another in the population

Problem of the Month Through the Grapevine

Chapter 1: Exploring Data

MBA 611 STATISTICS AND QUANTITATIVE METHODS

Statistics Revision Sheet Question 6 of Paper 2

Scatter Plots with Error Bars

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

Chapter 111. Texas Essential Knowledge and Skills for Mathematics. Subchapter B. Middle School

Transcription:

Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in which statistical data may be displayed pictorially such as different types of graphs and diagrams. 1

Patterns in Data Graphic displays are useful for seeing patterns in data. Patterns in data are commonly described in terms of: center, spread, shape, and unusual features. Some common distributions have special descriptive labels, such as: symmetric, bell-shaped, skewed, etc. 2

Center Graphically, the center of a distribution is located at the median of the distribution. This is the point in a graphic display where about half of the observations are on either side. In the chart to the right, the height of each column indicates the frequency of observations. Here, the observations are centered over 4. 3

Spread The spread of a distribution refers to the variability of the data. If the observations cover a wide range, the spread is larger. If the observations are clustered around a single value, the spread is smaller. Consider the figures above. In the figure on the left, data values range from 3 to 7; whereas in the figure on the right, values range from 1 to 9. The figure on the right is more variable, so it has the greater spread. 4

Shape The shape of a distribution is described by the following characteristics. 1. Symmetry. When it is graphed, a symmetric distribution can be divided at the center so that each half is a mirror image of the other. 2. Number of peaks. Distributions can have few or many peaks. Distributions with one clear peak are called unimodal, and distributions with two clear peaks are called bimodal. When a symmetric distribution has a single peak at the center, it is referred to as bell-shaped. 3. Skewness. When they are displayed graphically, some distributions have many more observations on one side of the graph than the other. Distributions with most of their observations on the left (toward lower values) are said to be skewed right; and distributions with most of their observations on the right (toward higher values) are said to be skewed left. 4. Uniform. When the observations in a set of data are equally spread across the range of the distribution, the distribution is called a uniform distribution. A uniform distribution has no clear peaks. 5

Some examples of distributions and shapes. 6

Unusual Features Sometimes, statisticians refer to unusual features in a set of data. The two most common unusual features are gaps and outliers. Gaps. Gaps refer to areas of a distribution where there are no observations. The first figure below has a gap; there are no observations in the middle of the distribution. Outliers. Sometimes, distributions are characterized by extreme values that differ greatly from the other observations. These extreme values are called outliers. The second figure below illustrates a distribution with an outlier. Except for one lonely observation (the outlier on the extreme right), all of the observations fall between 0 and 4. 7

Graphs in Statistics 1. Bar graph 2. Histogram 3. Pie graph 4. Line graph 5. Boxplot graph 6. Scatter graph 8

Bar Charts A bar graph is a way to visually represent qualitative data. A bar chart is made up of columns plotted on a graph. Here is how to read a bar chart. The columns are positioned over a label that represents a categorical variable. The height of the column indicates the size of the group defined by the column label. 9

Frecvency table http://www.youtube.com/watch?v=y2u0bmo91ys

Histograms Histograms are graphs of a distribution of data designed to show centering, dispersion (spread), and shape (relative frequency) of the data. Like a bar chart, a histogram is made up of columns plotted on a graph. Usually, there is no space between adjacent columns. The columns are positioned over a label that represents a quantitative variable. The column label can be a single value or a range of values. The height of the column indicates the size of the group defined by the column label. 11

Frecvency table http://www.youtube.com/watch?v=llc41-1pqqk

Problem Consider the histograms below. Which of the following statements are true? I. Both data sets are symmetric. II. Both data sets have the same range. (A) I only (B) II only (C) I and II (D) Neither is true. (E) There is insufficient information to answer this question.

Pie Chart Pie Chart or Circle Graph - A pie chart displays qualitative data in the form of a pie. Each slice of pie represents a different category. In a pie chart, the arc length of each sector (and consequently its central angle and area), is proportional to the quantity it represents. 15

Pie chart Eye colors of 100 third grader students. Brown corresponds to brown eyes, blue to blue eyes, and green to hazel eyes. A pie chart is a way of summarizing a set of categorical data. It is a circle which is divided into segments. Each segment represents a particular category. The area of each segment is proportional to the number of cases in that category. 16

Haw to create a pie chart. Expenditure Items Expenditure Angle of sectors Cumulative angle Food 600 144 144 Clothing 100 24 168 House rent 400 96 264 Fuel and Lighting 100 24 288 Miscellaneous 300 72 360 Total 1500 360 Food House rent Miscellaneous Clothing Fuel and Lighting http://www.youtube.com/watch?v=dysnnxc5jvo 17

Line graph A line graph is often used to represent a set of data values in which a quantity varies with time. These graphs are useful for finding trends. That is, finding a general pattern in data sets including temperature, sales, employment, company profit or cost over a period of time. http://www.youtube.com/watch?v=rn_275psjfc 18

Line graph. Exemple A cylinder of liquid was heated. Its temperature was recorded at ten-minute intervals as shown in the following table Time in minutes Temperature in C 0 5 10 26 20 45 30 61 40 74 50 80 60 85 a. Draw a line graph to represent this information. b. Estimate the temperature of the cylinder after 25 minutes of heating. 19

Boxplot graph What is a box plot? A box plot is a diagram that gives a visual representation to the distribution of the data, highlighting where most values lie and those values that greatly differ from the norm, called outliers. The box plot is also referred to as box and whisker plot or box and whisker diagram 20

Elements of the box plot http://www.youtube.com/watch?v=covf1jlxgj4 21 http://www.youtube.com/watch?v=wvr3q41pneo

Consider the boxplot below. Which of the following statements are true? I. The distribution is skewed right. II. The interquartile range is about 8. III. The median is about 10. (A) I only (B) II only (C) III only (D) I and III (E) II and III 22

Scatter graph A scatterplot is a graphic tool used to display the relationship between two quantitative variables. It gives a good visual picture of the relationship between the two variables, and aids the interpretation of the correlation coefficient or regression model. Scatter plots are similar to line graphs in that they use horizontal and vertical axes to plot data points. However, they have a very specific purpose. Scatter plots show how much one variable is affected by another. The relationship between two variables is called their correlation. 23 http://www.youtube.com/watch?v=y0qupw1dddu

More about Scatter Plot What is a trend line? A line on a graph showing the general direction that a group of points seem to be heading. A scatter plot describes a positive trend if, as one set of values increases, the other set tends to increase. A scatter plot describes a negative trend if, as one set of values increases, the other set tends to decrease. A scatter plot shows no trend if the ordered pairs show no correlation. http://www.youtube.com/watch?v=rbme0yo45bo 24

Patterns of Data in Scatterplots Scatterplots are used to analyze patterns in bivariate data. These patterns are described in terms of linearity, slope, and strength. Linearity refers to whether a data pattern is linear (straight) or nonlinear (curved). Slope refers to the direction of change in variable Y when variable X gets bigger. If variable Y also gets bigger, the slope is positive; but if variable Y gets smaller, the slope is negative. Strength refers to the degree of "scatter" in the plot. If the dots are widely spread, the relationship between variables is weak. If the dots are concentrated around a line, the relationship is strong. 25

Patterns of Data in Scatterplots 26

Problem The scatterplot below shows the relation between two variables. Which of the following statements are true? I. The relation is strong. II. The slope is positive. III. The slope is negative. (A) I only (B) II only (C) III only (D) I and II (E) I and III