Section 3: Examining Center, Spread, and Shape with Box Plots




Q32. So far in my examination of the data, most of the attributes seem to be skewed. Expenditure per student and revenue per student are both skewed to the right, with the District of Columbia having the highest value. Average teacher salary is also skewed to the right. The two attributes most related to population, number of teachers and number of high school graduates, are both skewed to the right, with Texas having the highest value. The only attribute that is not skewed to the right is students per teacher, which is skewed to the left.

Q33. The median of the expenditure per student data is $7168. The overall range of the data is $7341. The data is clustered in the $6000-$7500 range. The data is skewed to the right.

The median of the average teacher salary data is $40476. The overall range of the data is $21948. The data is clustered in the $37000-$42000 range. The data is skewed to the right.

The median of the total number of teachers data is 42920. The overall range of the data is 283805. The data is clustered in the 0-60000 range. The data is skewed to the right.

The median of the number of high school graduates data is 37385. The overall range of the data is 241929. The data is clustered in the 20000-80000 range. The data is skewed to the right.

The median of the revenue per student data is $8208. The overall range of the data is $5972. The data is clustered in the $6500-$8500 range. The data is skewed to the right.

The median of the students per teacher data is 15.2. The overall range of the data is 6. The data is clustered in the 14-16.5 range. The data is skewed to the left.

Q34. It is easier to understand a box plot if you realize that each quarter of the plot contains the same amount of data, even though the quarters are different sizes. Having the dots of data displayed along with the box plot makes it easier for students to make this connection. Seeing the data reinforces this idea. It is easier to describe the spread and shape of data when you see the individual data points. Most students can easily see the shape of data in a dot plot. By relating this to the box plot, they can interpret how the box plot shows the shape of the data as well.

Q35. The lower quartile of average teacher salary is $38461. The upper quartile is $43655, and the interquartile range is $5194. The total range of this data is $21948. The big difference between the range and the interquartile range shows that although the range is large, the middle 50% of the data is relatively close together. This difference suggests that there may be some outliers stretching the range.
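The comparison in Q35 can be checked with a few lines of arithmetic. This is a minimal sketch that uses only the quartiles and range reported above; the variable names are illustrative and do not come from the assignment.

```python
# Summary values for average teacher salary, as reported in Q35.
lower_quartile = 38461
upper_quartile = 43655
total_range = 21948

# The interquartile range (IQR) is the width of the box: Q3 - Q1.
iqr = upper_quartile - lower_quartile

# Fraction of the total range occupied by the middle 50% of the data.
iqr_share = iqr / total_range

print(f"IQR = ${iqr}")
print(f"The middle 50% of salaries spans {iqr_share:.0%} of the total range")
```

A small share like this is what signals that the whiskers, and possibly a few extreme values, account for most of the range.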

Q36. I would use the Tukey method with my students. I feel that this will be the easiest for students to understand. I think it is important for students to know how to find the quartiles no matter what technology they are using, and the Tukey method would work every time. It also works no matter what the data set looks like: whether the number of values is even or odd, and whether or not there are multiple values equal to the median. I would show students the other methods and show that you may get different answers using the different methods. I will tell them this is okay, but for consistency in this classroom we will all use the same method.

Q37. Washington DC is considered an outlier in this data set. The interquartile range of the data is $5194, and 1.5 × $5194 = $7791. Since the upper quartile is $43655, the high outliers will be values above $43655 + $7791 = $51446. The average salary in Washington DC is $57009, so it is an outlier.

Q38. [Box plots: Original; Outliers Shown.] The only things that change from the original box plot to the box plot where the outliers are shown are the range and the value of the maximum.
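The quartile method from Q36 and the outlier check from Q37 can be sketched in code. This sketch assumes that the "Tukey method" means Tukey's hinges, where the median is included in both halves when the number of values is odd; the function names and the small sample data set are hypothetical and are not part of the assignment data.

```python
from statistics import median


def tukey_hinges(values):
    """Return (lower hinge, median, upper hinge).

    Assumption: the median is included in both halves when the
    number of values is odd (Tukey's hinges).
    """
    data = sorted(values)
    n = len(data)
    half = (n + 1) // 2  # size of each half; includes the median if n is odd
    return median(data[:half]), median(data), median(data[n - half:])


def upper_fence(q1, q3):
    """Values above Q3 + 1.5 * IQR are flagged as high outliers."""
    return q3 + 1.5 * (q3 - q1)


# Hypothetical data set: odd length, several values equal to the median.
print(tukey_hinges([1, 2, 3, 3, 3, 4, 9]))

# Quartiles reported in Q35 and the Washington DC salary from Q37.
fence = upper_fence(38461, 43655)
print(fence, 57009 > fence)
```

Other quartile conventions, such as excluding the median from both halves, can give slightly different quartiles for the same data, which is the point made in Q36 about agreeing on one method.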

Q39. I think that removing the outlier from the data set will drastically affect the range and mean; however, I don't think it will affect the median or the appearance of the box plot much, except for the value of the maximum.

Q40. [Box plots: Original; Modified.] The median of the data did not change. The range changed from $21948 to $15200. The lower quartile changed from $38461 to $38393. The upper quartile changed from $43655 to $43433, and the interquartile range changed from $5194 to $5040.

Q41. I would use the modified box plot to estimate the average teacher salary. It paints a more realistic picture of all the salaries in the South region. The outlier of Washington DC threw the data off and made it more skewed. The outlier didn't fit in with the rest of the data.

Q42. I would have them use Tinkerplots to construct a box plot first. I think that seeing the individual data points along with the box plot will really help students grasp what a box plot shows and represents. Also, using the technology is a great way to show the effect of outliers on box plots.

Q43. Tinkerplots will really show how data is distributed in a box plot. Being able to view the data points and the box plot at the same time will make this concept clear, and the connection will be the student's own discovery. Seeing box plots in Tinkerplots will help students understand the shape, center, and range of different data sets and how a box plot represents all of these measures.
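The before-and-after values reported in Q40 can be compared numerically to show how much more the outlier affects the range than the interquartile range. This is a minimal sketch using only the values stated in Q40; the dictionary layout and helper names are illustrative.

```python
# Summary values reported in Q40, before and after removing the outlier.
original = {"q1": 38461, "q3": 43655, "range": 21948}
modified = {"q1": 38393, "q3": 43433, "range": 15200}


def iqr(summary):
    """Interquartile range: upper quartile minus lower quartile."""
    return summary["q3"] - summary["q1"]


def percent_drop(before, after):
    """Percentage decrease from `before` to `after`."""
    return 100 * (before - after) / before


range_drop = percent_drop(original["range"], modified["range"])
iqr_drop = percent_drop(iqr(original), iqr(modified))

print(f"Range fell by {range_drop:.1f}%")
print(f"IQR fell by {iqr_drop:.1f}%")
```

The range shrinks by a much larger percentage than the interquartile range, which is consistent with the prediction in Q39 that the outlier mostly affects the maximum and the range.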