Math 140 Introductory Statistics

Similar documents
Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Summarizing and Displaying Categorical Data

Lesson 4 Measures of Central Tendency

Exploratory data analysis (Chapter 2) Fall 2011

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

Exercise 1.12 (Pg )

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Variables. Exploratory Data Analysis

AP * Statistics Review. Descriptive Statistics

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Exploratory Data Analysis. Psychology 3256

Diagrams and Graphs of Statistical Data

Descriptive Statistics

AP Statistics Solutions to Packet 2

Data Exploration Data Visualization

Statistics Chapter 2

What Does the Normal Distribution Sound Like?

AMS 7L LAB #2 Spring, Exploratory Data Analysis

Shape of Data Distributions

How To Write A Data Analysis

Exploratory Data Analysis

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

a. mean b. interquartile range c. range d. median

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Chapter 1: Exploring Data

Chapter 4 Displaying Quantitative Data

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

determining relationships among the explanatory variables, and

Section 1.3 Exercises (Solutions)

Section 3: Examining Center, Spread, and Shape with Box Plots

First Midterm Exam (MATH1070 Spring 2012)

Descriptive statistics parameters: Measures of centrality

Pie Charts. proportion of ice-cream flavors sold annually by a given brand. AMS-5: Statistics. Cherry. Cherry. Blueberry. Blueberry. Apple.

THE BINOMIAL DISTRIBUTION & PROBABILITY

TEACHER NOTES MATH NSPIRED

Lecture 1: Review and Exploratory Data Analysis (EDA)

Name: Date: Use the following to answer questions 2-3:

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Common Tools for Displaying and Communicating Data for Process Improvement

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MTH 140 Statistics Videos

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Lecture 2. Summarizing the Sample

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Descriptive Statistics

Chapter 2: Frequency Distributions and Graphs

Sta 309 (Statistics And Probability for Engineers)

Week 1. Exploratory Data Analysis

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

Mind on Statistics. Chapter 2

Chapter 2 Data Exploration

Demographics of Atlanta, Georgia:

COMMON CORE STATE STANDARDS FOR

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Interpreting Data in Normal Distributions

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

GeoGebra Statistics and Probability

AP STATISTICS REVIEW (YMS Chapters 1-8)

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

SKEWNESS. Measure of Dispersion tells us about the variation of the data set. Skewness tells us about the direction of variation of the data set.

2: Frequency Distributions

STAB22 section 1.1. total = 88(200/100) + 85(200/100) + 77(300/100) + 90(200/100) + 80(100/100) = = 837,

Describing, Exploring, and Comparing Data

Statistics Revision Sheet Question 6 of Paper 2

Sampling and Descriptive Statistics

Section 1.1 Exercises (Solutions)

Descriptive Statistics and Measurement Scales

MEASURES OF VARIATION

Lecture 5 : The Poisson Distribution

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Northumberland Knowledge

MEASURES OF CENTER AND SPREAD MEASURES OF CENTER 11/20/2014. What is a measure of center? a value at the center or middle of a data set

List of Examples. Examples 319


Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

3: Summary Statistics

sample median Sample quartiles sample deciles sample quantiles sample percentiles Exercise 1 five number summary # Create and view a sorted

+ Chapter 1 Exploring Data

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

Bar Graphs and Dot Plots

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller

6.4 Normal Distribution

Probability and Statistics Vocabulary List (Definitions for Middle School Teachers)

Chapter 3. The Normal Distribution

Descriptive statistics; Correlation and regression

9. Sampling Distributions

Descriptive Statistics

6 3 The Standard Normal Distribution

Exploratory Spatial Data Analysis

Module 2: Introduction to Quantitative Data Analysis

Transcription:

Math 140 Introductory Statistics Math 140 tutoring: LIVE OAK 1319 MW 11:30-4:30 TTh 3:30-5:30 F 10:30-12:30 General Hours: M - Th 10:00-5:30 F 10:00-3:00 Later: Saturday from 11 to 2.

Last time Uniform - rectangular distribution Normal distribution mean inflection points standard deviation

Skewed distributions Not symmetric curves Data is bunched on one end and a tail appears on the other side

New tools Median: The value of the line dividing the number of values in equal halves The area (or the number of points) to the left or to the right of the median are equal

New tools Quartiles: Once you have found the median, look at the left of the distribution and repeat the same procedure. This new value is called the lower quartile Q1 Repeat on the right, and find the upper quartile Q3

Median, lower and upper quartiles They divide the distribution in quarters. How much data is contained between Q1 and Q3?

Median, lower and upper quartiles They divide the distribution in quarters. How much data is contained between Q1 and Q3? 50%

Example - the weight of bears Find median, Q1 and Q3

Example - the weight of bears Median ~ 155 lb Q1 ~ 115 lb Q3 ~ 250 lb

Outliers, gaps and clusters outliers are special values that stand out when we look at the distribution mistakes? Just flukes (a really really big bear!) sometimes they can lead to interesting discoveries gaps and clusters informal definitions

Outliers, gaps and clusters Lord Rayleigh s densities of nitrogen what is different between the two? why two clusters?

Outliers, gaps and clusters Chemically produced Atmospheric There might be something else in the atmosphere!

Bimodal distributions Some distributions have two peaks instead of one Unimodal (one peak) Bimodal (two peaks) Multimodal (many peaks)

Example Bimodal - what to make of this? is there other info we can use?

Splitting data Africa - spread out Europe - skewed to left

Quantitative vs. categorical data Quantitative : data in form of numbers that can be compared and that can take a large range of values Categorical : a case can belong to a category or not

How to look at quantitative data?

1. Dot plots Each dot represents a case Dots may represent more than one case (one dot may represent 1000 cases - USA births) We can use different symbols for different Categories of data

Dot plots work best when Relatively small number of values to plot Want to keep track of individuals Want to see the shape of the distribution Have one group or a small number of groups that we want to compare Making plots by hand

2. Histograms Similar to dot plots but where data is grouped Groups of cases represented as rectangles or bars The vertical axis gives the number of cases (called frequency or count) By convention borderline values go to the bar on the right. There is no prescribed number for the width of the bars.

Random numbers Dot plot Histogram

Histograms A histogram is like a coarse grained dot plot bins on the x-axis frequency on the y-axis We can choose bin size any way we like

Relative Frequency The sum of all heights is one

Frequency and Relative frequency actual occurences percent of total (in this case divide by 1000)

Different bin choices Speed of mammal species Using two bar widths THERE IS NO RIGHT OR WRONG

Histograms work best when Large number of values to plot Don t need to see individual values exactly Don t want to see exact shape of distribution Have one distribution to look at Use a calculator or computer

3. Stemplots Speeds of mammals (mph) 11, 12, 20, 25, 30, 30, 30, 32, 35, 39, 40, 40, 40, 42, 45, 48, 50, 70

3. Stemplots Speeds of mammals (mph) 11, 12, 20, 25, 30, 30, 30, 32, 35, 39, 40, 40, 40, 42, 45, 48, 50, 70 1 1 2

3. Stemplots Speeds of mammals (mph) 11, 12, 20, 25, 30, 30, 30, 32, 35, 39, 40, 40, 40, 42, 45, 48, 50, 70 3 0 0 0 2 5 9

3. Stemplots

3. Stemplots Or stem-and-leaf plots Numbers on the left are called stems (the first digits of the data value) Numbers on the right are called leaves (the last digit of the data value)

Split stemplots

Split stemplots The unit digits 0,1,2,3,4 are associated with the first stem and they are placed on the first line. The unit digits 5,6,7,8,9 are associated with the second stem and they are placed on the second line.

Back to back stemplots The data is differentiated on whether the mammals are predators or non-predators

Who has the faster speed?

Calculating medians and quartiles

Stemplots work best when Small number of values to plot Want to keep track of individual values (at least approximately) Want to see shape of distribution Have two or more groups that we want to compare

Hk Page 45 E16, E17 a/b, E13, E14