# If the Shoe Fits! Overview of Lesson GAISE Components Common Core State Standards for Mathematical Practice

Save this PDF as:

Size: px
Start display at page:

Download "If the Shoe Fits! Overview of Lesson GAISE Components Common Core State Standards for Mathematical Practice"

## Transcription

1 If the Shoe Fits! Overview of Lesson In this activity, students explore and use hypothetical data collected on student shoe print lengths, height, and gender in order to help develop a tentative description of a person who entered a school s grounds over a weekend without permission. Graphs such as comparative boxplots and scatterplots are drawn to illustrate the data. Measures of center (median, mean) and spread (range, Interquartile Range (IQR), Mean Absolute Deviation (MAD)) are computed. Conclusions are drawn based upon the data analysis in the context of question(s) asked. GAISE Components This investigation follows the four components of statistical problem solving put forth in the Guidelines for Assessment and Instruction in Statistics Education (GAISE) Report. The four components are: formulate a question, design and implement a plan to collect data, analyze the data by measures and graphs, and interpret the results in the context of the original question. This is a GAISE Level B activity. Common Core State Standards for Mathematical Practice 1. Make sense of problems and persevere in solving them. 2. Reason abstractly and quantitatively. 3. Construct viable arguments and critique the reasoning of others. 4. Model with mathematics. 5. Use appropriate tools strategically. Common Core State Standard Grade Level Content (High School) S-ID. 1. Represent data with plots on the real number line (dot plots, histograms, and box plots). S-ID. 2. Use statistics appropriate to the shape of the data distribution to compare center (median, mean) and spread (interquartile range, standard deviation) of two or more different data sets. S-ID. 3. Interpret differences in shape, center, and spread in the context of the data sets, accounting for possible effects of extreme data points (outliers). S-ID. 6. Represent data on two quantitative variables on a scatter plot, and describe how the variables are related. S-IC. 3. Recognize the purposes of and differences among sample surveys, experiments, and observational studies; explain how randomization relates to each. NCTM Principles and Standards for School Mathematics Data Analysis and Probability Standards for Grades 9-12 Formulate questions that can be addressed with data and collect, organize, and display relevant data to answer them: understand the differences among various kinds of studies and which types of inferences can legitimately be drawn from each; know the characteristics of well-designed studies, including the role of randomization in surveys and experiments; understand the meaning of measurement data and categorical data, of univariate and bivariate data, and of the term variable; 1

2 understand histograms, parallel box plots, and scatterplots and use them to display data; compute basic statistics and understand the distinction between a statistic and a parameter. Select and use appropriate statistical methods to analyze data: for univariate measurement data, be able to display the distribution, describe its shape, and select and calculate summary statistics; for bivariate measurement data, be able to display a scatterplot, describe its shape, and determine regression coefficients, regression equations, and correlation coefficients using technological tools; display and discuss bivariate data where at least one variable is categorical. Prerequisites Students will have knowledge of calculating numerical summaries for one variable (mean, median, Mean Absolute Deviation (MAD), five-number summary). Students will have knowledge of how to construct boxplots and scatterplots. Students will have some familiarity with general concepts from linear regression. Learning Targets Students will be able to calculate numerical summaries and use them to compare two data sets. Students will be able to determine if any data values are outliers. Students will be able to display data in comparative boxplots and use the plot to compare two data sets. Students will be able to display the relationship between two variables on a scatterplot and interpret the resulting plot. Time Required 1 class period. Materials Required Pencil and (graphing) paper; graphing calculator or statistical software package. Instructional Lesson Plan The GAISE Statistical Problem-Solving Procedure I. Formulate Question(s) Begin the lesson by discussing a hypothetical background: Welcome to CSI at School! Over the weekend, a student entered the school grounds without permission. Even though it appears that the culprit was just looking for a quiet place to study undisturbed by friends, school administrators are anxious to identify the offender and have asked for your help. The only available evidence is a suspicious footprint found outside the library door. Ask students to write some questions that they would be interested in investigating about students shoeprint lengths. Some possible questions might be: 1. What kinds of lengths do shoeprints have? What is the shortest shoe print length expected to be? What is the longest shoe print length expected to be? 2. Are there differences in the shoe print lengths for males and females? If so, what are the differences? 2

3 3. Are shoe print lengths related to any other variables? The investigation that follows is based upon some of the questions that might be posed by students and on attempting to answer the overall question of identifying the offender. II. Design and Implement a Plan to Collect the Data Explain the following hypothetical scenario for data collection: After the incident, school administrators arranged for data to be obtained from a random sample of this high school s students. The data table shows the shoe print length (in cm), height (in inches), and gender for each individual in the sample. Shoe Print Length Height Gender Shoe Print Length Height Gender F F M F F M F F F F M M M F M M F F F F F F F M F M M F F M F F F F F F F F F Ask students to explain why this is an observational study and not an experiment. Students should note that in this context, nothing has been done deliberately to the students in order to measure their responses. From direct observation and measurement, data values were recorded for each student s height, shoe print length, and gender. Ask students to think about why the school s administrators chose to collect data on a random sample of students from the school. What benefit might a random sample offer? Students should note that whenever possible, random selection should be used to choose samples for an observational study. By using random selection, chance determines which individuals are included in the sample. This helps ensure that a sample is representative of the population from 3

4 which it was chosen. Random selection allows the researchers to generalize sample results to a larger population of interest. III. Analyze the Data Have students begin the investigation by performing data analyses to help determine the gender of the offender. By using appropriate graphs and numerical calculations students can compare shoe print lengths for males and females. Ask students to suggest graphs that might be used to use to compare the shoe print length data distributions for females and males. Comparative graphs such as dotplots or boxplots are appropriate for displaying these data. Students describe one advantage of using comparative dotplots instead of comparative boxplots to display these data. Students describe one advantage of using comparative boxplots instead of comparative dotplots to display these data. Comparative dotplots have the advantage of showing each individual data value while comparative boxplots are useful for comparing the percentiles of the two distributions. After a discussion, ask students to construct comparative boxplots for the shoe print lengths for males and females. First, have students calculate five-number summaries of the shoe print length data for males and females. Additionally, have students determine if there are any outlying shoeprint length values. The five-number summaries of the shoe print length data are shown in the Table below. Minimum Quartile 1 Median (Q2) Quartile 3 Maximum (Q1) (Q3) Male Female Note that the median is the halfway point in the data set. Also note that the first quartile is the median of the data points strictly below the median of the data set, and the third quartile is the median of the data points strictly above the median of the data set. Demonstrate to students that in order to check for outlying shoe print lengths for the males: (1) the interquartile range (IQR) is calculated: Q3 Q1 = = 2 cm (2) the IQR is multiplied by 1.5: 1.5(2) = 3 cm (3) 3 is subtracted from Q1: 30 3 = 27 cm (4) 3 is added to Q3: = 35 cm Any shoe print length values smaller than 27 cm or greater than 35 cm are outlying values. There is one male student with an outlying shoe print length of 37 cm. In order to check for outlying shoe print lengths for the females: (1) the interquartile range (IQR) is calculated: Q3 Q1 = = 2.4 cm (2) 1.5(IQR) is 1.5(2.4) = 3.6 cm (3) = 21.4 cm (4) = 31 cm Any shoe print length values smaller than 21.4 cm or greater than 31 cm are outlying values. There is one female student with an outlying shoe print length of 31 cm. 4

5 Now ask students to construct comparative boxplots to display the shoe print length distributions. The Figure below illustrates the comparative boxplots. Discuss with students how to interpret the boxplots. Students should understand that there are about the same number of shoe print lengths between the minimum and Q1, Q1 to Q2, Q2 to Q3, and Q3 to the maximum; or approximately 25% of the data will lie in each of these four intervals. Ask students to describe the similarities and differences in the shoe print length distributions for the males and females in the sample. The box plots show that the females tended to have shorter shoe print lengths than the males. The median shoe print length for females was much lower than for males. The interquartile range for females is much lower than the interquartile range for males. All shoe print lengths for males are longer than at least 75% of the shoe print lengths for females. There is one high outlier in the female group and one high outlier in the male group. The next step in the analysis of the data is to focus on measures of center. The mean and median shoe print lengths for males are and 31 cm; respectively. For females, the mean and median lengths were and 26.5 cm; respectively. As expected, it seems that a typical male shoe print length was longer than a typical female shoe print length by about 5 cm. Students are now asked to characterize the spread of the shoe print length distributions. Ask students to begin by calculating the range in shoe print lengths. For males the data cover the interval from the minimum of 29 cm to the maximum of 37 cm so the range in lengths is 8 cm. 5

7 female? How sure are you that you are correct? Ask students to explain their reasoning. Students will say that a shoe print length of 32 cm is beyond the maximum shoe print length for the 28 females in the sample. Further the maximum female shoe print length of 31 cm was determined to be an outlier. It is highly unlikely that a shoe print 32 cm long was left by a female. Thus the offending student is most likely not a female. Note that the mean shoe print length for the 11 sampled males is cm so a shoe print length of 32 cm long would not be unexpected for a male. Now, ask students: how would you answer if the suspect s shoe print length was 27 cm? Students will note that a shoe print length of 27 cm is 2 cm below the minimum shoe print length for the 11 sampled males. It is highly unlikely that a shoe print 27 cm long was left by a male. The offending student is most likely not a male. The mean shoe print length for the 28 sampled females is cm with a MAD of 1.50 cm. Further; students will note that a shoe print length of 27 cm is only.4 cm below the third quartile of the shoe print lengths for females. We would expect close to 25% of female students to have a shoe print length of 27 cm or longer. It is much more likely for a female student to leave a shoe print 27 cm long than it is for a male student. In the next phase of analysis have students explore the relationship between height (in inches) and length of shoe print (in cm). Discuss which variable should be the explanatory (independent) variable and which variable should be the response (dependent) variable. Here we would expect a student s shoe print length to help explain the height of the student (since the shoe print length of the offender is assumed to be known). Ask students to construct a scatterplot of height versus shoe print length using different colors or different plotting symbols to represent the data for males and females. The Figure below illustrates the scatterplot. Ask students if it looks like there is an overall linear relationship between height and shoe print length. The scatterplot does indicate a somewhat strong, positive, linear association between 7

9 acknowledge that since a random sample of student data was utilized in the analysis; looking beyond the data is feasible; however, we always need to be mindful of sampling error and sampling variability. Assessment 1. A question in the 2002 General Social Survey (GSS) conducted by the national Opinion Research Center asked participants how long they spend on each week. A summary of responses (hours) for 1881 respondents follows. (a) Explain how the summary statistics show us that at least 25% of the respondents said that they do not use . (b) What is the interval that contains the lower 50% of the responses? (c) What is the interval that contains the upper 50% of the responses? (d) Explain whether the maximum value, 70 hours, would be marked as an outlier on a boxplot. Answer: (a) The lower quartile Q1 = 0 hours. By definition, 25% of the values are at or below the lower quartile. (b) 0 to 2 hours (Minimum to Median). (c) 2 to 70 hours (Median to Maximum). (d) Yes, 70 hours would be marked as an outlier. The boundary defining an outlier on the high side is Q IQR = (5-0) = 12.5 hours. 2. The figure below is a boxplot for comparing tip percentages for a male and a female restaurant server, each of whom drew happy faces on the checks of randomly selected dining parties. Discuss the ways in which the tip percentage distributions for the two servers differ. Answer: Generally females tended to have higher tip percentages. The median is clearly greater for females than it is for males. Note that the first quartile for females is above the third quartile for the males. As can be seen from the overall range and the interquartile range, the data for the 9

10 females shows greater spread than the data for the males. There is one high outlier for the females. The males do not have any outliers. Possible Extensions (to GAISE Level C) 1. Determine the estimated regression equation for predicting a student s shoe print length (in cm) from the student s height (in inches). (a) Interpret the slope of the estimated regression equation in the context of this problem. (b) Predict the height of a student who has a shoe print length of 30 cm. (c) Calculate the residual for a student who has a shoe print length of 30 cm. 2. Give the numerical value and interpret the Pearson correlation coefficient between a student s shoe print length (in cm) and a student s height (in inches). References 1. Guidelines for Assessment and Instruction in Statistics Education (GAISE) Report, ASA, Franklin et al., ASA, Adapted from an activity appearing in Making Sense of Statistical Studies by Peck and Starnes (with Kranendonk and Morita), ASA, Adapted from the Investigation: How Long are our Shoes? In Bridging the Gap Between the Common Core State Standards and Teaching Statistics (2012, in press). Authors: Pat Hopfensperger, Tim Jacobbe, Deborah Lurie, and Jerry Moreno. 4. CSI Stats: Helping Students Become Data Detectives with the GAISE Framework presented by Daren Starnes. Recorded web-based seminar Assessment questions from: Mind on Statistics, Third Edition by Utts/Heckard, Cengage Learning. 10

11 If the Shoe Fits! Activity Sheet Welcome to CSI at School! Over the weekend, a student entered the school grounds without permission. Even though it appears that the culprit was just looking for a quiet place to study undisturbed by friends, school administrators are anxious to identify the offender and have asked for your help. The only available evidence is a suspicious footprint outside the library door. After the incident, school administrators arranged for the data in the table below to be obtained from a random sample of this high school s students. The table shows the shoe print length (in cm), height (in inches), and gender for each individual in the sample. Shoe Print Length Height Gender Shoe Print Length Height Gender F F M F F M F F F F M M M F M M F F F F F F F M F M M F F M F F F F F F F F F 1. Explain why this study was an observational study and not an experiment. 2. Why do you think the school s administrators chose to collect data on a random sample of students from the school? What benefit might a random sample offer? 3. Suggest a graph that might be used to use to compare the shoe print length data distributions for females and males. 4. Describe one advantage of using comparative boxplots instead of comparative dotplots to display these data. 11

12 5. For each gender calculate the five-number summary for the shoe print lengths. Additionally, for each gender, determine if there are any outlying shoe print length values. 6. Construct comparative boxplots for the shoe print lengths of males and females. Discuss the similarities and differences in the shoe print length distributions for the males and females in this sample. 7. For each gender calculate the mean shoe print length. What information does the mean shoe print length provide? 8. The mean will give us an indication of a typical shoe print length. In addition to knowing a typical length we would also like to know how much variability to expect around this length. For each gender calculate the Range; Interquartile Range; and Mean Absolute Deviation of the shoe print lengths. Interpret each of the calculated values. 9. If the length of a student s shoe print was 32 cm, would you think that the print was made by a male or a female? How sure are you that you are correct? Explain your reasoning. Use results from Questions 5 through 8 in your explanation. 10. How would you answer Question 9 if the suspect s shoe print length was 27 cm? 11. Construct a scatterplot of height (vertical scale) versus shoe print length horizontal scale) using different colors or different plotting symbols to represent the data for males and females. (a) Interpret the scatterplot. Does it look like there is a linear relationship between height and shoe print length? Explain. 12

13 (b) Does it look like the same straight line could be used to summarize the relationship between shoe print length and height for both males and females? Explain. (c) Based on the scatterplot, if a student s shoe print length was 30 cm, approximately what height would you predict for the person who made the shoe print? Explain how you arrived at your prediction. 12. Suppose that the shoe print found outside the library actually had a length of 30 cm. Based on the given data and your answers to Questions 5 through 11, write a description of the person who you think may have left the shoe print. Explain the reasoning that led to your description and give some indication of how confident you are that your description is correct. 13

### Cell Phone Impairment?

Cell Phone Impairment? Overview of Lesson This lesson is based upon data collected by researchers at the University of Utah (Strayer and Johnston, 2001). The researchers asked student volunteers (subjects)

### Cents and the Central Limit Theorem Overview of Lesson GAISE Components Common Core State Standards for Mathematical Practice

Cents and the Central Limit Theorem Overview of Lesson In this lesson, students conduct a hands-on demonstration of the Central Limit Theorem. They construct a distribution of a population and then construct

### Armspans. Published: August 2012. Debra L. Hydorn University of Mary Washington

Armspans Debra L. Hydorn University of Mary Washington dhydorn@umw.edu Published: August 2012 Overview of Lesson In this activity students conduct an investigation to compare the arm span lengths of the

### What Does the Normal Distribution Sound Like?

What Does the Normal Distribution Sound Like? Ananda Jayawardhana Pittsburg State University ananda@pittstate.edu Published: June 2013 Overview of Lesson In this activity, students conduct an investigation

### Exercise 1.12 (Pg. 22-23)

Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

### Common Core State Standards for Mathematical Practice 4. Model with mathematics. 7. Look for and make use of structure.

Who Sends the Most Text Messages? Written by: Anna Bargagliotti and Jeanie Gibson (for Project-SET) Loyola Marymount University and Hutchison School abargagl@lmu.edu, jgibson@hutchisonschool.org, www.project-set.com

### What is the Probability of Pigging Out

What is the Probability of Pigging Out Mary Richardson Susan Haller Grand Valley State University St. Cloud State University richamar@gvsu.edu skhaller@stcloudstate.edu Published: April 2012 Overview of

### Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables

### The United States of Obesity

The United States of Obesity Matt Malloure Grand Valley State University matt.malloure@gmail.com Diann Reischman Grand Valley State University reischmd@gvsu.edu Mary Richardson Grand Valley State University

### Home Runs, Statistics, and Probability

NATIONAL MATH + SCIENCE INITIATIVE Mathematics American League AL Central AL West AL East National League NL West NL East Level 7 th grade in a unit on graphical displays Connection to AP* Graphical Display

### MTH 140 Statistics Videos

MTH 140 Statistics Videos Chapter 1 Picturing Distributions with Graphs Individuals and Variables Categorical Variables: Pie Charts and Bar Graphs Categorical Variables: Pie Charts and Bar Graphs Quantitative

### Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

### Chapter 3: Data Description Numerical Methods

Chapter 3: Data Description Numerical Methods Learning Objectives Upon successful completion of Chapter 3, you will be able to: Summarize data using measures of central tendency, such as the mean, median,

### Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

### College of the Canyons Math 140 Exam 1 Amy Morrow. Name:

Name: Answer the following questions NEATLY. Show all necessary work directly on the exam. Scratch paper will be discarded unread. One point each part unless otherwise marked. 1. Owners of an exercise

### Fairfield Public Schools

Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

### PowerTeaching i3: Algebra I Mathematics

PowerTeaching i3: Algebra I Mathematics Alignment to the Common Core State Standards for Mathematics Standards for Mathematical Practice and Standards for Mathematical Content for Algebra I Key Ideas and

### AP Statistics 2001 Solutions and Scoring Guidelines

AP Statistics 2001 Solutions and Scoring Guidelines The materials included in these files are intended for non-commercial use by AP teachers for course and exam preparation; permission for any other use

### Correlation and Regression

Correlation and Regression Scatterplots Correlation Explanatory and response variables Simple linear regression General Principles of Data Analysis First plot the data, then add numerical summaries Look

### COMMON CORE STATE STANDARDS FOR

COMMON CORE STATE STANDARDS FOR Mathematics (CCSSM) High School Statistics and Probability Mathematics High School Statistics and Probability Decisions or predictions are often based on data numbers in

### Chapter 10 - Practice Problems 1

Chapter 10 - Practice Problems 1 1. A researcher is interested in determining if one could predict the score on a statistics exam from the amount of time spent studying for the exam. In this study, the

### Stats Review Chapters 3-4

Stats Review Chapters 3-4 Created by Teri Johnson Math Coordinator, Mary Stangler Center for Academic Success Examples are taken from Statistics 4 E by Michael Sullivan, III And the corresponding Test

### AP * Statistics Review. Descriptive Statistics

AP * Statistics Review Descriptive Statistics Teacher Packet Advanced Placement and AP are registered trademark of the College Entrance Examination Board. The College Board was not involved in the production

### Relationships Between Two Variables: Scatterplots and Correlation

Relationships Between Two Variables: Scatterplots and Correlation Example: Consider the population of cars manufactured in the U.S. What is the relationship (1) between engine size and horsepower? (2)

### The Comparisons. Grade Levels Comparisons. Focal PSSM K-8. Points PSSM CCSS 9-12 PSSM CCSS. Color Coding Legend. Not Identified in the Grade Band

Comparison of NCTM to Dr. Jim Bohan, Ed.D Intelligent Education, LLC Intel.educ@gmail.com The Comparisons Grade Levels Comparisons Focal K-8 Points 9-12 pre-k through 12 Instructional programs from prekindergarten

### Box-and-Whisker Plots

Mathematics Box-and-Whisker Plots About this Lesson This is a foundational lesson for box-and-whisker plots (boxplots), a graphical tool used throughout statistics for displaying data. During the lesson,

### Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation

Lecture 11: Chapter 5, Section 3 Relationships between Two Quantitative Variables; Correlation Display and Summarize Correlation for Direction and Strength Properties of Correlation Regression Line Cengage

### Statistics courses often teach the two-sample t-test, linear regression, and analysis of variance

2 Making Connections: The Two-Sample t-test, Regression, and ANOVA In theory, there s no difference between theory and practice. In practice, there is. Yogi Berra 1 Statistics courses often teach the two-sample

### This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

Algebra I Overview View unit yearlong overview here Many of the concepts presented in Algebra I are progressions of concepts that were introduced in grades 6 through 8. The content presented in this course

### 6 th Grade Math 4 th Quarter Unit 5: Geometry Topic C: Volume of Right Rectangular Prisms

HIGLEY UNIFIED SCHOOL DISTRICT INSTRUCTIONAL ALIGNMENT 6 th Grade Math 4 th Quarter Unit 5: Geometry Topic C: Volume of Right Rectangular Prisms In Grade 5, students recognized volume as an attribute of

### Hints for Success on the AP Statistics Exam. (Compiled by Zack Bigner)

Hints for Success on the AP Statistics Exam. (Compiled by Zack Bigner) The Exam The AP Stat exam has 2 sections that take 90 minutes each. The first section is 40 multiple choice questions, and the second

### Infinite Algebra 1 supports the teaching of the Common Core State Standards listed below.

Infinite Algebra 1 Kuta Software LLC Common Core Alignment Software version 2.05 Last revised July 2015 Infinite Algebra 1 supports the teaching of the Common Core State Standards listed below. High School

### Report of for Chapter 2 pretest

Report of for Chapter 2 pretest Exam: Chapter 2 pretest Category: Organizing and Graphing Data 1. "For our study of driving habits, we recorded the speed of every fifth vehicle on Drury Lane. Nearly every

### Bar Graphs and Dot Plots

CONDENSED L E S S O N 1.1 Bar Graphs and Dot Plots In this lesson you will interpret and create a variety of graphs find some summary values for a data set draw conclusions about a data set based on graphs

### . 58 58 60 62 64 66 68 70 72 74 76 78 Father s height (inches)

PEARSON S FATHER-SON DATA The following scatter diagram shows the heights of 1,0 fathers and their full-grown sons, in England, circa 1900 There is one dot for each father-son pair Heights of fathers and

### Current Standard: Mathematical Concepts and Applications Shape, Space, and Measurement- Primary

Shape, Space, and Measurement- Primary A student shall apply concepts of shape, space, and measurement to solve problems involving two- and three-dimensional shapes by demonstrating an understanding of:

### Walk the Line Written by: Maryann Huey Drake University Maryann.Huey@drake.edu

Walk the Line Written by: Maryann Huey Drake University Maryann.Huey@drake.edu Overview of Lesson In this activity, students will conduct an investigation to collect data to determine how far students

### Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a

### Pearson s correlation

Pearson s correlation Introduction Often several quantitative variables are measured on each member of a sample. If we consider a pair of such variables, it is frequently of interest to establish if there

### Simple linear regression

Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

### Introductory Statistics Notes

Introductory Statistics Notes Jamie DeCoster Department of Psychology University of Alabama 348 Gordon Palmer Hall Box 870348 Tuscaloosa, AL 35487-0348 Phone: (205) 348-4431 Fax: (205) 348-8648 August

### IQR Rule for Outliers

1. Arrange data in order. IQR Rule for Outliers 2. Calculate first quartile (Q1), third quartile (Q3) and the interquartile range (IQR=Q3-Q1). CO2 emissions example: Q1=0.9, Q3=6.05, IQR=5.15. 3. Compute

### Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

### II. DISTRIBUTIONS distribution normal distribution. standard scores

Appendix D Basic Measurement And Statistics The following information was developed by Steven Rothke, PhD, Department of Psychology, Rehabilitation Institute of Chicago (RIC) and expanded by Mary F. Schmidt,

### F. Farrokhyar, MPhil, PhD, PDoc

Learning objectives Descriptive Statistics F. Farrokhyar, MPhil, PhD, PDoc To recognize different types of variables To learn how to appropriately explore your data How to display data using graphs How

### The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

### Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam. Software Profiling Seminar, Statistics 101

Dr. Peter Tröger Hasso Plattner Institute, University of Potsdam Software Profiling Seminar, 2013 Statistics 101 Descriptive Statistics Population Object Object Object Sample numerical description Object

### STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

### Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

### HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Mathematics Revision Guides Histograms, Cumulative Frequency and Box Plots Page 1 of 25 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

### South Carolina College- and Career-Ready (SCCCR) Probability and Statistics

South Carolina College- and Career-Ready (SCCCR) Probability and Statistics South Carolina College- and Career-Ready Mathematical Process Standards The South Carolina College- and Career-Ready (SCCCR)

### Mind on Statistics. Chapter 2

Mind on Statistics Chapter 2 Sections 2.1 2.3 1. Tallies and cross-tabulations are used to summarize which of these variable types? A. Quantitative B. Mathematical C. Continuous D. Categorical 2. The table

### SPSS for Exploratory Data Analysis Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav)

Data used in this guide: studentp.sav (http://people.ysu.edu/~gchang/stat/studentp.sav) Organize and Display One Quantitative Variable (Descriptive Statistics, Boxplot & Histogram) 1. Move the mouse pointer

### This curriculum is part of the Educational Program of Studies of the Rahway Public Schools. ACKNOWLEDGMENTS

CURRICULUM FOR STATISTICS & PROBABILITY GRADES 11 & 12 This curriculum is part of the Educational Program of Studies of the Rahway Public Schools. ACKNOWLEDGMENTS Christine H. Salcito, Director of Curriculum

### Inferential Statistics

Inferential Statistics Sampling and the normal distribution Z-scores Confidence levels and intervals Hypothesis testing Commonly used statistical methods Inferential Statistics Descriptive statistics are

### UNIT 1: COLLECTING DATA

Core Probability and Statistics Probability and Statistics provides a curriculum focused on understanding key data analysis and probabilistic concepts, calculations, and relevance to real-world applications.

### The Importance of Statistics Education

The Importance of Statistics Education Professor Jessica Utts Department of Statistics University of California, Irvine http://www.ics.uci.edu/~jutts jutts@uci.edu Outline of Talk What is Statistics? Four

### Simple Linear Regression in SPSS STAT 314

Simple Linear Regression in SPSS STAT 314 1. Ten Corvettes between 1 and 6 years old were randomly selected from last year s sales records in Virginia Beach, Virginia. The following data were obtained,

### Statistics 151 Practice Midterm 1 Mike Kowalski

Statistics 151 Practice Midterm 1 Mike Kowalski Statistics 151 Practice Midterm 1 Multiple Choice (50 minutes) Instructions: 1. This is a closed book exam. 2. You may use the STAT 151 formula sheets and

### Univariate Regression

Univariate Regression Correlation and Regression The regression line summarizes the linear relationship between 2 variables Correlation coefficient, r, measures strength of relationship: the closer r is

### The correlation coefficient

The correlation coefficient Clinical Biostatistics The correlation coefficient Martin Bland Correlation coefficients are used to measure the of the relationship or association between two quantitative

### Measurement with Ratios

Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical

### Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Introduction to Statistics for Psychology and Quantitative Methods for Human Sciences Jonathan Marchini Course Information There is website devoted to the course at http://www.stats.ox.ac.uk/ marchini/phs.html

### AP Statistics Semester Exam Review Chapters 1-3

AP Statistics Semester Exam Review Chapters 1-3 1. Here are the IQ test scores of 10 randomly chosen fifth-grade students: 145 139 126 122 125 130 96 110 118 118 To make a stemplot of these scores, you

### Technology Step-by-Step Using StatCrunch

Technology Step-by-Step Using StatCrunch Section 1.3 Simple Random Sampling 1. Select Data, highlight Simulate Data, then highlight Discrete Uniform. 2. Fill in the following window with the appropriate

### Problem Solving and Data Analysis

Chapter 20 Problem Solving and Data Analysis The Problem Solving and Data Analysis section of the SAT Math Test assesses your ability to use your math understanding and skills to solve problems set in

### Statistics 2014 Scoring Guidelines

AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

### 2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

Math 1530-017 Exam 1 February 19, 2009 Name Student Number E There are five possible responses to each of the following multiple choice questions. There is only on BEST answer. Be sure to read all possible

### Chapter 5: The normal approximation for data

Chapter 5: The normal approximation for data Context................................................................... 2 Normal curve 3 Normal curve.............................................................

### Chapter 15 Multiple Choice Questions (The answers are provided after the last question.)

Chapter 15 Multiple Choice Questions (The answers are provided after the last question.) 1. What is the median of the following set of scores? 18, 6, 12, 10, 14? a. 10 b. 14 c. 18 d. 12 2. Approximately

### The Big Picture. Describing Data: Categorical and Quantitative Variables Population. Descriptive Statistics. Community Coalitions (n = 175)

Describing Data: Categorical and Quantitative Variables Population The Big Picture Sampling Statistical Inference Sample Exploratory Data Analysis Descriptive Statistics In order to make sense of data,

### 4. Describing Bivariate Data

4. Describing Bivariate Data A. Introduction to Bivariate Data B. Values of the Pearson Correlation C. Properties of Pearson's r D. Computing Pearson's r E. Variance Sum Law II F. Exercises A dataset with

### Common Core Unit Summary Grades 6 to 8

Common Core Unit Summary Grades 6 to 8 Grade 8: Unit 1: Congruence and Similarity- 8G1-8G5 rotations reflections and translations,( RRT=congruence) understand congruence of 2 d figures after RRT Dilations

### LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE MAT 119 STATISTICS AND ELEMENTARY ALGEBRA 5 Lecture Hours, 2 Lab Hours, 3 Credits Pre-

### What is the purpose of this document? What is in the document? How do I send Feedback?

This document is designed to help North Carolina educators teach the Common Core (Standard Course of Study). NCDPI staff are continually updating and improving these tools to better serve teachers. Statistics

### 3. Data Analysis, Statistics, and Probability

3. Data Analysis, Statistics, and Probability Data and probability sense provides students with tools to understand information and uncertainty. Students ask questions and gather and use data to answer

### Describing, Exploring, and Comparing Data

24 Chapter 2. Describing, Exploring, and Comparing Data Chapter 2. Describing, Exploring, and Comparing Data There are many tools used in Statistics to visualize, summarize, and describe data. This chapter

### Problem of the Month Through the Grapevine

The Problems of the Month (POM) are used in a variety of ways to promote problem solving and to foster the first standard of mathematical practice from the Common Core State Standards: Make sense of problems

### MULTIPLE REGRESSION EXAMPLE

MULTIPLE REGRESSION EXAMPLE For a sample of n = 166 college students, the following variables were measured: Y = height X 1 = mother s height ( momheight ) X 2 = father s height ( dadheight ) X 3 = 1 if

### NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS

NEW YORK STATE TEACHER CERTIFICATION EXAMINATIONS TEST DESIGN AND FRAMEWORK September 2014 Authorized for Distribution by the New York State Education Department This test design and framework document

### 1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number A. 3(x - x) B. x 3 x C. 3x - x D. x - 3x 2) Write the following as an algebraic expression

### Name: Date: Use the following to answer questions 2-3:

Name: Date: 1. A study is conducted on students taking a statistics class. Several variables are recorded in the survey. Identify each variable as categorical or quantitative. A) Type of car the student

### Georgia Department of Education Common Core Georgia Performance Standards Framework Teacher Edition Coordinate Algebra Unit 4

Equal Salaries for Equal Work? Mathematical Goals Represent data on a scatter plot Describe how two variables are related Informally assess the fit of a function by plotting and analyzing residuals Fit

### CC Investigation 5: Histograms and Box Plots

Content Standards 6.SP.4, 6.SP.5.c CC Investigation 5: Histograms and Box Plots At a Glance PACING 3 days Mathematical Goals DOMAIN: Statistics and Probability Display numerical data in histograms and

### Once saved, if the file was zipped you will need to unzip it. For the files that I will be posting you need to change the preferences.

1 Commands in JMP and Statcrunch Below are a set of commands in JMP and Statcrunch which facilitate a basic statistical analysis. The first part concerns commands in JMP, the second part is for analysis

### 10-3 Measures of Central Tendency and Variation

10-3 Measures of Central Tendency and Variation So far, we have discussed some graphical methods of data description. Now, we will investigate how statements of central tendency and variation can be used.

### STAT 155 Introductory Statistics. Lecture 5: Density Curves and Normal Distributions (I)

The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Introductory Statistics Lecture 5: Density Curves and Normal Distributions (I) 9/12/06 Lecture 5 1 A problem about Standard Deviation A variable

### Using GAISE and NCTM Standards as Frameworks for Teaching Probability and Statistics to Pre-Service Elementary and Middle School Mathematics Teachers

Using GAISE and NCTM Standards as Frameworks for Teaching Probability and Statistics to Pre-Service Elementary and Middle School Mathematics Teachers Mary Louise Metz Indiana University of Pennsylvania

### DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

DESCRIPTIVE STATISTICS The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses. DESCRIPTIVE VS. INFERENTIAL STATISTICS Descriptive To organize,

### Point and Interval Estimates

Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

### The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces

The Effect of Dropping a Ball from Different Heights on the Number of Times the Ball Bounces Or: How I Learned to Stop Worrying and Love the Ball Comment [DP1]: Titles, headings, and figure/table captions

### Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

Center: Finding the Median When we think of a typical value, we usually look for the center of the distribution. For a unimodal, symmetric distribution, it s easy to find the center it s just the center

### Research Variables. Measurement. Scales of Measurement. Chapter 4: Data & the Nature of Measurement

Chapter 4: Data & the Nature of Graziano, Raulin. Research Methods, a Process of Inquiry Presented by Dustin Adams Research Variables Variable Any characteristic that can take more than one form or value.

### Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

### The Big 50 Revision Guidelines for S1

The Big 50 Revision Guidelines for S1 If you can understand all of these you ll do very well 1. Know what is meant by a statistical model and the Modelling cycle of continuous refinement 2. Understand

### Foundations for Functions

Activity: TEKS: Overview: Materials: Grouping: Time: Crime Scene Investigation (A.2) Foundations for functions. The student uses the properties and attributes of functions. The student is expected to:

### Descriptive Statistics

Y520 Robert S Michael Goal: Learn to calculate indicators and construct graphs that summarize and describe a large quantity of values. Using the textbook readings and other resources listed on the web