First Midterm Exam (MATH1070 Spring 2012)



Similar documents
Chapter 1: Exploring Data

Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs

Name: Date: Use the following to answer questions 2-3:

AP * Statistics Review. Descriptive Statistics

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

Statistics 151 Practice Midterm 1 Mike Kowalski

Second Midterm Exam (MATH1070 Spring 2012)

Exercise 1.12 (Pg )

2. Here is a small part of a data set that describes the fuel economy (in miles per gallon) of 2006 model motor vehicles.

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test March 2014

Section 1.3 Exercises (Solutions)

6.2 Normal distribution. Standard Normal Distribution:

Chapter 3. The Normal Distribution

1.3 Measuring Center & Spread, The Five Number Summary & Boxplots. Describing Quantitative Data with Numbers

Introduction to Environmental Statistics. The Big Picture. Populations and Samples. Sample Data. Examples of sample data

AP Statistics Solutions to Packet 2

Exploratory data analysis (Chapter 2) Fall 2011

a. mean b. interquartile range c. range d. median

Lecture 1: Review and Exploratory Data Analysis (EDA)

a) Find the five point summary for the home runs of the National League teams. b) What is the mean number of home runs by the American League teams?

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

Descriptive Statistics

MEASURES OF VARIATION

The Normal Distribution

consider the number of math classes taken by math 150 students. how can we represent the results in one number?

Center: Finding the Median. Median. Spread: Home on the Range. Center: Finding the Median (cont.)

Variables. Exploratory Data Analysis

Mind on Statistics. Chapter 8

MTH 140 Statistics Videos

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

Diagrams and Graphs of Statistical Data

MATH 103/GRACEY PRACTICE EXAM/CHAPTERS 2-3. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Geostatistics Exploratory Analysis

Mind on Statistics. Chapter 2

Algebra I Vocabulary Cards

Week 1. Exploratory Data Analysis

Descriptive Statistics. Purpose of descriptive statistics Frequency distributions Measures of central tendency Measures of dispersion

Statistics E100 Fall 2013 Practice Midterm I - A Solutions

Final Exam Practice Problem Answers

Mark. Use this information and the cumulative frequency graph to draw a box plot showing information about the students marks.

Chapter 2 Data Exploration

Data Exploration Data Visualization

Exploratory Data Analysis

RECOMMENDED COURSE(S): Algebra I or II, Integrated Math I, II, or III, Statistics/Probability; Introduction to Health Science

HISTOGRAMS, CUMULATIVE FREQUENCY AND BOX PLOTS

Lecture 2: Descriptive Statistics and Exploratory Data Analysis

Thursday, November 13: 6.1 Discrete Random Variables

Chapter 23. Inferences for Regression

+ Chapter 1 Exploring Data

Module 4: Data Exploration

Density Curve. A density curve is the graph of a continuous probability distribution. It must satisfy the following properties:

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Lecture 2. Summarizing the Sample

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Ch. 3.1 # 3, 4, 7, 30, 31, 32

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

Shape of Data Distributions

Dongfeng Li. Autumn 2010

Describing, Exploring, and Comparing Data

Math Mammoth End-of-the-Year Test, Grade 6, Answer Key

3: Summary Statistics

Descriptive Statistics

Statistics 100 Sample Final Questions (Note: These are mostly multiple choice, for extra practice. Your Final Exam will NOT have any multiple choice!

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Descriptive Statistics Practice Problems (Total 6 marks) (Total 8 marks) (Total 8 marks) (Total 8 marks) (1)

AP STATISTICS REVIEW (YMS Chapters 1-8)

Probability Distributions

DESCRIPTIVE STATISTICS. The purpose of statistics is to condense raw data to make it easier to answer specific questions; test hypotheses.

Topic 9 ~ Measures of Spread

THE BINOMIAL DISTRIBUTION & PROBABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

AP Statistics: Syllabus 1

Box-and-Whisker Plots

7. Normal Distributions

Exploratory Data Analysis. Psychology 3256

List of Examples. Examples 319

GeoGebra Statistics and Probability

2: Frequency Distributions

How To Write A Data Analysis

Interpreting Data in Normal Distributions

Classify the data as either discrete or continuous. 2) An athlete runs 100 meters in 10.5 seconds. 2) A) Discrete B) Continuous

Chapter 5 DASH Your Way to Weight Loss

Box-and-Whisker Plots

LAGUARDIA COMMUNITY COLLEGE CITY UNIVERSITY OF NEW YORK DEPARTMENT OF MATHEMATICS, ENGINEERING, AND COMPUTER SCIENCE

STAT 350 Practice Final Exam Solution (Spring 2015)

Statistics Chapter 2

Means, standard deviations and. and standard errors

Algebra II EOC Practice Test

Summarizing and Displaying Categorical Data

STATISTICS 8: CHAPTERS 7 TO 10, SAMPLE MULTIPLE CHOICE QUESTIONS

How Does My TI-84 Do That

Manhattan Center for Science and Math High School Mathematics Department Curriculum

Graphs. Exploratory data analysis. Graphs. Standard forms. A graph is a suitable way of representing data if:

Statistics Revision Sheet Question 6 of Paper 2

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

6.4 Normal Distribution

Transcription:

First Midterm Exam (MATH1070 Spring 2012) Instructions: This is a one hour exam. You can use a notecard. Calculators are allowed, but other electronics are prohibited. 1. [40pts] Multiple Choice Problems In a statistics class with 136 students, the professor records how much money each student has in his or her possession during the first class of the semester. The following histogram is of the data collected. Based on this histogram, answer questions 1) 3). 1) The number of students with under USD 10 in their possession is closest to C A. 50. B. 70. C. 60. D. 40. 2) The percent of students with over USD 20 in their possession is about B A. 10%. B. 20%. C. 30%. D. 40%. 3) From the histogram, which of the following is true? A A. The mean is much larger than the median. B. The mean is much smaller than the median. C. It is impossible to compare the mean and median for these data. D. The mean and median are approximately equal.

A sample was taken of the verbal SAT scores of applicants to a California State College. The following is a boxplot of the scores. Based on this histogram, answer questions 4) and 5). 4) Based on this boxplot, the interquartile range is closest to B A. 500. B. 200. C. 600. D. 400. 5) If 25 points were added to each score, then interquartile range of the new scores would A A. remain unchanged. B. be increased by 5. C. be increased by 25. D. be increased by 625. 6) A Normal density curve has which of the following properties? D A. It has a peak centered above its mean. B. It is symmetric. C. The spread of the curve is proportional to the standard deviation. D. All of the above.

Refer to the following scatterplot For each menu item at a fast food restaurant, fat content (in grams) and number of calories were recorded. A scatterplot of these data is given below. 7) A plausible value for the correlation between calories and fat is A A. +0.9. B. -0.9. C. -1.2. D. +0.2. 8) Which of the following is not true of the correlation coefficient r? D A. 1 r 1. B. If r = 0, then there is no relationship between x and y. C. If r is the correlation between x and y, then r is also the correlation between y and x. D. Multiplying all data values (x s and y s) by 10 will have no impact on r.

2. [12pts] A company produces packets of soap powder labeled Giant Size 32 Ounces. The actual weight of soap powder in such a box has a Normal distribution with a mean of 33 oz and a standard deviation of 0.7 oz. To avoid having dissatisfied customers, the company says a box of soap is considered underweight if it weighs less than 32 oz. To avoid losing money, it labels the top 5% (the heaviest 5%) overweight. 1). What proportion of boxes is underweight (i.e., weigh less than 32 oz)? 2). How heavy does a box have to be for it to be labeled overweight? 1. Let X denote the weight of a box. Then we want to know the proportion of boxes such that X < 32. The corresponding z- score is Z = X 33 32 33 = = 1.43 0.7 0.7 From the table of the standard normal cumulative proportions, we find that the proportion for X < 32 is 0.0764. 2. Let x 0 be the threshold of overweight. Then the proportion corresponding to X x 0 is 5%, or equivalently the proportion corresponding to X < x 0 is 95%. From the table of the standard normal cumulative proportions, we find that the z-score corresponding to 0.95 is 1.645 (both 1.64 and 1.65 are O.K.). Therefore x 0 = 0.7(1.645) + 33 = 34.1515.

3. [10pts] The following are the heights (in inches) of 25 students in a given class. Draw the histogram. 51 53 55 55 57 59 60 60 62 62 63 63 64 66 66 67 68 68 68 69 70 70 72 74 78 Since there are 25 observations, it is suggested to use 25 = 5 bins for our histogram. (It s O.K. to use different number of bins as long as that number is neither too big nor too small.) The range is 78 51 = 27. Thus the bin size should be around 6. In fact, it is more natural to use 6 bins and use bin size 5 here. The following is the frequency table Here is the histogram: bins frequency 50 x < 55 2 55 x < 60 4 60 x < 65 7 65 x < 70 7 70 x < 75 4 75 x < 80 1

4. The following are the grades of 18 students in a given exam. (a) [4pts] Make a stemplot. Here we draw a stemplot with split stems, i.e., the stem 6 represents 60 64 and the stem 6 + represents 65 69. The stemplot is given as follows: 6 03 6 + 9 7 4 7 + 66789 8 23 8 + 568 9 02 9 + 79 (b) [10pts] Find the five-number summary (min, Q1, median, Q3, max). Since there are 18 observations, the median is the average of the 9th and 10th observation, i.e. (79 + 82)/2 = 80.5. Since the median is not an observation in the data set, the lower half is the 9 observation from 60 to 79. Then the first quartile which is the median of the lower half is the 5th observation, which is 76. Similarly, the third quartile is 88. Therefore the five number summary is min Q1 median Q3 max 60 76 80.5 88 99 (c) [6pts] Are there any potential outlier(s) according to the 1.5 IQR rule? We have IQR = Q3 Q1 = 88 76 = 12, and 1.5 IQR = 12(1.5) = 18. Since Q1 18 = 58 < 60 and Q3 + 18 = 106 > 99, there is no outlier according to the 1.5 IQR rule.

5. A student wonders if people of similar heights tend to date each other. She measures herself, her dormitory roommate, and the women in the adjoining rooms; then she measures the next man each woman dates. Here are the data (heights in inches). Women x 66 64 66 Men y 72 68 70 (a) [4pts] What is the mean of the heights of these three women? What about men? We have x = 66 + 64 + 66 3 = 65.333 and ȳ = 72 + 68 + 70 3 = 70 (b) [8pts] Compute the standard deviation of the height for these 3 men by complete the following table. Use your calculator only to add, subtract, multiply, divide, square or take the square root of numbers. y i y i ȳ (y i ȳ) 2 72 2 4 68-2 4 70 0 0 Therefore the standard deviation of y is s y = 1 n 1 (y i ȳ) n 1 2 = (4 + 4 + 0) = 2. 3 1 i=1 Now find the standard deviation of the height for these 3 women by the same procedure. x i x i x (x i x) 2 66 0.667 0.444 64-1.333 1.778 66 0.667 0.444 Therefore the standard deviation of x is s x = 1 n 1 (x i x) n 1 2 = (0.444 + 1.778 + 0.444) = 1.155. 3 1 i=1

(c) [6pts] Find the correlation coefficient r between the height of men and women. r = 1 n 1 = 1 3 1 = 0.866 n ( ) xi x i=1 [( 0.667 1.155 s x ) ( ) yi ȳ ( ) 2 + 2 s y ( 1.333 1.155 ) ( ) 2 + 2 ( ) 0.667 1.155 ( )] 0 2 8