PROBLEM SET 1. For the first three answer true or false and explain your answer. A picture is often helpful.

Similar documents
Name: Date: Use the following to answer questions 3-4:

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

Chapter 7: Simple linear regression Learning Objectives

The normal approximation to the binomial

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

4. Continuous Random Variables, the Pareto and Normal Distributions

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Review #2. Statistics

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

Introduction to Hypothesis Testing

Probability Distributions

Normal distribution. ) 2 /2σ. 2π σ

Statistics 2014 Scoring Guidelines

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Characteristics of Binomial Distributions

Mind on Statistics. Chapter 12

Point and Interval Estimates

Section 6.1 Discrete Random variables Probability Distribution

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

AP STATISTICS 2010 SCORING GUIDELINES

STAT 350 Practice Final Exam Solution (Spring 2015)

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

HYPOTHESIS TESTING WITH SPSS:

Introduction to Hypothesis Testing. Hypothesis Testing. Step 1: State the Hypotheses

Math Quizzes Winter 2009

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

Introduction to Hypothesis Testing OPRE 6301

Probability Distributions

EXAM #1 (Example) Instructor: Ela Jackiewicz. Relax and good luck!

The Math. P (x) = 5! = = 120.

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

TEACHER NOTES MATH NSPIRED

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) B) C) D) 0.

Lecture 5 : The Poisson Distribution

Projects Involving Statistics (& SPSS)

Getting Started with Statistics. Out of Control! ID: 10137

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

3.4 Statistical inference for 2 populations based on two samples

STATISTICS 8, FINAL EXAM. Last six digits of Student ID#: Circle your Discussion Section:

Math 251, Review Questions for Test 3 Rough Answers

Final Exam Practice Problem Answers

Part 3. Comparing Groups. Chapter 7 Comparing Paired Groups 189. Chapter 8 Comparing Two Independent Groups 217

Descriptive Statistics

NONPARAMETRIC STATISTICS 1. depend on assumptions about the underlying distribution of the data (or on the Central Limit Theorem)

Contemporary Mathematics Online Math 1030 Sample Exam I Chapters No Time Limit No Scratch Paper Calculator Allowed: Scientific

Fairfield Public Schools

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Lecture Notes Module 1

Lesson 17: Margin of Error When Estimating a Population Proportion

Study Guide for the Final Exam

CHAPTER 14 NONPARAMETRIC TESTS

WHERE DOES THE 10% CONDITION COME FROM?

Independent samples t-test. Dr. Tom Pierce Radford University

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Chapter 23. Two Categorical Variables: The Chi-Square Test

Simple Regression Theory II 2010 Samuel L. Baker

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Solutions: Problems for Chapter 3. Solutions: Problems for Chapter 3

Chicago Booth BUSINESS STATISTICS Final Exam Fall 2011

Understanding Confidence Intervals and Hypothesis Testing Using Excel Data Table Simulation

Results from the 2014 AP Statistics Exam. Jessica Utts, University of California, Irvine Chief Reader, AP Statistics

8 6 X 2 Test for a Variance or Standard Deviation

Mind on Statistics. Chapter 8

MONT 107N Understanding Randomness Solutions For Final Examination May 11, 2010

DETERMINE whether the conditions for a binomial setting are met. COMPUTE and INTERPRET probabilities involving binomial random variables

The normal approximation to the binomial

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

HYPOTHESIS TESTING: POWER OF THE TEST

Mathematics (Project Maths Phase 1)

socscimajor yes no TOTAL female male TOTAL

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Preparation of Two-Year College Mathematics Instructors to Teach Statistics with GAISE Session on Assessment

GCSE Mathematics (Non-calculator Paper)

Statistics Revision Sheet Question 6 of Paper 2

StatCrunch and Nonparametric Statistics

Notes on Continuous Random Variables

II. DISTRIBUTIONS distribution normal distribution. standard scores

AP STATISTICS (Warm-Up Exercises)

Bellwork Students will review their study guide for their test. Box-and-Whisker Plots will be discussed after the test.

6.4 Normal Distribution

An Introduction to Statistics Course (ECOE 1302) Spring Semester 2011 Chapter 10- TWO-SAMPLE TESTS

Introduction to Statistics for Psychology. Quantitative Methods for Human Sciences

Thursday, November 13: 6.1 Discrete Random Variables

Question: What is the probability that a five-card poker hand contains a flush, that is, five cards of the same suit?

Chapter 4. iclicker Question 4.4 Pre-lecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Pre-lecture

What is a Box and Whisker Plot?

5/31/ Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

Section 7.1. Introduction to Hypothesis Testing. Schrodinger s cat quantum mechanics thought experiment (1935)

Math 201: Statistics November 30, 2006

The power of a test is the of. by using a particular and a. value of the that is an to the value

Transcription:

PROBLEM SET 1 For the first three answer true or false and explain your answer. A picture is often helpful. 1. Suppose the significance level of a hypothesis test is α=0.05. If the p-value of the test statistic is p-value = 0.07, then the null hypothesis (H 0 ) should be rejected. False. We reject the null hypothesis when the p-value is less than α. But 0.07 > 0.05 so we fail to reject H 0. 2. If we fail to reject the null hypothesis (H 0 ) that means that the test statistics was not in the rejection region. True. We only reject the null hypothesis if the test statistic is in the rejection region (also called critical region). 3. TRUE OR FALSE: If we fail to reject the null hypothesis (H 0 ) at a significance level of α=0.05, then we also must fail to reject it at a significance level of α=0.10. False. Consider the p-values. If we fail to reject H 0, then the p-value must be greater than α; so if α=0.05, then the p-value > 0.05. However, this does not necessarily imply that the p-value must also be greater than 0.10. For example if the p-value = 0.08, then we would fail to reject H 0 at the significance level of α=0.05 since 0.08 > 0.05, but we would reject H 0 at the significance level of α = 0.10 since 0.08 < 0.10. 4. Suppose that we do reject the null hypothesis at a significance level of α=0.05, but we do not reject the null hypothesis at a significance level of α=0.01. What can you say about the p-value of the test statistic? This means that the p-value is less than 0.05 but greater than 0.01 (0.01 < p-value < 0.05) 5. We defined a value as unusual if it was more than 2 standard deviations from the mean. Suppose we used this definition to define a critical region for a two-sided hypothesis test (in other words we define our critical region to be z < -2 or z > 2). What would be the significance level of the test? -2 2 The significance level α is the area under the curve in the critical region: P(Z< -2) + P(Z > 2) = 2*(0.0228) = 0.0456.

PROBLEM SET 2 1. A multiple choice test has 5 answers per question (a,b,c,d,e). Suppose you randomly guess on each question. If you get the question correct, you get 5 points, if you get the question wrong, you lose 2 points. If there are 3 questions, what is your expected point total for the test? There are 2 ways you could think about this problem. STRATEGY 1: The probability of randomly getting a problem correct is p=0.2. This is a binomial problem, X~ b(3,0.2). First figure out the possible point values you might get and the probability of each outcome: Number Correct (+5 each) 0 1 2 3 Number wrong (-2 each) 3 2 1 0 Total Points (x) -6 1 8 15 Probability of outcome (used binomial table (n=3,p=0.2)) (p(x)) 0.512 0.384 0.096 0.008 The expected value is the sum of the xp(x) which is: -6(0.512)+1(0/384)+8(0.096)+15(0.008) = -1.8 You would expect to have -1.8 points if you randomly guessed each answer. STRATEGY 2: The expected number correct is μ=np = 3 (0.2) = 0.6. So you expect to get 0.6 problems correct and 3-0.6 = 2.4 problems wrong. If you get the problem correct you get 5 points. If you get it wrong you get -2 points so your expected number of points is 5(0.6) -2(2.4) = -1.8 points 2. A 5 card hand is dealt at random from a standard 52 card deck. a. What is the probability that you will get all red cards? b. What is the probability of getting two 10 s and three face cards? There are 4 C 2 = 6 ways to choose two 10 s and 12 C 3 =220 was to choose 3 face cards. Since we want both we multiply these values together to get 6*220 = 1320 ways to get two 10 s and three face cards. Hence the probability is:

3. I was not paying attention to where I was walking and I stepped on a very large bag of tortilla chips. The probability that any given chip in the bag was broken is p=0.25. Suppose I reach in and grab 10 chips. Let X be the number of cracked chips. a. Explain why X is binomial. What is n? p? Even though we are sampling without replacement, we can consider the trials independent since the number of chips I am choosing (10) is likely less than 5% of the whole population of chips since I said it was a very large bag of chips. The outcome of each trial is only one of two choices: a success (cracked chip) or failure (not cracked). The probability of success for each trial is 0.25. n=10, p = 0.25 b. What is the probability that at most 3 of my chips are broken? (Note you can t use the tables unless the exact values for p are listed don t average, use the formula if you can t use the table) P(X 3) = P(0)+P(1)+P(2)+P(3) = + c. What is the probability that at least 4 of my chips are broken? P(X 4) = 1 P(X 3) = 1 0.7759 = 0.2241 (I used the result for P(X 3) from the last problem.

PROBLEM SET 3 A disgruntled customer complained to a candy bar company that although the label weight of the candy bar was 58.7 grams, it actually weighed less. The company claims that the weights of the candy bars follow a normal distribution with mean μ = 58.7 grams and standard deviation σ=0.5 grams. 1. Assuming the company s claim is true, what is the probability that a given candy bar will weigh less than 58 grams? 2. The customer bought 20 candy bars. Find the probability that the average weight,, was less than 58.5 grams. (Note this is AVERAGE weight so we use the central limit theorem to find the standard deviation) 3. Suppose that the customer s 20 candy bars had an average weight of grams. Do a hypothesis test to determine if his claim that the average is less than 58.7 grams is valid. State the null and alternate hypothesis. Use a significance level of α=0.05 and clearly state your conclusion. H 0 : μ=58.7 grams H 1 : μ<58.7 grams This is a left-tailed test with significance level α=0.05. The critical value is z α = -1.645 and the critical region is z -1.645. The test statistic is Since z < -1.645 we reject H 0. Our conclusion is that the data supports the claim that the mean weight of the candy bars is less than 58.7 grams. 4. Again suppose that the customer s 20 candy bars had an average weight of. Find a 90% percent confidence interval for the true mean, μ, of the candy bars. Assume σ=0.5 grams. z α = 1.645 so. The confidence interval for μ is (58.32,58.68). This says that I am 90% confident that the mean weight of the candy bars is between 58.32 grams and 58.69 grams. 5. Explain clearly why your confidence interval in 4 supports the result you got in 3. My hypothesis result in (3) supported the claim that the true mean was less than 58.7 grams. This is supported by my confidence interval since all values in my interval are less than 58.7 grams.

PROBLEM SET 4 1. Why is this graph misleading? The bars are not proportional. For example it looks like Emily has a bowling average over twice as much as Hilde, but really it is only about 12 points higher. The true differences would be more accurately represented if the y-axis started at 0. 2. Some students are comparing their scores on an exam when their friend walks in with a lower score than the min, in this case the RANGE: a. Decreases b. Increases c. Stays the Same d. None of the Above 3. The test score distribution for a class of about 50 students is approximately normal. The teacher finds four papers that she misplaced that skew the data more to the left. This means the four scores, compared to the rest of the class : a. Are much higher than the mean b. Are much lower than the mean c. Are about the same as the mean d. None of the Above 4. The test score distribution for a class of about 50 students is approximately normal. The teacher finds four papers that she misplaced decrease the mean test score. This means the average of the four scores, compared to the rest of the class: a. Are higher than the mean b. Are lower than the mean c. Are about the same as the mean 5. The test score distribution for a class of about 50 students is approximately normal. The teacher finds four papers that she misplaced increase the median test score. This means the four scores, compared to the rest of the class: a. Are higher than the median b. Are lower than the median c. Are about the same as the median

6. Suppose you go to Fry s Electronics to buy a CD. If the CD is marked down 20% to $11.20, what was the original price of the CD? a. $8.96 b. $11.40 c. $14.00 d. $12.44 7. If the CD you bought at Fry s (from Question 6) is then marked back up to its original price from its discounted price of $11.20, what was the percentage increase from the discounted price to the original price. (Remember the CD was originally marked down 20%.) a. 20% b. 25% c. 28% d. 30% e. 33 1/3 % This information is for the next three problems Use the box-and-whisker plots to compare the starting salaries for first year graduates in education and engineering. Starting salaries in $ thousands 25 30 35 40 45 50 55 60 65 Engineering Salaries Education Majors 8. If there were 504 engineering majors; how many of them had a starting salary $50,000 or more? a. 378 b. 252 c. 126 d. Can t be determined 9. If there were 1200 education majors; what percent of them made $30,000 or less? a. 25 b. 300 c. 75 d. Can t be determined 10. Why is one part of the box in the plot for education salaries longer than the other part? a. There is more data between the upper part than the lower part. b. The data in the lower part is spread out more. c. There is less data between the upper part than the lower part. d. The data in the upper part is more spread out.