Sections 9.1 and 9.2 HYPOTHESIS TESTS FOR PROPORTIONS

1 Sections 9.1 and 9.2 HYPOTHESIS TESTS FOR PROPORTIONS

2 Inferential Statistics. Two important features: (1) Information is obtained from a sample. (2) This information is used to draw a conclusion (an inference) about the entire population from which the sample was drawn. Two major types: using confidence intervals, and using hypothesis tests.

3 Confidence Interval vs. Hypothesis Test. We use a confidence interval to indicate what we believe the population proportion is. This is an estimate. We use a hypothesis test to indicate what we believe the proportion is not. Usually this is a challenge to a claim. The difference is in the emphasis: "is" vs. "is not".

4 Example. Survey question: "Do you believe that the amount of taxes you pay is fair?" 54% responded yes, with a margin of error of ±4.2%. We build the confidence interval (49.8%, 58.2%). From the same survey, we could design a hypothesis test for each of the following questions. Do you believe 60% of Americans believe their tax bill is fair? I believe the proportion is not 60%. Do you believe 50% of Americans believe their tax bill is fair? I believe the proportion could be 50%.

5 A Hypothesis Testing Way of Thinking. Claim: the population proportion is 60%. Result of survey: the sample proportion was 54%. Conclusion: I believe the claim is not correct. Claim: the population proportion is 50%. Result of survey: the sample proportion was 54%. Conclusion: I believe the claim could be correct.
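
The two conclusions above can be mimicked by asking whether each claimed value falls inside the confidence interval from the tax survey. This is a minimal Python sketch, not the formal test developed in later slides; the function names `confidence_interval` and `claim_is_plausible` are hypothetical and do not come from the slides.

```python
def confidence_interval(p_hat, margin):
    """Return (low, high) for a sample proportion and its margin of error."""
    return (p_hat - margin, p_hat + margin)


def claim_is_plausible(claim, interval):
    """Informal check: is the claimed proportion inside the interval?"""
    low, high = interval
    return low <= claim <= high


# Survey result from the slides: 54% yes, margin of error 4.2 points.
interval = confidence_interval(0.54, 0.042)

for claim in (0.60, 0.50):
    if claim_is_plausible(claim, interval):
        verdict = "could be correct"
    else:
        verdict = "is not correct"
    print(f"Claim p = {claim:.2f}: I believe the claim {verdict}")
```

The claim of 60% lies outside (49.8%, 58.2%) and the claim of 50% lies inside, matching the two conclusions on the slide.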

6 Another Example Setup. If we roll a pair of fair dice, the total on the two dice ranges from 2 to 12. The probability of totaling 7 is 6/36 = 1/6 ≈ 0.1667. If the dice are loaded, this probability can be changed. State gaming commissions inspect casino equipment, including dice. This is particularly important when machines are simulating dice, cards, etc. Claim: The casino claims that two dice are fair, i.e., that the probability of totaling 7 is 16.67%.
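
The value 6/36 can be confirmed by listing all 36 equally likely outcomes for two fair dice and counting those that total 7. This is a short Python sketch; the variable names are illustrative only.

```python
from fractions import Fraction
from itertools import product

# All ordered outcomes (first die, second die) for two six-sided dice.
outcomes = list(product(range(1, 7), repeat=2))

# Outcomes whose total is 7: (1,6), (2,5), (3,4), (4,3), (5,2), (6,1).
sevens = [roll for roll in outcomes if sum(roll) == 7]

p_seven = Fraction(len(sevens), len(outcomes))
print(len(sevens), "of", len(outcomes), "outcomes total 7")
print("P(total is 7) =", p_seven, "which is about", round(float(p_seven), 4))
```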

7 Our Experiment. Roll the dice many times. If the proportion of 7s is not close to 1/6, we have evidence that the probability is not 1/6, and we will reject the claim. If the proportion of 7s is close to 1/6, we acknowledge that the claim could be true.

8 Our Experiment. Roll the dice many times. If the proportion of 7s is not close to 1/6, we have evidence that the probability is not 1/6, and we will reject the claim. If the proportion of 7s is close to 1/6, we acknowledge that the claim could be true. Use the applet at keepordiscarddicepart1.html
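
For readers without access to the applet, the experiment can be imitated with a small simulation. The sketch below is not the applet's code; it assumes fair dice, and the function name `proportion_of_sevens` and the seed are arbitrary choices made for illustration.

```python
import random


def proportion_of_sevens(n_rolls, rng):
    """Roll two fair dice n_rolls times; return the fraction of totals equal to 7."""
    sevens = 0
    for _ in range(n_rolls):
        if rng.randint(1, 6) + rng.randint(1, 6) == 7:
            sevens += 1
    return sevens / n_rolls


rng = random.Random(2024)  # fixed seed (arbitrary) so reruns give the same rolls
for n in (36, 360, 3600, 36000):
    print(f"{n:>6} rolls: proportion of 7s = {proportion_of_sevens(n, rng):.4f}")
```

With fair dice, the proportion tends to settle near 1/6 as the number of rolls grows, which is why the number of rolls matters when judging what counts as "close".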

9 Are the dice fair or loaded? To answer this question by experiment, we make two choices. How many rolls should we use to test the claim? (sample size) How close should the sample proportion be to 1/6 for us to believe the population proportion could be 1/6? (measure of closeness)

10 Example 2. Suppose we think the casino is cheating by using dice that do not sum to seven as often as they should. We collect data on 1000 dice rolls and find that 153 of them sum to seven. Is this enough evidence to accuse them of cheating?

11 Are the dice fair or loaded? To answer this question by experiment, we make two choices. How many rolls? (sample size) How close should we be to 1/6? (measure of closeness) Two different ways to be correct, and two ways to be incorrect.

12 Types of Errors

13 Connection to Criminal Trials

14 Part II: The Logic of Hypothesis Testing

15 Logic of Hypothesis Testing: Dice Example. Using the dice example, the claim is that the population proportion is 1/6. There are two possible conclusions: (1) The sample proportion was not close to 1/6, so we reject the claim. (2) The sample proportion was close to 1/6, so we do not reject the claim.

16 The Logic of Hypothesis Testing: Unusual Data To be more precise about what is unusual, we use z-scores and P-values. The sample value p-hat being unusual means we would not expect to have such a sample value given the claimed value for p.

17 Unusual Observations. Adult male heights are normally distributed with a mean of 70 inches and a standard deviation of 4 inches. Consider these two men. Sam is 79 inches tall (z-score is 2.25; corresponds to a cumulative area of 0.9878 in Table A). Joe is 61 inches tall (z-score is -2.25; corresponds to a cumulative area of 0.0122 in Table A). For a given person, we can think of unusual in two ways. Sam is unusually tall: he is in the rarest 1.22% of tall people. Joe is unusually short: he is in the rarest 1.22% of short people. Both have unusual height: they are in the rarest 2.44%.

18 P-Values. The P-value is a measure of just how unusual the data is, in terms of probability. Recall that Sam is unusually tall (he is in the rarest 1.22% of tall people), Joe is unusually short (he is in the rarest 1.22% of short people), and both have unusual height (they are in the rarest 2.44%). This can be restated as follows. Sam's one-tail (right-tail) P-value is 0.0122. Joe's one-tail (left-tail) P-value is 0.0122. Either one of them has a two-tail P-value of 0.0244.
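
The z-scores and tail areas for Sam and Joe can be reproduced without Table A. The sketch below uses Python's standard-library `statistics.NormalDist`; the helper `z_score` and the variable names are illustrative, not part of the slides.

```python
from statistics import NormalDist

# Adult male heights from the slide: mean 70 inches, standard deviation 4 inches.
heights = NormalDist(mu=70, sigma=4)


def z_score(x, dist):
    """Number of standard deviations x lies above (+) or below (-) the mean."""
    return (x - dist.mean) / dist.stdev


sam, joe = 79, 61

sam_right_tail = 1 - heights.cdf(sam)      # area to the right of Sam
joe_left_tail = heights.cdf(joe)           # area to the left of Joe
two_tail = sam_right_tail + joe_left_tail  # both extremes together

print("Sam: z =", z_score(sam, heights), " right-tail P =", round(sam_right_tail, 4))
print("Joe: z =", z_score(joe, heights), " left-tail P =", round(joe_left_tail, 4))
print("Two-tail P =", round(two_tail, 4))
```

Each one-tail area rounds to 0.0122 and the two-tail area to 0.0244, the 1.22% and 2.44% quoted on the slides.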

19 Graphical Depiction of P-Values

20 Components of a Hypothesis Test. (1) Claim to be investigated. (2) Hypothetical sampling distribution based on the claim. (3) Calculations based on the sample. (4) Measure of closeness.

21 Claim to be investigated. Claim: The population proportion is $p_0$ (a particular value known in advance). We will reject this claim if we obtain evidence that the population proportion is not equal to this value, either smaller or larger. But wait, a sample value will rarely be *exactly* the population value, so when should we reject?

22 Sampling Distribution. Recall that if we look at the sample proportions for many, many samples of the same size, the resulting values have an approximately normal distribution with mean = $p$ (where $p$ is the population proportion) and standard deviation (called standard error) = $\sqrt{p(1-p)/n}$.

23 Sampling Distribution (cont.). We do not know the value of $p$. However, we know that it is claimed to be $p_0$, so we can build a hypothetical distribution. Thus, we use $p_0$ in our calculations: mean = $p_0$ and standard deviation (called standard error) = $\sqrt{p_0(1-p_0)/n}$.
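
As a worked instance of the standard error formula, take the dice claim $p_0 = 1/6$ together with the $n = 1000$ rolls mentioned in Example 2. Pairing these two numbers here is an illustrative choice, not a calculation shown on the slides.

```latex
\[
\text{se} = \sqrt{\frac{p_0(1-p_0)}{n}}
          = \sqrt{\frac{(1/6)(5/6)}{1000}}
          \approx \sqrt{0.000139}
          \approx 0.0118
\]
```

Under the claim, a sample proportion that is about 0.012 away from 1/6 is therefore roughly one standard error from the claimed value.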

24 Calculations based on the sample. We obtain a simple random sample and compute the sample proportion $\hat{p}$. From the sampling distribution we know what to expect if the claim is true: $\hat{p}$ should be close to $p_0$.

25 Measure of closeness. Reasoning: If the claim is true, the sample proportion should not be unusually large or small. The smaller the P-value, the more unusual the sample. The P-value is literally the probability that a $\hat{p}$ would be at least this far from the mean, within the framework of our hypothetical distribution.

26 Measure of closeness. Reasoning: If the claim is true, the sample proportion should not be unusually large or small. The smaller the (two-tail) P-value, the more unusual the sample.

27 Measure of closeness (cont.). But how small is small? This decision should be made in advance, prior to taking the sample, and it varies depending on the situation. For example, we might decide that small will mean less than 0.05. So, we reject the claim if our sample is in the most unusual 5% of all possible samples.

28 Conclusion. We reject the claim if the calculated P-value is less than the chosen value. Otherwise, we do not reject the claim. Recall: We calculate a P-value using the z-score and Table A or normalcdf(). For mean and standard deviation we use: mean = $p_0$ and standard deviation (called standard error) = $\sqrt{p_0(1-p_0)/n}$.

29 An Example. Consider the applet again and let's do a hypothesis test using 0.05 as our cut-off. (p. 12) Calculations on board. Were we correct? What type of error might we have made?

30 Part III: Terminology and Two-Tail Tests

31 Our Assumptions. We are taking a simple random sample. We expect a normal sampling distribution. For this we need our sample size $n$ to satisfy both of the following: $np_0 \ge 15$ and $n(1-p_0) \ge 15$. In other words, if you think in terms of a yes/no survey question, you need to reasonably expect at least 15 yeses and at least 15 noes.
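
The sample-size condition is easy to check before collecting data. This is a minimal Python sketch; the function name `large_sample_ok` and its `minimum` parameter are hypothetical, with the threshold of 15 taken from the slide.

```python
def large_sample_ok(n, p0, minimum=15):
    """Check that both expected counts under the claim reach the minimum."""
    expected_yes = n * p0
    expected_no = n * (1 - p0)
    return expected_yes >= minimum and expected_no >= minimum


# Dice claim p0 = 1/6: 1000 rolls are plenty, 60 rolls are not (only 10 expected 7s).
print(large_sample_ok(1000, 1/6))
print(large_sample_ok(60, 1/6))
```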

32 The Null Hypothesis. The null hypothesis is the claim that is to be investigated. (This gives us our hypothetical sampling distribution.) The claim is that the population proportion is equal to some value $p_0$. We use the notation $H_0: p = p_0$.

33 The Alternative Hypothesis. The alternative hypothesis is the conclusion we will reach if we reject the null hypothesis. For a two-tail P-value test, we use the notation $H_a: p \ne p_0$.

34 An Example. Recall the example when we had two dice and the casino claimed that the dice are fair, i.e., the probability of totaling 7 is 16.67%. The null hypothesis: $H_0: p = 1/6 \approx 0.1667$. The alternative hypothesis: $H_a: p \ne 1/6$.

35 Significance Level. If the P-value of the sample proportion is less than a pre-specified cutoff, then we reject the claim. We have used 0.05 for this cutoff. This cutoff is called the significance level, and is denoted by α. So we might set α = 0.05 or α = 0.01.

36 Conclusion of Hypothesis Test. If the P-value of the sample proportion is less than α, reject the null hypothesis and conclude the alternative hypothesis is true (P-value < α: reject $H_0$). Otherwise, fail to reject the null hypothesis: it might be true, and there is not enough evidence to conclude that the alternative hypothesis is true (P-value ≥ α: fail to reject $H_0$).

37 Hypothesis-Testing Steps. 1. Write the null and alternative hypotheses. The null hypothesis: $H_0: p = p_0$. The alternative hypothesis: $H_a: p \ne p_0$. 2. Calculate $\hat{p}$ from the data and find the z-score (test statistic). Remember how?

38 Steps (Step 2 details). We get z-scores by $z = (\hat{p} - p_0)/se$, where for mean and standard deviation we use: mean = $p_0$ and standard deviation (called standard error) $se = \sqrt{p_0(1-p_0)/n}$.

39 Steps. 1. Write the null and alternative hypotheses. The null hypothesis: $H_0: p = p_0$. The alternative hypothesis: $H_a: p \ne p_0$. 2. Calculate $\hat{p}$ from the data and find the z-score. 3. From the z-score, get a P-value. Remember how?

40 Steps (Step 3 details). Given a z-score, use Table A to calculate the tail area. Always DOUBLE the area for a two-sided test. This total area is the P-value for your sample.

41 Steps (Step 3 details). Given a z-score, we could instead use normalcdf to calculate the area: total area = P-value = 1 - normalcdf(lower, upper).
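
The slides rely on a calculator's normalcdf. For readers working in software, the sketch below defines a stand-in with the same (lower, upper) argument order using Python's `statistics.NormalDist`. It is an assumed imitation for illustration, not the calculator's implementation, and taking lower = -|z| and upper = |z| is one reading of the slide's formula.

```python
from statistics import NormalDist


def normalcdf(lower, upper, mean=0.0, sd=1.0):
    """Area under a normal curve between lower and upper."""
    dist = NormalDist(mean, sd)
    return dist.cdf(upper) - dist.cdf(lower)


# Two-tail P-value for z = 2.25 (Sam's z-score from the heights example):
# everything outside the central band from -|z| to |z|.
z = 2.25
two_tail_p = 1 - normalcdf(-abs(z), abs(z))
print(round(two_tail_p, 4))

# Left-tail area, using a very negative lower bound as the slides do.
left_tail = normalcdf(-100, -1.69)
print(round(left_tail, 4))
```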

42 Steps. 1. Write the null and alternative hypotheses. The null hypothesis: $H_0: p = p_0$. The alternative hypothesis: $H_a: p \ne p_0$. 2. Based on sample size, find the s.e. for the hypothetical distribution. 3. Calculate $\hat{p}$ from the data and find the z-score. 4. From the z-score, get a P-value. 5. Decision time: Is P-value < α? Yes: reject $H_0$. No: fail to reject $H_0$.
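
The five steps can be sketched as one short function. Here it is applied to the dice data from Example 2 (153 sevens in 1000 rolls) with a two-sided alternative, which is an illustrative pairing rather than a calculation shown on the slides. The function name `two_sided_proportion_test` and the returned dictionary keys are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def two_sided_proportion_test(successes, n, p0, alpha=0.05):
    """Two-tail z-test of H0: p = p0 against Ha: p != p0."""
    se = sqrt(p0 * (1 - p0) / n)             # step 2: s.e. under the claim
    p_hat = successes / n                    # step 3: sample proportion ...
    z = (p_hat - p0) / se                    # ... and its z-score
    p_value = 2 * NormalDist().cdf(-abs(z))  # step 4: double the tail area
    reject = p_value < alpha                 # step 5: compare with alpha
    return {"se": se, "p_hat": p_hat, "z": z, "p_value": p_value, "reject": reject}


# Step 1: H0: p = 1/6, Ha: p != 1/6.
result = two_sided_proportion_test(successes=153, n=1000, p0=1/6)
for name, value in result.items():
    print(name, "=", value)
```

For these numbers the z-score is about -1.16 and the two-tail P-value is about 0.25, so at α = 0.05 the claim of fair dice is not rejected.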

43 Could We Have Made an Error? Yes. If we rejected $H_0$ but in fact $H_0$ is true, that is a Type I error. If we failed to reject $H_0$ when $H_0$ was not true, that is a Type II error.
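
A simulation can make the Type I error concrete: if the dice really are fair and we test at α = 0.05, roughly 5% of samples should still lead us to reject the true claim. The sketch below is an illustration under stated assumptions. Each roll is modeled as a yes/no event with probability 1/6 rather than as two separate dice, and the sample size of 600, the 1000 repetitions, and the seed are arbitrary choices.

```python
import random
from math import sqrt
from statistics import NormalDist


def rejects_fair_claim(n_rolls, rng, p0=1/6, alpha=0.05):
    """Simulate n_rolls from truly fair dice, then run the two-tail test of H0: p = p0."""
    sevens = sum(1 for _ in range(n_rolls) if rng.random() < p0)
    z = (sevens / n_rolls - p0) / sqrt(p0 * (1 - p0) / n_rolls)
    p_value = 2 * NormalDist().cdf(-abs(z))
    return p_value < alpha


rng = random.Random(7)  # fixed seed (arbitrary) for repeatability
trials = 1000
rejections = sum(rejects_fair_claim(600, rng) for _ in range(trials))
type_i_rate = rejections / trials
print("Fraction of samples that wrongly rejected a true claim:", type_i_rate)
```

The observed rate should land near α, though not exactly on it, because of random variation and because counts of 7s are whole numbers.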

44 One-Tail Test. If a researcher, prior to taking the sample, has reason to believe that the proportion in the null hypothesis is too low, then $H_a: p > p_0$. In this case, we use the one-tail (right-tail) P-value of the sample proportion.

45 One-Tail Test. (Similarly) If a researcher, prior to taking the sample, has reason to believe that the proportion in the null hypothesis is too high, then $H_a: p < p_0$. In this case, we use the one-tail (left-tail) P-value of the sample proportion.
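
The only thing that changes between the three alternatives is which tail area is reported. The sketch below shows this; the function name `p_value` and the labels "greater", "less" and "two-sided" are hypothetical conventions chosen for illustration. The z-score of -1.16 is the approximate value for 153 sevens in 1000 rolls under the claim $p_0 = 1/6$.

```python
from statistics import NormalDist


def p_value(z, alternative):
    """Tail area matching the alternative hypothesis."""
    std_normal = NormalDist()
    if alternative == "greater":    # Ha: p > p0, right tail
        return 1 - std_normal.cdf(z)
    if alternative == "less":       # Ha: p < p0, left tail
        return std_normal.cdf(z)
    if alternative == "two-sided":  # Ha: p != p0, both tails
        return 2 * std_normal.cdf(-abs(z))
    raise ValueError(f"unknown alternative: {alternative!r}")


z = -1.16
for alt in ("less", "greater", "two-sided"):
    print(f"{alt:>9}: P-value = {p_value(z, alt):.4f}")
```

If the suspicion, stated before sampling, is that 7s come up less often than they should, the left-tail value (about 0.12) is the relevant one, and it is still above 0.05.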

46 Worksheet - Example 1. The proportion of smokers among persons who graduated from a four-year college has been widely reported as 22%. A sociologist wonders if this is still true. a. Identify the population and parameter $p$ the sociologist wants to study. What is the null hypothesis? b. What is the alternative hypothesis? (I.e., what does the sociologist wonder?)

47 Worksheet - Example 1. The proportion of smokers among persons who graduated from a four-year college has been widely reported as 22%. A sociologist wonders if this is still true. a. Identify the population and parameter $p$ the sociologist wants to study. What is the null hypothesis? Answer: $p$ = proportion of graduates of four-year colleges who smoke; $H_0: p = 0.22$. b. What is the alternative hypothesis? (I.e., what does the sociologist wonder?) Answer: $H_a: p \ne 0.22$.

48 d. The sociologist plans to sample 785 college graduates. Her investigation will therefore be based on a hypothetical normal distribution with mean ____ and standard error (se) ____. How do you know it's a normal distribution?

49 d. The sociologist plans to sample 785 college graduates. Her investigation will therefore be based on a normal distribution with mean 0.22 and standard error (se) $\sqrt{0.22(0.78)/785} \approx 0.0148$. How do you know it's a normal distribution? Because $785 \times 0.22 = 172.7 \ge 15$ and $785 \times 0.78 = 612.3 \ge 15$.

50 Some time after setting up the claim to be investigated, the sociologist surveys a random sample of 785 college graduates and finds that 153 are smokers. e. Find $\hat{p}$ from the data and draw a normal curve showing where this sample falls in the distribution of all possible samples. Shade all the samples that are further away from the claim than this sample, consistent with $H_a$.

51 Some time after setting up the claim to be investigated, the sociologist surveys a random sample of 785 college graduates and finds that 153 are smokers. f. Calculate the (two-tail) P-value for this sample. (Remember, the P-value is the measure of how unusual this sample is. Technically, it's the probability, assuming the claim is true, of getting a random sample this far or further away from the center of the hypothetical distribution.)

52 Some time after setting up the claim to be investigated, the sociologist surveys a random sample of 785 college graduates and finds that 153 are smokers. f. Calculate the (two-tail) P-value for this sample. (Remember, the P-value is the measure of how unusual this sample is. Technically, it's the probability, assuming the claim is true, of getting a random sample this far or further away from the center of the hypothetical distribution.) Z-score: $z = (\hat{p} - p_0)/se \approx -1.69$. P-value: Go to Table A. Find the area to the left of z = -1.69 and double it. Get 0.0455 × 2 = 0.0910. Or use normalcdf(-100, -1.69) and double it.

53 (g, h, i). The researcher decided in advance to use a cutoff of α = 0.05 in making her decision of whether or not to reject the claim. What is her decision, based on the P-value you calculated? Choose the correct answer: She did find enough evidence to conclude that $p \ne 0.22$, OR she did not find enough evidence to conclude that $p \ne 0.22$.
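
The worksheet arithmetic can be checked line by line in software. The sketch below uses the numbers from the worksheet (785 graduates, 153 smokers, claimed proportion 0.22, α = 0.05); the variable names are illustrative. Because it carries full precision, its z-score comes out near -1.70 rather than the -1.69 obtained on the slides from rounded inputs, and the P-value shifts slightly as a result.

```python
from math import sqrt
from statistics import NormalDist

n, smokers, p0, alpha = 785, 153, 0.22, 0.05

se = sqrt(p0 * (1 - p0) / n)             # part d: about 0.0148
p_hat = smokers / n                      # part e: about 0.195
z = (p_hat - p0) / se                    # part f: about -1.70 before rounding
p_value = 2 * NormalDist().cdf(-abs(z))  # two-tail area, about 0.09

if p_value < alpha:
    decision = "reject H0"
else:
    decision = "fail to reject H0"

print(f"se = {se:.4f}, p_hat = {p_hat:.4f}, z = {z:.2f}, P-value = {p_value:.4f}")
print("Decision at alpha = 0.05:", decision)
```

Either way the P-value is larger than 0.05, so the sample does not provide enough evidence to conclude that $p \ne 0.22$.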

Point and Interval Estimates

Point and Interval Estimates Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

Review. March 21, 2011. 155S7.1 2_3 Estimating a Population Proportion. Chapter 7 Estimates and Sample Sizes. Test 2 (Chapters 4, 5, & 6) Results

Review. March 21, 2011. 155S7.1 2_3 Estimating a Population Proportion. Chapter 7 Estimates and Sample Sizes. Test 2 (Chapters 4, 5, & 6) Results MAT 155 Statistical Analysis Dr. Claude Moore Cape Fear Community College Chapter 7 Estimates and Sample Sizes 7 1 Review and Preview 7 2 Estimating a Population Proportion 7 3 Estimating a Population

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Unit 26 Estimation with Confidence Intervals

Unit 26 Estimation with Confidence Intervals Unit 26 Estimation with Confidence Intervals Objectives: To see how confidence intervals are used to estimate a population proportion, a population mean, a difference in population proportions, or a difference

More information

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1. Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.

More information

Chapter 8 Section 1. Homework A

Chapter 8 Section 1. Homework A Chapter 8 Section 1 Homework A 8.7 Can we use the large-sample confidence interval? In each of the following circumstances state whether you would use the large-sample confidence interval. The variable

More information

AMS 5 CHANCE VARIABILITY

AMS 5 CHANCE VARIABILITY AMS 5 CHANCE VARIABILITY The Law of Averages When tossing a fair coin the chances of tails and heads are the same: 50% and 50%. So if the coin is tossed a large number of times, the number of heads and

More information

Introduction to Hypothesis Testing OPRE 6301

Introduction to Hypothesis Testing OPRE 6301 Introduction to Hypothesis Testing OPRE 6301 Motivation... The purpose of hypothesis testing is to determine whether there is enough statistical evidence in favor of a certain belief, or hypothesis, about

More information

p ˆ (sample mean and sample

p ˆ (sample mean and sample Chapter 6: Confidence Intervals and Hypothesis Testing When analyzing data, we can t just accept the sample mean or sample proportion as the official mean or proportion. When we estimate the statistics

More information

MATH 140 Lab 4: Probability and the Standard Normal Distribution

MATH 140 Lab 4: Probability and the Standard Normal Distribution MATH 140 Lab 4: Probability and the Standard Normal Distribution Problem 1. Flipping a Coin Problem In this problem, we want to simualte the process of flipping a fair coin 1000 times. Note that the outcomes

More information

Probability Distributions

Probability Distributions Learning Objectives Probability Distributions Section 1: How Can We Summarize Possible Outcomes and Their Probabilities? 1. Random variable 2. Probability distributions for discrete random variables 3.

More information

Chapter 5 Analysis of variance SPSS Analysis of variance

Chapter 5 Analysis of variance SPSS Analysis of variance Chapter 5 Analysis of variance SPSS Analysis of variance Data file used: gss.sav How to get there: Analyze Compare Means One-way ANOVA To test the null hypothesis that several population means are equal,

More information

Chapter 23 Inferences About Means

Chapter 23 Inferences About Means Chapter 23 Inferences About Means Chapter 23 - Inferences About Means 391 Chapter 23 Solutions to Class Examples 1. See Class Example 1. 2. We want to know if the mean battery lifespan exceeds the 300-minute

More information

Sample Size and Power in Clinical Trials

Sample Size and Power in Clinical Trials Sample Size and Power in Clinical Trials Version 1.0 May 011 1. Power of a Test. Factors affecting Power 3. Required Sample Size RELATED ISSUES 1. Effect Size. Test Statistics 3. Variation 4. Significance

More information

The Normal Distribution

The Normal Distribution Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

More information

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) Limitations of the t-test Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

More information

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples

Comparing Two Groups. Standard Error of ȳ 1 ȳ 2. Setting. Two Independent Samples Comparing Two Groups Chapter 7 describes two ways to compare two populations on the basis of independent samples: a confidence interval for the difference in population means and a hypothesis test. The

More information

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph.

c. Construct a boxplot for the data. Write a one sentence interpretation of your graph. MBA/MIB 5315 Sample Test Problems Page 1 of 1 1. An English survey of 3000 medical records showed that smokers are more inclined to get depressed than non-smokers. Does this imply that smoking causes depression?

More information

Lesson 17: Margin of Error When Estimating a Population Proportion

Lesson 17: Margin of Error When Estimating a Population Proportion Margin of Error When Estimating a Population Proportion Classwork In this lesson, you will find and interpret the standard deviation of a simulated distribution for a sample proportion and use this information

More information

Probability Distributions

Probability Distributions CHAPTER 5 Probability Distributions CHAPTER OUTLINE 5.1 Probability Distribution of a Discrete Random Variable 5.2 Mean and Standard Deviation of a Probability Distribution 5.3 The Binomial Distribution

More information

Hypothesis Testing for Beginners

Hypothesis Testing for Beginners Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes

More information

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1. General Method: Difference of Means 1. Calculate x 1, x 2, SE 1, SE 2. 2. Combined SE = SE1 2 + SE2 2. ASSUMES INDEPENDENT SAMPLES. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n

More information

Study Guide for the Final Exam

Study Guide for the Final Exam Study Guide for the Final Exam When studying, remember that the computational portion of the exam will only involve new material (covered after the second midterm), that material from Exam 1 will make

More information

Non-Parametric Tests (I)

Non-Parametric Tests (I) Lecture 5: Non-Parametric Tests (I) KimHuat LIM lim@stats.ox.ac.uk http://www.stats.ox.ac.uk/~lim/teaching.html Slide 1 5.1 Outline (i) Overview of Distribution-Free Tests (ii) Median Test for Two Independent

More information

Math 201: Statistics November 30, 2006

Math 201: Statistics November 30, 2006 Math 201: Statistics November 30, 2006 Fall 2006 MidTerm #2 Closed book & notes; only an A4-size formula sheet and a calculator allowed; 90 mins. No questions accepted! Instructions: There are eleven pages

More information

AP STATISTICS (Warm-Up Exercises)

AP STATISTICS (Warm-Up Exercises) AP STATISTICS (Warm-Up Exercises) 1. Describe the distribution of ages in a city: 2. Graph a box plot on your calculator for the following test scores: {90, 80, 96, 54, 80, 95, 100, 75, 87, 62, 65, 85,

More information

Solutions to Homework 6 Statistics 302 Professor Larget

Solutions to Homework 6 Statistics 302 Professor Larget s to Homework 6 Statistics 302 Professor Larget Textbook Exercises 5.29 (Graded for Completeness) What Proportion Have College Degrees? According to the US Census Bureau, about 27.5% of US adults over

More information

Online 12 - Sections 9.1 and 9.2-Doug Ensley

Online 12 - Sections 9.1 and 9.2-Doug Ensley Student: Date: Instructor: Doug Ensley Course: MAT117 01 Applied Statistics - Ensley Assignment: Online 12 - Sections 9.1 and 9.2 1. Does a P-value of 0.001 give strong evidence or not especially strong

More information

Math 58. Rumbos Fall 2008 1. Solutions to Review Problems for Exam 2

Math 58. Rumbos Fall 2008 1. Solutions to Review Problems for Exam 2 Math 58. Rumbos Fall 2008 1 Solutions to Review Problems for Exam 2 1. For each of the following scenarios, determine whether the binomial distribution is the appropriate distribution for the random variable

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

University of Chicago Graduate School of Business. Business 41000: Business Statistics Solution Key

University of Chicago Graduate School of Business. Business 41000: Business Statistics Solution Key Name: OUTLINE SOLUTIONS University of Chicago Graduate School of Business Business 41000: Business Statistics Solution Key Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper

More information

The Wilcoxon Rank-Sum Test

The Wilcoxon Rank-Sum Test 1 The Wilcoxon Rank-Sum Test The Wilcoxon rank-sum test is a nonparametric alternative to the twosample t-test which is based solely on the order in which the observations from the two samples fall. We

More information

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing

Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing Business Statistics, 9e (Groebner/Shannon/Fry) Chapter 9 Introduction to Hypothesis Testing 1) Hypothesis testing and confidence interval estimation are essentially two totally different statistical procedures

More information

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING LAB 4 INSTRUCTIONS CONFIDENCE INTERVALS AND HYPOTHESIS TESTING In this lab you will explore the concept of a confidence interval and hypothesis testing through a simulation problem in engineering setting.

More information

Statistics 2014 Scoring Guidelines

Statistics 2014 Scoring Guidelines AP Statistics 2014 Scoring Guidelines College Board, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks of the College Board. AP Central is the official online home

More information

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

More information

The Procedures of Monte Carlo Simulation (and Resampling)

The Procedures of Monte Carlo Simulation (and Resampling) 154 Resampling: The New Statistics CHAPTER 10 The Procedures of Monte Carlo Simulation (and Resampling) A Definition and General Procedure for Monte Carlo Simulation Summary Until now, the steps to follow

More information
