Outline. 1 Confidence Intervals for Proportions. 2 Sample Sizes for Proportions. 3 Student s t-distribution. 4 Confidence Intervals without σ

Size: px
Start display at page:

Download "Outline. 1 Confidence Intervals for Proportions. 2 Sample Sizes for Proportions. 3 Student s t-distribution. 4 Confidence Intervals without σ"

Transcription

1 Outline 1 Confidence Intervals for Proportions 2 Sample Sizes for Proportions 3 Student s t-distribution 4 Confidence Intervals without σ

2 Outline 1 Confidence Intervals for Proportions 2 Sample Sizes for Proportions 3 Student s t-distribution 4 Confidence Intervals without σ

3 Confidence Interval for µ (pretending we know σ) Suppose a population has standard deviation σ. Taking a sample of n individuals, you obtain a sample mean x. Then you can be y-confident that the true mean µ is in the interval (x z σ n, x + z σ n ), where z was a number got from the z-table (using y). That s great for numerical data, but what about categorical data? Question Suppose you take a sample of n individuals from a population and find that x of them are successes, so that your population proportion is p = x n. Then p is our estimate of p, but what is (say) a 95% confidence interval for p?

4 Confidence Interval for µ (pretending we know σ) Suppose a population has standard deviation σ. Taking a sample of n individuals, you obtain a sample mean x. Then you can be y-confident that the true mean µ is in the interval (x z σ n, x + z σ n ), where z was a number got from the z-table (using y). That s great for numerical data, but what about categorical data? Question Suppose you take a sample of n individuals from a population and find that x of them are successes, so that your population proportion is p = x n. Then p is our estimate of p, but what is (say) a 95% confidence interval for p?

5 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.)

6 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.)

7 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.) Answer So let s just make those substitutions in our formula! ) µ is somewhere in (x z n σ, x + z n σ

8 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.) Answer So let s just make those substitutions in our formula! ) p is somewhere in (x z n σ, x + z n σ

9 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.) Answer So let s just make those substitutions in our formula! ) p is somewhere in ( p z n σ, p + z n σ

10 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.) Answer So let s just make those substitutions ( in our formula! ) p is somewhere in p z pq n, p + z pq n

11 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.) Answer So let s just make those substitutions ( in our formula! ) p is somewhere in p z pq n, p + z pq n But wait! We don t know p that s the whole point! Happily, it s good enough to use p for p and q = 1 p for q.

12 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.) Answer So let s just make those substitutions ( in our formula! ) p is somewhere in p z pq n, p + z pq n But wait! We don t know p that s the whole point! Happily, it s good enough to use p for p and q = 1 p for q.

13 Some correspondences This... corresponds to... µ p (parameter) x p (statistic) σ n pq n (standard error) (Recall that q = 1 p.) Answer So let s just make those substitutions ( in our formula! ) p is somewhere in p z p q n, p + z p q n But wait! We don t know p that s the whole point! Happily, it s good enough to use p for p and q = 1 p for q.

14 Confidence Interval for Proportions If a sample of size n reveals a sample proportion of p, then the confidence interval for the population proportion p is ( ) p q p q p z n, p + z, n where z is the z-score gotten from the confidence level in the usual way. This is good enough as long as the sample size is fairly large, and the population proportion is not too close to 0 or to 1.

15 Confidence Interval for Proportions If a sample of size n reveals a sample proportion of p, then the confidence interval for the population proportion p is ( ) p q p q p z n, p + z, n where z is the z-score gotten from the confidence level in the usual way. This is good enough as long as the sample size is fairly large, and the population proportion is not too close to 0 or to 1.

16 Example: Fish Example 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish.

17 Example: Fish Example 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution First, let s see what a 97% confidence interval looks like.

18 Example: Fish We need a 97% confidence interval. 1 Draw the standard normal curve Z. 2 Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.17 standard errors!

19 Example: Fish We need a 97% confidence interval. 1 Draw the standard normal curve Z. 2 Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.17 standard errors!

20 Example: Fish We need a 97% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.17 standard errors!

21 Example: Fish We need a 97% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.17 standard errors!

22 Example: Fish We need a 97% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.17 standard errors!

23 Example: Fish We need a 97% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.17 standard errors!

24 Example: Fish We need a 97% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.17 standard errors!

25 Example Example: Fish 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution So for 97% confidence we need 2.17 standard errors. Now p = = 0.4, so q = 0.6; also, n = 400. Thus the standard error is p q n = (0.4)(0.6) 400 = Hence our confidence interval is ( ) p q p 2.17 n, p q p n = ( (0.0245), (0.0245)) = (0.347, 0.453) Thus we can be 97% confident that the true proportion of people who like fish is somewhere between 34.7% and 45.3%.

26 Example: Fish Example 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution So for 97% confidence we need 2.17 standard errors. Now p = = 0.4, so q = 0.6; also, n = 400. Thus the standard error is p q n = (0.4)(0.6) 400 = Hence our confidence interval is ( ) p q p 2.17 n, p q p n = ( (0.0245), (0.0245)) = (0.347, 0.453) Thus we can be 97% confident that the true proportion of people who like fish is somewhere between 34.7% and 45.3%.

27 Example Example: Fish 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution So for 97% confidence we need 2.17 standard errors. Now p = = 0.4, so q = 0.6; also, n = 400. Thus the standard error is p q n = (0.4)(0.6) 400 = Hence our confidence interval is ( ) p q p 2.17 n, p q p n = ( (0.0245), (0.0245)) = (0.347, 0.453) Thus we can be 97% confident that the true proportion of people who like fish is somewhere between 34.7% and 45.3%.

28 Example Example: Fish 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution So for 97% confidence we need 2.17 standard errors. Now p = = 0.4, so q = 0.6; also, n = 400. Thus the standard error is p q n = (0.4)(0.6) 400 = Hence our confidence interval is ( ) p q p 2.17 n, p q p n = ( (0.0245), (0.0245)) = (0.347, 0.453) Thus we can be 97% confident that the true proportion of people who like fish is somewhere between 34.7% and 45.3%.

29 Example Example: Fish 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution So for 97% confidence we need 2.17 standard errors. Now p = = 0.4, so q = 0.6; also, n = 400. Thus the standard error is p q n = (0.4)(0.6) 400 = Hence our confidence interval is ( ) p q p 2.17 n, p q p n = ( (0.0245), (0.0245)) = (0.347, 0.453) Thus we can be 97% confident that the true proportion of people who like fish is somewhere between 34.7% and 45.3%.

30 Example Example: Fish 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution So for 97% confidence we need 2.17 standard errors. Now p = = 0.4, so q = 0.6; also, n = 400. Thus the standard error is p q n = (0.4)(0.6) 400 = Hence our confidence interval is ( ) p q p 2.17 n, p q p n = ( (0.0245), (0.0245)) = (0.347, 0.453) Thus we can be 97% confident that the true proportion of people who like fish is somewhere between 34.7% and 45.3%.

31 Example Example: Fish 400 randomly chosen people were asked whether they like fish; 160 said yes. Find a 97% confidence interval for p, the proportion of people in the whole population who like fish. Solution So for 97% confidence we need 2.17 standard errors. Now p = = 0.4, so q = 0.6; also, n = 400. Thus the standard error is p q n = (0.4)(0.6) 400 = Hence our confidence interval is ( ) p q p 2.17 n, p q p n = ( (0.0245), (0.0245)) = (0.347, 0.453) Thus we can be 97% confident that the true proportion of people who like fish is somewhere between 34.7% and 45.3%.

32 Outline 1 Confidence Intervals for Proportions 2 Sample Sizes for Proportions 3 Student s t-distribution 4 Confidence Intervals without σ

33 Finding a good sample size for proportions Last time, we saw how to find the sample size you need to get a confidence interval of a certain size. Can we do that for proportions as well? Example You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points?

34 Finding a good sample size for proportions Last time, we saw how to find the sample size you need to get a confidence interval of a certain size. Can we do that for proportions as well? Example You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points?

35 Finding a good sample size for proportions Last time, we saw how to find the sample size you need to get a confidence interval of a certain size. Can we do that for proportions as well? Example You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution First, let s see what a 96% confidence interval looks like.

36 Example: Red Hair We need a 96% confidence interval. 1 Draw the standard normal curve Z. 2 Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.05 standard errors!

37 Example: Red Hair We need a 96% confidence interval. 1 Draw the standard normal curve Z. 2 Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.05 standard errors!

38 Example: Red Hair We need a 96% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.05 standard errors!

39 Example: Red Hair We need a 96% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.05 standard errors!

40 Example: Red Hair We need a 96% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.05 standard errors!

41 Example: Red Hair We need a 96% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.05 standard errors!

42 Example: Red Hair We need a 96% confidence interval. 1 Draw the standard normal curve Z Draw vertical bars and label the middle with That means the remaining area is = That means the left tail has area = The z-table (backwards) tells us the tail ends at So we need 2.05 standard errors!

43 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

44 Example: Red Hair Example You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

45 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

46 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

47 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

48 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

49 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

50 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

51 Example Example: Red Hair You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. The accuracy of our 96% confidence interval is thus 2.05 standard errors, or 2.05 p q n within 0.03, so we want We want that accuracy to be p q n. Solving for n, 0.03 n 2.05 p q n 2.05 p q 0.03 = p q ( 2 n = p q. We need n to be at least p q. But we don t know p and q until we do the survey!

52 What saves us p(1 p) 1 p Fortunately, we know p is somewhere between 0 and 1. Also, p q = p(1 p). If we graph the function p(1 p), we see that it can t get too large! In fact, the largest it can be is That is, p q 0.25.

53 What saves us p(1 p) 1 p Fortunately, we know p is somewhere between 0 and 1. Also, p q = p(1 p). If we graph the function p(1 p), we see that it can t get too large! In fact, the largest it can be is That is, p q 0.25.

54 What saves us p(1 p) p Fortunately, we know p is somewhere between 0 and 1. Also, p q = p(1 p). If we graph the function p(1 p), we see that it can t get too large! In fact, the largest it can be is That is, p q 0.25.

55 What saves us p(1 p) p Fortunately, we know p is somewhere between 0 and 1. Also, p q = p(1 p). If we graph the function p(1 p), we see that it can t get too large! In fact, the largest it can be is That is, p q 0.25.

56 Example: Red Hair Example You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. We found out that we need n to be at least p q. Since the largest p q can be is 0.25, that means we need n = Thus we need at least 1,168 people in our survey in order to be 96% sure that our survey is accurate within three percentage points.

57 Example: Red Hair Example You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. We found out that we need n to be at least p q. Since the largest p q can be is 0.25, that means we need n = Thus we need at least 1,168 people in our survey in order to be 96% sure that our survey is accurate within three percentage points.

58 Example: Red Hair Example You want to find the true proportion of red-haired people in North Dakota. How many North Dakota residents should you choose randomly in order to be 96% confident that your conclusions are accurate within 3 percentage points? Solution, cont. We found out that we need n to be at least p q. Since the largest p q can be is 0.25, that means we need n = Thus we need at least 1,168 people in our survey in order to be 96% sure that our survey is accurate within three percentage points.

59 Summary Finding sample size for proportions 1 Find out what z-score you need for the desired confidence level. 2 Set your desired accuracy equal to z p q n. 3 Plug in the z-score you found. 4 Instead of p q, use their maximum value, namely Now solve for n.

60 Outline 1 Confidence Intervals for Proportions 2 Sample Sizes for Proportions 3 Student s t-distribution 4 Confidence Intervals without σ

61 Getting rid of σ The 95% confidence interval is (x 2 σ n, x + 2 σ n ) In practice, we know x and n, but we don t know σ. What can we do? Our best guess for σ is s, the sample standard deviation.

62 Sample Variance and Standard Deviation Definition If our sample yields the list of numbers {x 1, x 2,..., x n }, then the sample variance is given by s 2 = (x 1 x) 2 + (x 2 x) (x n x) 2. n 1 The sample standard deviation s is the square root of the sample variance. Alternate form An easier version for computing the sample variance is s 2 = (x 1) 2 + (x 2 ) (x n ) 2 nx 2. n 1

63 Using s instead of σ The simplest thing to do would be to use s instead of σ in our confidence interval formula. We could try (x z σ n, x + z σ n ). This actually works surprisingly well... For the rest of the time, we need another approach, known as Student s t-distribution.

64 Using s instead of σ The simplest thing to do would be to use s instead of σ in our confidence interval formula. We could try (x z σ n, x + z σ n ). This actually works surprisingly well... For the rest of the time, we need another approach, known as Student s t-distribution.

65 Using s instead of σ The simplest thing to do would be to use s instead of σ in our confidence interval formula. We could try ( x z s n, x + z s n ). This actually works surprisingly well... For the rest of the time, we need another approach, known as Student s t-distribution.

66 Using s instead of σ The simplest thing to do would be to use s instead of σ in our confidence interval formula. We could try ( x z s n, x + z s n ). This actually works surprisingly well... For the rest of the time, we need another approach, known as Student s t-distribution.

67 Using s instead of σ The simplest thing to do would be to use s instead of σ in our confidence interval formula. We could try ( x z s n, x + z s n ). This actually works surprisingly well... some of the time. For the rest of the time, we need another approach, known as Student s t-distribution.

68 Using s instead of σ The simplest thing to do would be to use s instead of σ in our confidence interval formula. We could try ( x z s n, x + z s n ). This actually works surprisingly well... some of the time. For the rest of the time, we need another approach, known as Student s t-distribution.

69 Where it began... William S. Gosset Student

70 Where it began... William S. Gosset Student Arthur Guinness Son & Co. Ltd. good

71 Where it began... William S. Gosset Student Arthur Guinness Son & Co. Ltd. good In the early 1900 s, Guinness employed Gosset as a statistician to help improve their beer. Brewing is a long, expensive process, and Gosset often had only a few batches of beer in his samples. Gosset found that using s n worked well when he had a large n, but when n was small, it was producing confidence intervals that were too small.

72 Where it began... William S. Gosset Student Arthur Guinness Son & Co. Ltd. good In the early 1900 s, Guinness employed Gosset as a statistician to help improve their beer. Brewing is a long, expensive process, and Gosset often had only a few batches of beer in his samples. Gosset found that using s n worked well when he had a large n, but when n was small, it was producing confidence intervals that were too small.

73 Where it began... William S. Gosset Student Arthur Guinness Son & Co. Ltd. good In the early 1900 s, Guinness employed Gosset as a statistician to help improve their beer. Brewing is a long, expensive process, and Gosset often had only a few batches of beer in his samples. Gosset found that using s n worked well when he had a large n, but when n was small, it was producing confidence intervals that were too small.

74 Why it goes wrong Before, when we constructed the 95% confidence interval off of µ, we got error bars of 2 σ n from the uncertainty of where µ was. x But if we don t know σ, then that just adds to our uncertainty! x

75 Why it goes wrong Before, when we constructed the 95% confidence interval off of µ, we got error bars of 2 σ n from the uncertainty of where µ was. x 2 σ n x x + 2 σ n But if we don t know σ, then that just adds to our uncertainty! x

76 Why it goes wrong Before, when we constructed the 95% confidence interval off of µ, we got error bars of 2 σ n from the uncertainty of where µ was. uncertainty from unknown µ x 2 σ n x x + 2 σ n But if we don t know σ, then that just adds to our uncertainty! x

77 Why it goes wrong Before, when we constructed the 95% confidence interval off of µ, we got error bars of 2 σ n from the uncertainty of where µ was. uncertainty from unknown µ x 2 σ n x x + 2 σ n But if we don t know σ, then that just adds to our uncertainty! x

78 Why it goes wrong Before, when we constructed the 95% confidence interval off of µ, we got error bars of 2 σ n from the uncertainty of where µ was. uncertainty from unknown µ x 2 σ n x x + 2 σ n But if we don t know σ, then that just adds to our uncertainty! uncertainty from unknown µ x 2 s n x x + 2 s n

79 Why it goes wrong Before, when we constructed the 95% confidence interval off of µ, we got error bars of 2 σ n from the uncertainty of where µ was. uncertainty from unknown µ x 2 σ n x x + 2 σ n But if we don t know σ, then that just adds to our uncertainty! uncertainty from unknown µ extra uncertainty from unknown σ x 2 s n x x + 2 s n

80 Gosset s observation If we use s instead of σ, then we re more uncertain. Therefore we need more s n s than we would need of σ n s. We got the number of standard errors to use from the Z -distribution. So that s the wrong distribution to use!

81 If we were using σ, within 2 standard errors we would have 95% confidence. Because we re working with s instead of σ, we have less confidence! So we need a flatter distribution than Z!

82 95% If we were using σ, within 2 standard errors we would have 95% confidence. Because we re working with s instead of σ, we have less confidence! So we need a flatter distribution than Z!

83 88% If we were using σ, within 2 standard errors we would have 95% confidence. Because we re working with s instead of σ, we have less confidence! So we need a flatter distribution than Z!

84 88% If we were using σ, within 2 standard errors we would have 95% confidence. Because we re working with s instead of σ, we have less confidence! So we need a flatter distribution than Z!

85 The Story of Student Wm. S. Gosset discovered the flatter distribution that gives the confidence intervals with small sample sizes. Some years earlier, a Guinness employee had published some of the company s brewing secrets, so Guinness prohibited its employees from publishing. Gosset pleaded with Guinness to let him publish math. They finally gave him permission, under one condition.

86

87 The t-distribution Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

88 z-distribution The t-distribution t-distribution with 1 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

89 z-distribution The t-distribution t-distribution with 2 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

90 z-distribution The t-distribution t-distribution with 3 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

91 z-distribution The t-distribution t-distribution with 4 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

92 z-distribution The t-distribution t-distribution with 5 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

93 z-distribution The t-distribution t-distribution with 6 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

94 z-distribution The t-distribution t-distribution with 7 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

95 z-distribution The t-distribution t-distribution with 8 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

96 z-distribution The t-distribution t-distribution with 9 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

97 z-distribution The t-distribution t-distribution with 10 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

98 z-distribution The t-distribution t-distribution with 11 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

99 z-distribution The t-distribution t-distribution with 12 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

100 z-distribution The t-distribution t-distribution with 13 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

101 z-distribution The t-distribution t-distribution with 14 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

102 z-distribution The t-distribution t-distribution with 15 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

103 z-distribution The t-distribution t-distribution with 16 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

104 z-distribution The t-distribution t-distribution with 17 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

105 z-distribution The t-distribution t-distribution with 18 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

106 z-distribution The t-distribution t-distribution with 19 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

107 z-distribution The t-distribution t-distribution with 20 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

108 z-distribution The t-distribution t-distribution with 21 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

109 z-distribution The t-distribution t-distribution with 22 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

110 z-distribution The t-distribution t-distribution with 23 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

111 z-distribution The t-distribution t-distribution with 24 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

112 z-distribution The t-distribution t-distribution with 25 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

113 z-distribution The t-distribution t-distribution with 26 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

114 z-distribution The t-distribution t-distribution with 27 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

115 z-distribution The t-distribution t-distribution with 28 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

116 z-distribution The t-distribution t-distribution with 29 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

117 z-distribution The t-distribution t-distribution with 30 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom.

118 z-distribution The t-distribution t-distribution with 30 degrees of freedom Gosset found the formula for the right distribution for small samples. There s a different distribution for each sample size. If your sample size is n, you use the t-distribution with n 1 degrees of freedom. If n 30, then the t-distribution is almost exactly the normal curve Z.

119 Outline 1 Confidence Intervals for Proportions 2 Sample Sizes for Proportions 3 Student s t-distribution 4 Confidence Intervals without σ

120 Student s Conclusions To make a confidence interval when we don t know σ, we replace σ n with our estimate s n. If our sample size n is at least 30, we use the Z -curve just like last time. If our sample size n is less than 30, we use the t-curve for n 1 degrees of freedom. So the only change in our procedure is to look up the numbers in a different table!

121 Finding a y-confidence interval from a small sample 1 Subtract 1 from the sample size n to get n 1 degrees of freedom. 2 Draw Student s t-distribution with n 1 degrees of freedom. 3 Draw two vertical bars symmetrically on the graph, and label the middle with y. 4 That means the remaining area is 1 y. 5 That means the left tail has area 1 y 2. 6 Use the appropriate t-table to learn where that tail ends! 7 Use that many standard errors s n!

122 Finding a y-confidence interval from a small sample 1 Subtract 1 from the sample size n to get n 1 degrees of freedom. 2 Draw Student s t-distribution with n 1 degrees of freedom. 3 Draw two vertical bars symmetrically on the graph, and label the middle with y. 4 That means the remaining area is 1 y. 5 That means the left tail has area 1 y 2. 6 Use the appropriate t-table to learn where that tail ends! 7 Use that many standard errors s n!

123 Finding a y-confidence interval from a small sample 1 Subtract 1 from the sample size n to get n 1 degrees of freedom. 2 Draw Student s t-distribution with n 1 degrees of freedom. y 3 Draw two vertical bars symmetrically on the graph, and label the middle with y. 4 That means the remaining area is 1 y. 5 That means the left tail has area 1 y 2. 6 Use the appropriate t-table to learn where that tail ends! 7 Use that many standard errors s n!

124 Finding a y-confidence interval from a small sample 1 Subtract 1 from the sample size n to get n 1 degrees of freedom. 2 Draw Student s t-distribution with n 1 degrees of freedom. y 1 y 3 Draw two vertical bars symmetrically on the graph, and label the middle with y. 4 That means the remaining area is 1 y. 5 That means the left tail has area 1 y 2. 6 Use the appropriate t-table to learn where that tail ends! 7 Use that many standard errors s n!

125 Finding a y-confidence interval from a small sample 1 Subtract 1 from the sample size n to get n 1 degrees of freedom. 2 Draw Student s t-distribution with n 1 degrees of freedom. 1 y 2 y 1 y 3 Draw two vertical bars symmetrically on the graph, and label the middle with y. 4 That means the remaining area is 1 y. 5 That means the left tail has area 1 y 2. 6 Use the appropriate t-table to learn where that tail ends! 7 Use that many standard errors s n!

126 Finding a y-confidence interval from a small sample 1 Subtract 1 from the sample size n to get n 1 degrees of freedom. 2 Draw Student s t-distribution with n 1 degrees of freedom. 1 y 2 y 1 y t 3 Draw two vertical bars symmetrically on the graph, and label the middle with y. 4 That means the remaining area is 1 y. 5 That means the left tail has area 1 y 2. 6 Use the appropriate t-table to learn where that tail ends! 7 Use that many standard errors s n!

127 Finding a y-confidence interval from a small sample 1 Subtract 1 from the sample size n to get n 1 degrees of freedom. 2 Draw Student s t-distribution with n 1 degrees of freedom. 1 y 2 y 1 y t 3 Draw two vertical bars symmetrically on the graph, and label the middle with y. 4 That means the remaining area is 1 y. 5 That means the left tail has area 1 y 2. 6 Use the appropriate t-table to learn where that tail ends! 7 Use that many standard errors s n!

128 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean µ.

129 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean µ. Solution First we need to find the sample mean x and sample standard deviation s x = = s 2 = = 0.545, 5 1 so s = = Next, we need to see what a 94% confidence interval looks like for a sample size of n = 5.

130 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean µ. Solution First we need to find the sample mean x and sample standard deviation s x = = s 2 = = 0.545, 5 1 so s = = Next, we need to see what a 94% confidence interval looks like for a sample size of n = 5.

131 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean µ. Solution First we need to find the sample mean x and sample standard deviation s x = = s 2 = = 0.545, 5 1 so s = = Next, we need to see what a 94% confidence interval looks like for a sample size of n = 5.

132 Example: Sugar 1 n = 5, so we need 5 1 = 4 degrees of freedom. 2 Draw Student s t-distribution with 4 degrees of freedom. 3 Draw two vertical bars symmetrically on the graph, and label the middle with That means the remaining area is That means the left tail has area The t-table for 4 degrees of freedom says the tail ends at So we need 2.60 standard errors s n!

133 Example: Sugar 1 n = 5, so we need 5 1 = 4 degrees of freedom. 2 Draw Student s t-distribution with 4 degrees of freedom. 3 Draw two vertical bars symmetrically on the graph, and label the middle with That means the remaining area is That means the left tail has area The t-table for 4 degrees of freedom says the tail ends at So we need 2.60 standard errors s n!

134 Example: Sugar 1 n = 5, so we need 5 1 = 4 degrees of freedom. 2 Draw Student s t-distribution with 4 degrees of freedom Draw two vertical bars symmetrically on the graph, and label the middle with That means the remaining area is That means the left tail has area The t-table for 4 degrees of freedom says the tail ends at So we need 2.60 standard errors s n!

135 Example: Sugar 1 n = 5, so we need 5 1 = 4 degrees of freedom. 2 Draw Student s t-distribution with 4 degrees of freedom Draw two vertical bars symmetrically on the graph, and label the middle with That means the remaining area is That means the left tail has area The t-table for 4 degrees of freedom says the tail ends at So we need 2.60 standard errors s n!

136 Example: Sugar 1 n = 5, so we need 5 1 = 4 degrees of freedom. 2 Draw Student s t-distribution with 4 degrees of freedom Draw two vertical bars symmetrically on the graph, and label the middle with That means the remaining area is That means the left tail has area The t-table for 4 degrees of freedom says the tail ends at So we need 2.60 standard errors s n!

137 Example: Sugar 1 n = 5, so we need 5 1 = 4 degrees of freedom. 2 Draw Student s t-distribution with 4 degrees of freedom Draw two vertical bars symmetrically on the graph, and label the middle with That means the remaining area is That means the left tail has area The t-table for 4 degrees of freedom says the tail ends at So we need 2.60 standard errors s n!

138 Example: Sugar 1 n = 5, so we need 5 1 = 4 degrees of freedom. 2 Draw Student s t-distribution with 4 degrees of freedom Draw two vertical bars symmetrically on the graph, and label the middle with That means the remaining area is That means the left tail has area The t-table for 4 degrees of freedom says the tail ends at So we need 2.60 standard errors s n!

139 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean consumption µ. Solution So we need 2.60 standard errors; recall that n = 5, x = 4.6, and s = So the confidence interval is ( x 2.60 s, x s ) n n ( = , ) 5 5 = (3.741, 5.459). Thus Mrs. Smith can be 94% sure that her family averages between pounds and pounds of sugar per week.

140 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean consumption µ. Solution So we need 2.60 standard errors; recall that n = 5, x = 4.6, and s = So the confidence interval is ( x 2.60 s, x s ) n n ( = , ) 5 5 = (3.741, 5.459). Thus Mrs. Smith can be 94% sure that her family averages between pounds and pounds of sugar per week.

141 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean consumption µ. Solution So we need 2.60 standard errors; recall that n = 5, x = 4.6, and s = So the confidence interval is ( x 2.60 s, x s ) n n ( = , ) 5 5 = (3.741, 5.459). Thus Mrs. Smith can be 94% sure that her family averages between pounds and pounds of sugar per week.

142 Example Example: Sugar Mrs. Smith is worried about her family s health, so she keeps track of how much sugar they use. In five randomly picked weeks, they used the following amounts of sugar (in pounds): Construct a 94% confidence interval for the true mean consumption µ. Solution So we need 2.60 standard errors; recall that n = 5, x = 4.6, and s = So the confidence interval is ( x 2.60 s, x s ) n n ( = , ) 5 5 = (3.741, 5.459). Thus Mrs. Smith can be 94% sure that her family averages between pounds and pounds of sugar per week.

MEASURES OF VARIATION

MEASURES OF VARIATION NORMAL DISTRIBTIONS MEASURES OF VARIATION In statistics, it is important to measure the spread of data. A simple way to measure spread is to find the range. But statisticians want to know if the data are

More information

Lesson 7 Z-Scores and Probability

Lesson 7 Z-Scores and Probability Lesson 7 Z-Scores and Probability Outline Introduction Areas Under the Normal Curve Using the Z-table Converting Z-score to area -area less than z/area greater than z/area between two z-values Converting

More information

Simple Regression Theory II 2010 Samuel L. Baker

Simple Regression Theory II 2010 Samuel L. Baker SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

More information

Confidence intervals

Confidence intervals Confidence intervals Today, we re going to start talking about confidence intervals. We use confidence intervals as a tool in inferential statistics. What this means is that given some sample statistics,

More information

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)

Objectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI) Objectives 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) Statistical confidence (CIS gives a good explanation of a 95% CI) Confidence intervals. Further reading http://onlinestatbook.com/2/estimation/confidence.html

More information

Standard Deviation Estimator

Standard Deviation Estimator CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

More information

Chapter 4 Online Appendix: The Mathematics of Utility Functions

Chapter 4 Online Appendix: The Mathematics of Utility Functions Chapter 4 Online Appendix: The Mathematics of Utility Functions We saw in the text that utility functions and indifference curves are different ways to represent a consumer s preferences. Calculus can

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 5 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data As

More information

Estimation and Confidence Intervals

Estimation and Confidence Intervals Estimation and Confidence Intervals Fall 2001 Professor Paul Glasserman B6014: Managerial Statistics 403 Uris Hall Properties of Point Estimates 1 We have already encountered two point estimators: th e

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

5.1 Identifying the Target Parameter

5.1 Identifying the Target Parameter University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

More information

4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions 4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

More information

Normal distribution. ) 2 /2σ. 2π σ

Normal distribution. ) 2 /2σ. 2π σ Normal distribution The normal distribution is the most widely known and used of all distributions. Because the normal distribution approximates many natural phenomena so well, it has developed into a

More information

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1.

Def: The standard normal distribution is a normal probability distribution that has a mean of 0 and a standard deviation of 1. Lecture 6: Chapter 6: Normal Probability Distributions A normal distribution is a continuous probability distribution for a random variable x. The graph of a normal distribution is called the normal curve.

More information

z-scores AND THE NORMAL CURVE MODEL

z-scores AND THE NORMAL CURVE MODEL z-scores AND THE NORMAL CURVE MODEL 1 Understanding z-scores 2 z-scores A z-score is a location on the distribution. A z- score also automatically communicates the raw score s distance from the mean A

More information

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test

Outline. Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test The t-test Outline Definitions Descriptive vs. Inferential Statistics The t-test - One-sample t-test - Dependent (related) groups t-test - Independent (unrelated) groups t-test Comparing means Correlation

More information

Frequency Distributions

Frequency Distributions Descriptive Statistics Dr. Tom Pierce Department of Psychology Radford University Descriptive statistics comprise a collection of techniques for better understanding what the people in a group look like

More information

2 ESTIMATION. Objectives. 2.0 Introduction

2 ESTIMATION. Objectives. 2.0 Introduction 2 ESTIMATION Chapter 2 Estimation Objectives After studying this chapter you should be able to calculate confidence intervals for the mean of a normal distribution with unknown variance; be able to calculate

More information

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7.

THE FIRST SET OF EXAMPLES USE SUMMARY DATA... EXAMPLE 7.2, PAGE 227 DESCRIBES A PROBLEM AND A HYPOTHESIS TEST IS PERFORMED IN EXAMPLE 7. THERE ARE TWO WAYS TO DO HYPOTHESIS TESTING WITH STATCRUNCH: WITH SUMMARY DATA (AS IN EXAMPLE 7.17, PAGE 236, IN ROSNER); WITH THE ORIGINAL DATA (AS IN EXAMPLE 8.5, PAGE 301 IN ROSNER THAT USES DATA FROM

More information

6 3 The Standard Normal Distribution

6 3 The Standard Normal Distribution 290 Chapter 6 The Normal Distribution Figure 6 5 Areas Under a Normal Distribution Curve 34.13% 34.13% 2.28% 13.59% 13.59% 2.28% 3 2 1 + 1 + 2 + 3 About 68% About 95% About 99.7% 6 3 The Distribution Since

More information

Week 3&4: Z tables and the Sampling Distribution of X

Week 3&4: Z tables and the Sampling Distribution of X Week 3&4: Z tables and the Sampling Distribution of X 2 / 36 The Standard Normal Distribution, or Z Distribution, is the distribution of a random variable, Z N(0, 1 2 ). The distribution of any other normal

More information

Point and Interval Estimates

Point and Interval Estimates Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

More information

CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

More information

WHERE DOES THE 10% CONDITION COME FROM?

WHERE DOES THE 10% CONDITION COME FROM? 1 WHERE DOES THE 10% CONDITION COME FROM? The text has mentioned The 10% Condition (at least) twice so far: p. 407 Bernoulli trials must be independent. If that assumption is violated, it is still okay

More information

Evaluating trigonometric functions

Evaluating trigonometric functions MATH 1110 009-09-06 Evaluating trigonometric functions Remark. Throughout this document, remember the angle measurement convention, which states that if the measurement of an angle appears without units,

More information

Lesson 17: Margin of Error When Estimating a Population Proportion

Lesson 17: Margin of Error When Estimating a Population Proportion Margin of Error When Estimating a Population Proportion Classwork In this lesson, you will find and interpret the standard deviation of a simulated distribution for a sample proportion and use this information

More information

1. How different is the t distribution from the normal?

1. How different is the t distribution from the normal? Statistics 101 106 Lecture 7 (20 October 98) c David Pollard Page 1 Read M&M 7.1 and 7.2, ignoring starred parts. Reread M&M 3.2. The effects of estimated variances on normal approximations. t-distributions.

More information

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means

Lesson 1: Comparison of Population Means Part c: Comparison of Two- Means Lesson : Comparison of Population Means Part c: Comparison of Two- Means Welcome to lesson c. This third lesson of lesson will discuss hypothesis testing for two independent means. Steps in Hypothesis

More information

Descriptive Statistics and Measurement Scales

Descriptive Statistics and Measurement Scales Descriptive Statistics 1 Descriptive Statistics and Measurement Scales Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample

More information

Statistical estimation using confidence intervals

Statistical estimation using confidence intervals 0894PP_ch06 15/3/02 11:02 am Page 135 6 Statistical estimation using confidence intervals In Chapter 2, the concept of the central nature and variability of data and the methods by which these two phenomena

More information

Z-table p-values: use choice 2: normalcdf(

Z-table p-values: use choice 2: normalcdf( P-values with the Ti83/Ti84 Note: The majority of the commands used in this handout can be found under the DISTR menu which you can access by pressing [ nd ] [VARS]. You should see the following: NOTE:

More information

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MBA 611 STATISTICS AND QUANTITATIVE METHODS MBA 611 STATISTICS AND QUANTITATIVE METHODS Part I. Review of Basic Statistics (Chapters 1-11) A. Introduction (Chapter 1) Uncertainty: Decisions are often based on incomplete information from uncertain

More information

T O P I C 1 2 Techniques and tools for data analysis Preview Introduction In chapter 3 of Statistics In A Day different combinations of numbers and types of variables are presented. We go through these

More information

14.02 Principles of Macroeconomics Problem Set 1 Fall 2005 ***Solution***

14.02 Principles of Macroeconomics Problem Set 1 Fall 2005 ***Solution*** Part I. True/False/Uncertain Justify your answer with a short argument. 14.02 Principles of Macroeconomics Problem Set 1 Fall 2005 ***Solution*** Posted: Monday, September 12, 2005 Due: Wednesday, September

More information

c 2008 Je rey A. Miron We have described the constraints that a consumer faces, i.e., discussed the budget constraint.

c 2008 Je rey A. Miron We have described the constraints that a consumer faces, i.e., discussed the budget constraint. Lecture 2b: Utility c 2008 Je rey A. Miron Outline: 1. Introduction 2. Utility: A De nition 3. Monotonic Transformations 4. Cardinal Utility 5. Constructing a Utility Function 6. Examples of Utility Functions

More information

Describing Populations Statistically: The Mean, Variance, and Standard Deviation

Describing Populations Statistically: The Mean, Variance, and Standard Deviation Describing Populations Statistically: The Mean, Variance, and Standard Deviation BIOLOGICAL VARIATION One aspect of biology that holds true for almost all species is that not every individual is exactly

More information

One-Way Analysis of Variance

One-Way Analysis of Variance One-Way Analysis of Variance Note: Much of the math here is tedious but straightforward. We ll skim over it in class but you should be sure to ask questions if you don t understand it. I. Overview A. We

More information

Descriptive statistics; Correlation and regression

Descriptive statistics; Correlation and regression Descriptive statistics; and regression Patrick Breheny September 16 Patrick Breheny STA 580: Biostatistics I 1/59 Tables and figures Descriptive statistics Histograms Numerical summaries Percentiles Human

More information

Unit 1 Number Sense. In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions.

Unit 1 Number Sense. In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions. Unit 1 Number Sense In this unit, students will study repeating decimals, percents, fractions, decimals, and proportions. BLM Three Types of Percent Problems (p L-34) is a summary BLM for the material

More information

TEACHER NOTES MATH NSPIRED

TEACHER NOTES MATH NSPIRED Math Objectives Students will understand that normal distributions can be used to approximate binomial distributions whenever both np and n(1 p) are sufficiently large. Students will understand that when

More information

Constructing and Interpreting Confidence Intervals

Constructing and Interpreting Confidence Intervals Constructing and Interpreting Confidence Intervals Confidence Intervals In this power point, you will learn: Why confidence intervals are important in evaluation research How to interpret a confidence

More information

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median

The right edge of the box is the third quartile, Q 3, which is the median of the data values above the median. Maximum Median CONDENSED LESSON 2.1 Box Plots In this lesson you will create and interpret box plots for sets of data use the interquartile range (IQR) to identify potential outliers and graph them on a modified box

More information

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm

99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the

More information

TImath.com. Statistics. Areas in Intervals

TImath.com. Statistics. Areas in Intervals Areas in Intervals ID: 9472 TImath.com Time required 30 minutes Activity Overview In this activity, students use several methods to determine the probability of a given normally distributed value being

More information

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013 Statistics I for QBIC Text Book: Biostatistics, 10 th edition, by Daniel & Cross Contents and Objectives Chapters 1 7 Revised: August 2013 Chapter 1: Nature of Statistics (sections 1.1-1.6) Objectives

More information

Lesson 9 Hypothesis Testing

Lesson 9 Hypothesis Testing Lesson 9 Hypothesis Testing Outline Logic for Hypothesis Testing Critical Value Alpha (α) -level.05 -level.01 One-Tail versus Two-Tail Tests -critical values for both alpha levels Logic for Hypothesis

More information

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k.

REPEATED TRIALS. The probability of winning those k chosen times and losing the other times is then p k q n k. REPEATED TRIALS Suppose you toss a fair coin one time. Let E be the event that the coin lands heads. We know from basic counting that p(e) = 1 since n(e) = 1 and 2 n(s) = 2. Now suppose we play a game

More information

3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples 3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

The Normal Distribution

The Normal Distribution Chapter 6 The Normal Distribution 6.1 The Normal Distribution 1 6.1.1 Student Learning Objectives By the end of this chapter, the student should be able to: Recognize the normal probability distribution

More information

7.6 Approximation Errors and Simpson's Rule

7.6 Approximation Errors and Simpson's Rule WileyPLUS: Home Help Contact us Logout Hughes-Hallett, Calculus: Single and Multivariable, 4/e Calculus I, II, and Vector Calculus Reading content Integration 7.1. Integration by Substitution 7.2. Integration

More information

2.5 Zeros of a Polynomial Functions

2.5 Zeros of a Polynomial Functions .5 Zeros of a Polynomial Functions Section.5 Notes Page 1 The first rule we will talk about is Descartes Rule of Signs, which can be used to determine the possible times a graph crosses the x-axis and

More information

Elasticity. I. What is Elasticity?

Elasticity. I. What is Elasticity? Elasticity I. What is Elasticity? The purpose of this section is to develop some general rules about elasticity, which may them be applied to the four different specific types of elasticity discussed in

More information

Means, standard deviations and. and standard errors

Means, standard deviations and. and standard errors CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard

More information

Notes on Continuous Random Variables

Notes on Continuous Random Variables Notes on Continuous Random Variables Continuous random variables are random quantities that are measured on a continuous scale. They can usually take on any value over some interval, which distinguishes

More information

Random variables, probability distributions, binomial random variable

Random variables, probability distributions, binomial random variable Week 4 lecture notes. WEEK 4 page 1 Random variables, probability distributions, binomial random variable Eample 1 : Consider the eperiment of flipping a fair coin three times. The number of tails that

More information

Characteristics of Binomial Distributions

Characteristics of Binomial Distributions Lesson2 Characteristics of Binomial Distributions In the last lesson, you constructed several binomial distributions, observed their shapes, and estimated their means and standard deviations. In Investigation

More information

Social Studies 201 Notes for November 19, 2003

Social Studies 201 Notes for November 19, 2003 1 Social Studies 201 Notes for November 19, 2003 Determining sample size for estimation of a population proportion Section 8.6.2, p. 541. As indicated in the notes for November 17, when sample size is

More information

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing

Chapter 8 Hypothesis Testing Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing Chapter 8 Hypothesis Testing 1 Chapter 8 Hypothesis Testing 8-1 Overview 8-2 Basics of Hypothesis Testing 8-3 Testing a Claim About a Proportion 8-5 Testing a Claim About a Mean: s Not Known 8-6 Testing

More information

Recall this chart that showed how most of our course would be organized:

Recall this chart that showed how most of our course would be organized: Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

More information

SAMPLE SIZE CONSIDERATIONS

SAMPLE SIZE CONSIDERATIONS SAMPLE SIZE CONSIDERATIONS Learning Objectives Understand the critical role having the right sample size has on an analysis or study. Know how to determine the correct sample size for a specific study.

More information

Squaring, Cubing, and Cube Rooting

Squaring, Cubing, and Cube Rooting Squaring, Cubing, and Cube Rooting Arthur T. Benjamin Harvey Mudd College Claremont, CA 91711 benjamin@math.hmc.edu I still recall my thrill and disappointment when I read Mathematical Carnival [4], by

More information

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means

Chapter Study Guide. Chapter 11 Confidence Intervals and Hypothesis Testing for Means OPRE504 Chapter Study Guide Chapter 11 Confidence Intervals and Hypothesis Testing for Means I. Calculate Probability for A Sample Mean When Population σ Is Known 1. First of all, we need to find out the

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe

Introduction to the Practice of Statistics Fifth Edition Moore, McCabe Introduction to the Practice of Statistics Fifth Edition Moore, McCabe Section 5.1 Homework Answers 5.7 In the proofreading setting if Exercise 5.3, what is the smallest number of misses m with P(X m)

More information

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction

CA200 Quantitative Analysis for Business Decisions. File name: CA200_Section_04A_StatisticsIntroduction CA200 Quantitative Analysis for Business Decisions File name: CA200_Section_04A_StatisticsIntroduction Table of Contents 4. Introduction to Statistics... 1 4.1 Overview... 3 4.2 Discrete or continuous

More information

The Normal Distribution

The Normal Distribution The Normal Distribution Continuous Distributions A continuous random variable is a variable whose possible values form some interval of numbers. Typically, a continuous variable involves a measurement

More information

A Short Guide to Significant Figures

A Short Guide to Significant Figures A Short Guide to Significant Figures Quick Reference Section Here are the basic rules for significant figures - read the full text of this guide to gain a complete understanding of what these rules really

More information

Demand. Lecture 3. August 2015. Reading: Perlo Chapter 4 1 / 58

Demand. Lecture 3. August 2015. Reading: Perlo Chapter 4 1 / 58 Demand Lecture 3 Reading: Perlo Chapter 4 August 2015 1 / 58 Introduction We saw the demand curve in chapter 2. We learned about consumer decision making in chapter 3. Now we bridge the gap between the

More information

Math 201: Statistics November 30, 2006

Math 201: Statistics November 30, 2006 Math 201: Statistics November 30, 2006 Fall 2006 MidTerm #2 Closed book & notes; only an A4-size formula sheet and a calculator allowed; 90 mins. No questions accepted! Instructions: There are eleven pages

More information

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared. jn2@ecs.soton.ac.uk

COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared. jn2@ecs.soton.ac.uk COMP6053 lecture: Relationship between two variables: correlation, covariance and r-squared jn2@ecs.soton.ac.uk Relationships between variables So far we have looked at ways of characterizing the distribution

More information

Independent samples t-test. Dr. Tom Pierce Radford University

Independent samples t-test. Dr. Tom Pierce Radford University Independent samples t-test Dr. Tom Pierce Radford University The logic behind drawing causal conclusions from experiments The sampling distribution of the difference between means The standard error of

More information

Section 6.1 Discrete Random variables Probability Distribution

Section 6.1 Discrete Random variables Probability Distribution Section 6.1 Discrete Random variables Probability Distribution Definitions a) Random variable is a variable whose values are determined by chance. b) Discrete Probability distribution consists of the values

More information

Simple linear regression

Simple linear regression Simple linear regression Introduction Simple linear regression is a statistical method for obtaining a formula to predict values of one variable from another where there is a causal relationship between

More information

Chapter 7: Simple linear regression Learning Objectives

Chapter 7: Simple linear regression Learning Objectives Chapter 7: Simple linear regression Learning Objectives Reading: Section 7.1 of OpenIntro Statistics Video: Correlation vs. causation, YouTube (2:19) Video: Intro to Linear Regression, YouTube (5:18) -

More information

Statistical Confidence Calculations

Statistical Confidence Calculations Statistical Confidence Calculations Statistical Methodology Omniture Test&Target utilizes standard statistics to calculate confidence, confidence intervals, and lift for each campaign. The student s T

More information

Normal and Binomial. Distributions

Normal and Binomial. Distributions Normal and Binomial Distributions Library, Teaching and Learning 14 By now, you know about averages means in particular and are familiar with words like data, standard deviation, variance, probability,

More information

8. THE NORMAL DISTRIBUTION

8. THE NORMAL DISTRIBUTION 8. THE NORMAL DISTRIBUTION The normal distribution with mean μ and variance σ 2 has the following density function: The normal distribution is sometimes called a Gaussian Distribution, after its inventor,

More information

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables Discrete vs. continuous random variables Examples of continuous distributions o Uniform o Exponential o Normal Recall: A random

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Mathematics 57 Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved TABLE OF CONTENTS PAGE Helpful Hints for All Tests...1 Tests

More information

Kenken For Teachers. Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles June 27, 2010. Abstract

Kenken For Teachers. Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles June 27, 2010. Abstract Kenken For Teachers Tom Davis tomrdavis@earthlink.net http://www.geometer.org/mathcircles June 7, 00 Abstract Kenken is a puzzle whose solution requires a combination of logic and simple arithmetic skills.

More information

6.4 Normal Distribution

6.4 Normal Distribution Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

More information

The fundamental question in economics is 2. Consumer Preferences

The fundamental question in economics is 2. Consumer Preferences A Theory of Consumer Behavior Preliminaries 1. Introduction The fundamental question in economics is 2. Consumer Preferences Given limited resources, how are goods and service allocated? 1 3. Indifference

More information

7. Normal Distributions

7. Normal Distributions 7. Normal Distributions A. Introduction B. History C. Areas of Normal Distributions D. Standard Normal E. Exercises Most of the statistical analyses presented in this book are based on the bell-shaped

More information

Playing with Numbers

Playing with Numbers PLAYING WITH NUMBERS 249 Playing with Numbers CHAPTER 16 16.1 Introduction You have studied various types of numbers such as natural numbers, whole numbers, integers and rational numbers. You have also

More information

Lesson one. Proportions in the Port of Long Beach 1. Terminal Objective. Lesson 1

Lesson one. Proportions in the Port of Long Beach 1. Terminal Objective. Lesson 1 Proportions in the Port of Long Beach Lesson one Terminal Objective Content Standard Reference: Students will solve Port of Long Beach word problems by writing a proportion and using the cross product

More information

Math 251, Review Questions for Test 3 Rough Answers

Math 251, Review Questions for Test 3 Rough Answers Math 251, Review Questions for Test 3 Rough Answers 1. (Review of some terminology from Section 7.1) In a state with 459,341 voters, a poll of 2300 voters finds that 45 percent support the Republican candidate,

More information

Coins, Presidents, and Justices: Normal Distributions and z-scores

Coins, Presidents, and Justices: Normal Distributions and z-scores activity 17.1 Coins, Presidents, and Justices: Normal Distributions and z-scores In the first part of this activity, you will generate some data that should have an approximately normal (or bell-shaped)

More information

AK 4 SLUTSKY COMPENSATION

AK 4 SLUTSKY COMPENSATION AK 4 SLUTSKY COMPENSATION ECON 210 A. JOSEPH GUSE (1) (a) First calculate the demand at the original price p b = 2 b(p b,m) = 1000 20 5p b b 0 = b(2) = 40 In general m c = m+(p 1 b p0 b )b 0. If the price

More information

c. Given your answer in part (b), what do you anticipate will happen in this market in the long-run?

c. Given your answer in part (b), what do you anticipate will happen in this market in the long-run? Perfect Competition Questions Question 1 Suppose there is a perfectly competitive industry where all the firms are identical with identical cost curves. Furthermore, suppose that a representative firm

More information

Partial Fractions. Combining fractions over a common denominator is a familiar operation from algebra:

Partial Fractions. Combining fractions over a common denominator is a familiar operation from algebra: Partial Fractions Combining fractions over a common denominator is a familiar operation from algebra: From the standpoint of integration, the left side of Equation 1 would be much easier to work with than

More information

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives.

5/31/2013. 6.1 Normal Distributions. Normal Distributions. Chapter 6. Distribution. The Normal Distribution. Outline. Objectives. The Normal Distribution C H 6A P T E R The Normal Distribution Outline 6 1 6 2 Applications of the Normal Distribution 6 3 The Central Limit Theorem 6 4 The Normal Approximation to the Binomial Distribution

More information

Math 108 Exam 3 Solutions Spring 00

Math 108 Exam 3 Solutions Spring 00 Math 108 Exam 3 Solutions Spring 00 1. An ecologist studying acid rain takes measurements of the ph in 12 randomly selected Adirondack lakes. The results are as follows: 3.0 6.5 5.0 4.2 5.5 4.7 3.4 6.8

More information

The Normal distribution

The Normal distribution The Normal distribution The normal probability distribution is the most common model for relative frequencies of a quantitative variable. Bell-shaped and described by the function f(y) = 1 2σ π e{ 1 2σ

More information

Basic numerical skills: EQUATIONS AND HOW TO SOLVE THEM. x + 5 = 7 2 + 5-2 = 7-2 5 + (2-2) = 7-2 5 = 5. x + 5-5 = 7-5. x + 0 = 20.

Basic numerical skills: EQUATIONS AND HOW TO SOLVE THEM. x + 5 = 7 2 + 5-2 = 7-2 5 + (2-2) = 7-2 5 = 5. x + 5-5 = 7-5. x + 0 = 20. Basic numerical skills: EQUATIONS AND HOW TO SOLVE THEM 1. Introduction (really easy) An equation represents the equivalence between two quantities. The two sides of the equation are in balance, and solving

More information

Charlesworth School Year Group Maths Targets

Charlesworth School Year Group Maths Targets Charlesworth School Year Group Maths Targets Year One Maths Target Sheet Key Statement KS1 Maths Targets (Expected) These skills must be secure to move beyond expected. I can compare, describe and solve

More information

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

 Y. Notation and Equations for Regression Lecture 11/4. Notation: Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

More information

Determine If An Equation Represents a Function

Determine If An Equation Represents a Function Question : What is a linear function? The term linear function consists of two parts: linear and function. To understand what these terms mean together, we must first understand what a function is. The

More information