# 2 ESTIMATION. Objectives. 2.0 Introduction

Save this PDF as:

Size: px
Start display at page:

## Transcription

1 2 ESTIMATION Chapter 2 Estimation Objectives After studying this chapter you should be able to calculate confidence intervals for the mean of a normal distribution with unknown variance; be able to calculate confidence intervals for the variance and standard deviation of a normal distribution; be able to calculate approximate confidence intervals for a proportion; be able to calculate approximate confidence intervals for the mean of a Poisson distribution. 2.0 Introduction In Chapter 9 of the text Statistics the idea of a confidence interval was introduced. Confidence intervals are used when we want to estimate a population parameter from a sample. The parameter may be estimated by a single value (a point estimate) but it is usually preferable to estimate it by an interval which will give some indication of the amount of uncertainty attached to the estimate. In Statistics, estimation of the mean of a normal population with known standard deviation was considered. If x is the mean of a random sample of size n from a normal distribution with mean µ and standard deviation σ there is a probability of 0.95 that x lies within σ n of µ. If this is the case the interval x ±1. 96 σ n will contain µ. This interval is called a 95% confidence interval for µ. If further samples of size n were taken and the calculation repeated, different intervals would be calculated. 95% of these intervals would contain µ, but 5% would not Note: although µ is unknown, it does not vary, it is the intervals that vary. It is possible to calculate 99% or even 99.9% confidence intervals which would be wider than the 95% interval but it is not possible to calculate 100% confidence intervals. µ 1.96 σ n µ x µ σ n 21

2 Example The length of time a bus takes to travel from Chorlton to Saints in the morning rush hour is normally distributed with standard deviation 4 minutes. A random sample of 6 journeys took 23, 19, 25, 34, 24 and 28 minutes. Find All (i) a 95% confidence interval for the mean journey time, (ii) a 99% confidence interval for the mean journey time. Solution The sample mean is = 25.5 (i) 95% confidence interval is ± i.e 25.5 ± 3.20 or ( 22.3, 28.7) (ii) 99% confidence interval is 25.5 ± ( ) i.e ± 4.21 or 21.3, Confidence interval for mean: standard deviation unknown The example above of the bus journey times is somewhat unrealistic. How was it known that the bus journey times were normally distributed with a standard deviation of 4 minutes? If so much was known about the distribution how was it that the mean was unknown? Probably the journey times for similar journeys had been studied and the standard deviation found to be about four minutes. The statement that the standard deviation was 'known' was probably something of an exaggeration. The same argument applies to the normal distribution. However, as you are dealing with the sample mean, no great error will result from assuming a normal distribution unless the distribution is extremely unusual. If the standard deviation is completely unknown or if you do not wish to accept a rough estimate, there is still a way forward. Provided the sample has more than one member, an estimate of the standard deviation may be made from the sample. In the case of the bus journey times ˆσ = 5.09, this is most easily 22

3 found by using the σ n 1 or s n 1 button on a calculator. Alternatively the formula ( ˆσ 2 x x) 2 = ( n 1 ) or equivalent can be used. For a 95% confidence interval, σ known, the formula x ±1. 96 σ n was used. Now the known standard deviation σ will be replaced by an estimate ˆσ. There will be some uncertainty in this estimate since, if you started again and took a different sample of the same size, the estimate of ˆσ would almost certainly be different. It therefore seems reasonable that to allow for this extra uncertainty, the interval should be widened by increasing the figure of 1.96 which came from tables of the normal distribution. How much you need to increase it by has fortunately been calculated for you and is tabulated in tables of the t distribution. The required value of t will, however, depend on the sample size. If you had a random sample of 1000 bus journeys then you would be able to make a very accurate estimate of the population standard deviation and there would be no real need to change the figure However, in this case with a sample of 6 there is quite a lot of uncertainty in the estimate and you may need to make quite a large change. If the sample had been of size 2 there would have been a huge amount of uncertainty and a very large change may have been necessary. The amount of uncertainty is measured by the degrees of freedom. Degrees of freedom are a concept related to the mathematical definition of the chi-squared distribution. For your purposes they may be thought of as a measure of the number of pieces of information you have to estimate the standard deviation. It is impossible to use a sample of size one to make an estimate of the standard deviation. The standard deviation is a measure of spread and one item can tell nothing about how spread out a distribution is. A sample of size two gives one piece of information about the spread and a sample of size three gives two pieces. In general, a sample of size n gives n 1 pieces of information and an estimate made from such a sample is said to be based on n 1 degrees of freedom. In the example there were 6 bus journey times and so the estimate of 5.09 for the standard deviation is based on 5 degrees of freedom. To find a 95% confidence interval you therefore require the upper and lower tails of the t distribution with 5 degrees of freedom, denoted t 5. As with the standard normal distribution, the t distribution is symmetrical about zero and the required values are ± t 5 23

4 A 95% confidence interval for the mean, making no assumption about the standard deviation, is given by 25.5 ± i.e ± 5.34 or ( 20.2, 30.8) 6 Note: For the use of the t distribution to be valid the data must be normally distributed. However, small deviations from normality will not seriously affect the results. In general, the formula is x ± t n 1 ˆσ n Example The resistances (in ohms) of a random sample from a batch of resistors were Assuming that the sample is from a normal distribution calculate (i) a 95% confidence interval for the mean, (ii) a 90% confidence interval for the mean. Solution The data gives x = and ˆσ = (i) t 6, = 2.447, so the 95% confidence interval for the mean is given by ± t ± 43.8 (ii) ( 2330, 2417 ). t 6, 0.05 =1.943, giving 90% confidence interval for the mean as ± t ± 34.8 ( 2339, 2408 ). 24

5 Activity 1 The following data are random observations from a normal distribution with mean Starting at any point in the table take a sample of size 3 and calculate ˆσ. Calculate an 80% confidence interval for the mean using the t-distribution. Now take the next sample of 3 and repeat the calculation. Carry on until you have calculated at least 20, and preferably more, intervals. If possible work with a group so that the labour of calculation may be divided up between you. What proportion of your intervals contain 10, the population mean? Is this approximately the proportion you would have expected? 25

6 Recalculate the intervals using z-values, i.e. calculate, for each sample x ±1.282 ˆσ3. What proportion of these intervals contain 10? Which set of intervals gave results more in line with your expectations? Exercise 2A 1. Samples of a high temperature lubricant were tested and the temperature ( C) at which they ceased to be effective were as follows: Calculate a 95% confidence for the mean. 2. In a study aimed at improving the design of bus cabs the functional arm reach of a random sample of bus drivers was measured. The results, in mm, were as follows: 701, 642, 651, 700, 672, 674, 656, 649 Calculate a 95% confidence interval for the mean. 3. As part of a research study on pattern recognition a random sample of students on a design course were asked to examine a picture and see if they could recognise a word. The picture contained the word 'technology' written backwards. The times, in seconds, taken to recognise the word were as follows: 55, 28, 79, 54, 87, 61, 62, 68, 38 Calculate (a) a 95% confidence interval for the mean, (b) a 99% confidence interval for the mean. 2.2 Confidence interval for the standard deviation and the variance In Section 2.1 the standard deviation of a population of bus journey times was estimated from a sample of bus journey times. As was stated, the estimate is subject to uncertainty as, if another sample of the same size were taken, a different estimate would almost certainly result. Provided the data comes from a normal distribution it is possible to estimate the population standard deviation with a confidence interval rather than using a single figure (or point estimate). To do this you need to use the fact that for a sample of size n from a normal distribution is distributed as ( x x) 2 σ 2 2 χ n 1 (chi-squared with n 1 degrees of freedom). 26

7 ( x x) 2 is equal to ( n 1)ˆσ 2 and this is usually the easiest way of calculating it. In the case of the bus journey times a sample of size 6 gave ˆσ 2 = = If you find the upper and lower tails of χ 2, there will be ( ) 2 x x a probability of 0.95 that will lie between them. σ A 95% confidence interval for the variance is defined by < σ 2 < < 1 σ 2 < < σ 2 < and this is a 95% confidence interval for the variance. Although the variance has a very important role to play in the mathematical theory of statistics, for practical applications the standard deviation is a much more useful statistic. A 95% confidence interval for the standard deviation is found simply by taking the square root of the two limits, i.e. 3.2 < σ < 12.5 Note that the interval is not symmetrical about the point estimate which was Clearly the interval would not make sense unless it contained the point estimate and this is a useful check on the calculation. However, as the standard deviation cannot be negative but has no upper limit, it would not be expected that the point would lie exactly in the middle. Is it possible to calculate 100% confidence interval for the standard deviation? Example In processing grain in the brewing industry, the percentage extract recovered is measured. A particular brewery introduces a new source of grain and the percentage extract on eleven separate days is as follows: 95.2, 93.1, 93.5, 95.9, 94.0, 92.0, 94.4, 93.2, 95.5, 92.3,

8 (a) Regarding the sample as a random sample from a normal population, calculate (i) (ii) a 90% confidence interval for the population variance, a 90% confidence interval for the population mean. (b) The previous source of grain gave daily percentage extract figures which were normally distributed with mean 94.2 and standard deviation 2.5. A high percentage extract is desirable but the brewery manager also requires as little day to day variation as possible. Without further calculation, compare the two sources of grain. (AEB) Solution For this data n = 11 x = ˆσ = (a) (i) 90% confidence interval for variance is given by 3.94 < σ 2 < < 1 σ 2 < < σ 2 < (ii) 90% confidence interval for mean is given by ± ± ( 93.31, ) t 10 (b) The mean of the previous source of grain was This lies in the middle of the confidence interval calculated for the mean of the new source of grain. There is therefore no evidence that the means differ. The standard deviation of the previous source of grain was 2.5 and hence the variance was = This is above the upper limit of the confidence interval for the variance of the new source of grain. This suggests that the new source gives less variability. Combining these two conclusions suggests that the new source is preferable to the previous source. 28

9 Activity 2 Take a sample of size 6 from the data in Activity 1. Calculate an 80% confidence interval for the standard deviation. Take further samples of size 6 and repeat the calculation at least 20 times. Work in a group, if possible, so that the calculations may be shared. The data in Activity 1 came from a normal distribution with standard deviation 2. What proportion of your intervals contain the value 2? Is this proportion similar to the proportion you would expect? Exercise 2B 1. Using the data in Questions 1, 2 and 3 of Exercise 2A, calculate 95% confidence intervals for the population standard deviations. 2. The external diameter, in cm, of a random sample of piston rings produced on a particular machine were 9.91, 9.89, 10.12, 9.98, 10.09, 9.81, 10.01, 9.99, 9.86 Calculate a 95% confidence interval for the standard deviation. Assume normal distribution. Do your results support the manufacturer's claim that the standard deviation is 0.06 cm? 3. The vitamin C content of a random sample of 5 lemons was measured. The results in 'mg per 10 g' were 1.04, 0.95, 0.63, 1.62, 1.11 Assuming a normal distribution calculate a 95% confidence interval for the standard deviation. A greengrocer claimed that the method of determining the vitamin C content was extremely unreliable and that the observed variability was more due to errors in the determination rather than to actual differences between lemons. To check this 7 independent determinations were made of the vitamin C content of the same lemon. The results were as follows 1.21, 1.22, 1.21, 1.23, 1.24, 1.23, 1.22 Assuming a normal distribution, calculate a 90% confidence interval for the standard deviation of the determinations. Does your result support the greengrocer's claim? 2.3 Confidence interval for proportions An insurance company receives a large number of claims for storm damage. A manager wished to estimate the proportion of these claims which were for less than 500. He examined a random sample of 120 claims and found that 18 of them were for less than 500. Obviously the best estimate of the proportion of all claims which are for less than 500 is =

10 To calculate a confidence interval for the proportion we must observe that, r, the number of claims for less than 500 in a sample of size n, will be an observation from a binomial distribution. This will have mean np and variance np ( 1 p ). From Statistics you know that a binomial distribution can be approximated by a normal distribution if n is large and np is reasonably large. In this case n = 120 and p is estimated by Hence r can be approximated by a normal distribution with variance = 15.3 and standard deviation 15.3 = The proportion of claims for less than 500 is estimated by r n. This has variance np( 1 p) n 2 = p( 1 p) n. In this case the proportion can be estimated by a normal distribution with variance = and standard deviation Hence an approximate 95% confidence interval for the proportion is given by 0.15 ± ± ( 0.086, ). In general, the formula for an approximate confidence interval for a proportion is ˆp ± z ˆp ( 1 ˆp ) n where ˆp is an estimate of p and z is the appropriate value from normal tables to give the required percentage confidence interval. Note this formula is valid only if the parameters of the binomial distribution make it reasonable to use the normal approximation. 30

11 Example Employees of a firm carrying out motorway maintenance are issued with brightly coloured waterproof jackets. These come in five different sizes numbered 1 to 5. The last 40 jackets issued were of the following sizes (a) (i) Find the proportion in the sample requiring size 3. Assuming the 40 employees can be regarded as a random sample of all employees, calculate an approximate 95% confidence interval for the proportion, p, of all employees requiring size 3. (ii) Give two reasons why the confidence interval calculated in (i) is approximate rather than exact. (b) Your estimate of p is ˆp. (i) (ii) What percentage is associated with the approximate confidence interval ˆp ± 0.1? How large a sample would be needed to obtain an approximate 95% confidence interval of the form ˆp ± 0.1? (AEB) Solution (a) (i) There are 15 out of 40 requiring size 3, a proportion of 15 = An approximate 95% confidence interval 40 is given by ± ± ( 0.225, ) (ii) The confidence interval is approximate because an estimate of p is used (the true value is unknown) and because the normal distribution is used as an approximation to the binomial distribution. (b) (i) Confidence interval is given by ˆp ± z ˆp ( 1 ˆp ). n Hence if interval is ˆp ± 0.1, 0.1 = z

12 i.e. z = 1.306; this is ( ) 100 = 81 per cent confidence interval (ii) 95% confidence interval ˆp ± n If this is ˆp ± 0.1, = n n = n = Sample of size 90 needed. Exercise 2C 1. When a random sample of 80 climbing ropes were subjected to a strain equivalent to the weight of ten climbers, 12 of them broke. Calculate a 95% confidence interval for the proportion of ropes which would break under this strain. 2. Data from a completed questionnaire were entered into a computer as a series of binary digits (i.e. each digit was 0 or 1). A check on 1000 digits revealed errors in 19 of them. Assuming the probability of an error is the same for each digit entered, calculate a 90% confidence interval for the proportion of digits where an error will be made. 3. A large civil engineering firm issues every new employee with a safety helmet. Five different sizes are available numbered 1 to 5. A random sample of 90 employees required the following sizes Confidence interval for the mean of a Poisson distribution In an area of moorland, plants of a certain variety are known to be distributed at random at a constant average rate; that is, they are distributed according to the Poisson distribution. A biologist counts the number of plants in a randomly chosen square of area 10 m 2 and finds 142 plants. An approximate confidence interval for the average number of plants in a square Calculate an approximate 90% confidence interval for the proportion of employees requiring size 2. (AEB) 32

13 of area 10 m 2 can be found by approximating the Poisson distribution by a normal distribution with mean 142 and standard deviation 142. This approximation is valid since the mean is large. In this case a 95% approximate confidence interval is given by 142 ± ± 23.4 ( 118.6, ). In general the formula is m ± z m where m is the observed value and z is the appropriate value from normal tables to give the required percentage confidence interval. Remember this is only valid if the mean is reasonably large so that the normal distribution gives a good approximation. If a confidence interval for the mean number of plants per m 2 was required, the interval calculated for 10 m 2 would be divided by 10. The result would be 14.2 ± 2.34 or ( 11.86, 16.54). The calculation could have been carried out by finding the mean number of plants observed in 10 areas of 1m 2 and basing the calculation on this. However, there would be no advantage in this as it would give the identical answer. For example in this case, if 10 separate m 2 had been observed, the mean number of plants in each square would be 14.2 and the distribution would be approximated by normal mean 14.2 standard deviation Since you now have the mean of 10 observations the calculation for a 95% interval would be 14.2 ± ± 2.34, as before. It is probably always easiest to work in terms of the total number of events observed and then scale the final answer as required. Why is a confidence interval calculated for the mean of a Poisson distribution only an approximation? 33

14 Exercise 2D 1. Cars pass a point on a motorway during the morning rush hour at random at a constant average rate. An observer counts 212 cars passing during a 5 minute interval. Calculate (a) a 95% confidence interval for the mean number of cars passing in a 5 minute interval, (b) a 95% confidence interval for the mean number of cars passing in a one minute interval, (c) a 90% confidence interval for the mean number of cars passing in a two minute interval, (d) a 99% confidence interval for the mean number of cars passing in an hour. 2. The number of times a machine needs resetting on a night shift follows a Poisson distribution. On three randomly selected nights it was reset 9, 5 and 11 times. Calculate a 95% confidence interval for the average number of times it needs resetting per night. 3. The number of a certain type of organism suspended in a liquid follows a Poisson distribution. 10 cc of the liquid are found to contain 35 of the organisms. Calculate (a) a 90% confidence interval for the mean number of organisms per 10 cc, (b) a 95% confidence interval for the mean number of organisms per cc, (c) a 99% confidence interval for the mean number of organisms per 100 cc. A further 10 cc of the liquid were examined and found to contain 26 of the organisms. Modify your answers to (a), (b) and (c) to take account of this additional data. 2.5 Miscellaneous Exercises 1. The development engineer of a company making razors records the time it takes him to shave, on seven mornings, using a standard razor made by the company. The times, in seconds, were 217, 210, 254, 237, 232, 228, 243 Assuming that this may be regarded as a random sample from a normal distribution, calculate a 95% confidence interval for the mean. (AEB) 2. A car insurance company found that the average amount it was paying on bodywork claims was 435 with a standard deviation of 141. The next eight bodywork claims were subjected to extra investigation before payment was agreed. The payments, in pounds, on these claims were 48, 109, 237, 192, 403, 98, 264, 68. (a) Assuming the data can be regarded as a random sample from a normal distribution, calculate a 90% confidence interval for (i) the mean payment after extra investigation, (ii) the standard deviation of the payments after extra investigation. (b) Explain to the manager whether or not your results suggest that the distribution of payments has changed after special investigation and comment on her suggestion that in future all claims over 900 should be subject to special investigation. (AEB) 3. A car manufacturer purchases large quantities of a particular component. Tests have shown that 3% fail to function and that, of those that do function, the mean working life is 2400 hourswith a standard deviation of 650 hours. The manufacturer is particularly concerned with the large variability and the supplier undertakes to improve the design so that the standard deviation is reduced to 300 hours. (a) A random sample of 310 of the new components contained 12 which failed to function. Calculate an approximate 95% confidence interval for the proportion which fail to function. (b) A random sample of 3 new components tested had working lives of 2730, 3120 and 2300 hours. 34

### 4. Continuous Random Variables, the Pareto and Normal Distributions

4. Continuous Random Variables, the Pareto and Normal Distributions A continuous random variable X can take any value in a given range (e.g. height, weight, age). The distribution of a continuous random

### Module 2 Probability and Statistics

Module 2 Probability and Statistics BASIC CONCEPTS Multiple Choice Identify the choice that best completes the statement or answers the question. 1. The standard deviation of a standard normal distribution

### Exploratory Data Analysis

Exploratory Data Analysis Johannes Schauer johannes.schauer@tugraz.at Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Introduction

### 39.2. The Normal Approximation to the Binomial Distribution. Introduction. Prerequisites. Learning Outcomes

The Normal Approximation to the Binomial Distribution 39.2 Introduction We have already seen that the Poisson distribution can be used to approximate the binomial distribution for large values of n and

### The Standard Normal distribution

The Standard Normal distribution 21.2 Introduction Mass-produced items should conform to a specification. Usually, a mean is aimed for but due to random errors in the production process we set a tolerance

### 5.1 Identifying the Target Parameter

University of California, Davis Department of Statistics Summer Session II Statistics 13 August 20, 2012 Date of latest update: August 20 Lecture 5: Estimation with Confidence intervals 5.1 Identifying

### Lecture 8. Confidence intervals and the central limit theorem

Lecture 8. Confidence intervals and the central limit theorem Mathematical Statistics and Discrete Mathematics November 25th, 2015 1 / 15 Central limit theorem Let X 1, X 2,... X n be a random sample of

### " Y. Notation and Equations for Regression Lecture 11/4. Notation:

Notation: Notation and Equations for Regression Lecture 11/4 m: The number of predictor variables in a regression Xi: One of multiple predictor variables. The subscript i represents any number from 1 through

### Simple Random Sampling

Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for

### 3.4 Statistical inference for 2 populations based on two samples

3.4 Statistical inference for 2 populations based on two samples Tests for a difference between two population means The first sample will be denoted as X 1, X 2,..., X m. The second sample will be denoted

### 1. How different is the t distribution from the normal?

Statistics 101 106 Lecture 7 (20 October 98) c David Pollard Page 1 Read M&M 7.1 and 7.2, ignoring starred parts. Reread M&M 3.2. The effects of estimated variances on normal approximations. t-distributions.

### Confidence Intervals for Cp

Chapter 296 Confidence Intervals for Cp Introduction This routine calculates the sample size needed to obtain a specified width of a Cp confidence interval at a stated confidence level. Cp is a process

### Binomial random variables

Binomial and Poisson Random Variables Solutions STAT-UB.0103 Statistics for Business Control and Regression Models Binomial random variables 1. A certain coin has a 5% of landing heads, and a 75% chance

### Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY

Biostatistics: DESCRIPTIVE STATISTICS: 2, VARIABILITY 1. Introduction Besides arriving at an appropriate expression of an average or consensus value for observations of a population, it is important to

### Point and Interval Estimates

Point and Interval Estimates Suppose we want to estimate a parameter, such as p or µ, based on a finite sample of data. There are two main methods: 1. Point estimate: Summarize the sample by a single number

### Confidence Intervals for the Difference Between Two Means

Chapter 47 Confidence Intervals for the Difference Between Two Means Introduction This procedure calculates the sample size necessary to achieve a specified distance from the difference in sample means

### Week 4: Standard Error and Confidence Intervals

Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

### HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

### 6.4 Normal Distribution

Contents 6.4 Normal Distribution....................... 381 6.4.1 Characteristics of the Normal Distribution....... 381 6.4.2 The Standardized Normal Distribution......... 385 6.4.3 Meaning of Areas under

### Tests of Hypotheses Using Statistics

Tests of Hypotheses Using Statistics Adam Massey and Steven J. Miller Mathematics Department Brown University Providence, RI 0292 Abstract We present the various methods of hypothesis testing that one

### 39.2. The Normal Approximation to the Binomial Distribution. Introduction. Prerequisites. Learning Outcomes

The Normal Approximation to the Binomial Distribution 39.2 Introduction We have already seen that the Poisson distribution can be used to approximate the binomial distribution for large values of n and

### HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1 PREVIOUSLY used confidence intervals to answer questions such as... You know that 0.25% of women have red/green color blindness. You conduct a study of men

### Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

### Review #2. Statistics

Review #2 Statistics Find the mean of the given probability distribution. 1) x P(x) 0 0.19 1 0.37 2 0.16 3 0.26 4 0.02 A) 1.64 B) 1.45 C) 1.55 D) 1.74 2) The number of golf balls ordered by customers of

### 7 CONTINUOUS PROBABILITY DISTRIBUTIONS

7 CONTINUOUS PROBABILITY DISTRIBUTIONS Chapter 7 Continuous Probability Distributions Objectives After studying this chapter you should understand the use of continuous probability distributions and the

### Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011

Chicago Booth BUSINESS STATISTICS 41000 Final Exam Fall 2011 Name: Section: I pledge my honor that I have not violated the Honor Code Signature: This exam has 34 pages. You have 3 hours to complete this

### Standard Deviation Estimator

CSS.com Chapter 905 Standard Deviation Estimator Introduction Even though it is not of primary interest, an estimate of the standard deviation (SD) is needed when calculating the power or sample size of

### Estimation and Confidence Intervals

Estimation and Confidence Intervals Fall 2001 Professor Paul Glasserman B6014: Managerial Statistics 403 Uris Hall Properties of Point Estimates 1 We have already encountered two point estimators: th e

### Name: Date: Use the following to answer questions 3-4:

Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

### 12.5: CHI-SQUARE GOODNESS OF FIT TESTS

125: Chi-Square Goodness of Fit Tests CD12-1 125: CHI-SQUARE GOODNESS OF FIT TESTS In this section, the χ 2 distribution is used for testing the goodness of fit of a set of data to a specific probability

### Opgaven Onderzoeksmethoden, Onderdeel Statistiek

Opgaven Onderzoeksmethoden, Onderdeel Statistiek 1. What is the measurement scale of the following variables? a Shoe size b Religion c Car brand d Score in a tennis game e Number of work hours per week

### CHI-SQUARE: TESTING FOR GOODNESS OF FIT

CHI-SQUARE: TESTING FOR GOODNESS OF FIT In the previous chapter we discussed procedures for fitting a hypothesized function to a set of experimental data points. Such procedures involve minimizing a quantity

### MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. A) ±1.88 B) ±1.645 C) ±1.96 D) ±2.

Ch. 6 Confidence Intervals 6.1 Confidence Intervals for the Mean (Large Samples) 1 Find a Critical Value 1) Find the critical value zc that corresponds to a 94% confidence level. A) ±1.88 B) ±1.645 C)

### 99.37, 99.38, 99.38, 99.39, 99.39, 99.39, 99.39, 99.40, 99.41, 99.42 cm

Error Analysis and the Gaussian Distribution In experimental science theory lives or dies based on the results of experimental evidence and thus the analysis of this evidence is a critical part of the

### 6 POISSON DISTRIBUTIONS

6 POISSON DISTRIBUTIONS Chapter 6 Poisson Distributions Objectives After studying this chapter you should be able to recognise when to use the Poisson distribution; be able to apply the Poisson distribution

### Statistical Process Control (SPC) Training Guide

Statistical Process Control (SPC) Training Guide Rev X05, 09/2013 What is data? Data is factual information (as measurements or statistics) used as a basic for reasoning, discussion or calculation. (Merriam-Webster

### CHAPTER 13 SIMPLE LINEAR REGRESSION. Opening Example. Simple Regression. Linear Regression

Opening Example CHAPTER 13 SIMPLE LINEAR REGREION SIMPLE LINEAR REGREION! Simple Regression! Linear Regression Simple Regression Definition A regression model is a mathematical equation that descries the

### Binomial random variables (Review)

Poisson / Empirical Rule Approximations / Hypergeometric Solutions STAT-UB.3 Statistics for Business Control and Regression Models Binomial random variables (Review. Suppose that you are rolling a die

### Confidence Intervals for One Standard Deviation Using Standard Deviation

Chapter 640 Confidence Intervals for One Standard Deviation Using Standard Deviation Introduction This routine calculates the sample size necessary to achieve a specified interval width or distance from

### Practice Midterm Exam #2

The Islamic University of Gaza Faculty of Engineering Department of Civil Engineering 12/12/2009 Statistics and Probability for Engineering Applications 9.2 X is a binomial random variable, show that (

### 1 Simple Linear Regression I Least Squares Estimation

Simple Linear Regression I Least Squares Estimation Textbook Sections: 8. 8.3 Previously, we have worked with a random variable x that comes from a population that is normally distributed with mean µ and

### Binomial Sampling and the Binomial Distribution

Binomial Sampling and the Binomial Distribution Characterized by two mutually exclusive events." Examples: GENERAL: {success or failure} {on or off} {head or tail} {zero or one} BIOLOGY: {dead or alive}

### Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Parkland College A with Honors Projects Honors Program 2014 Calculating P-Values Isela Guerra Parkland College Recommended Citation Guerra, Isela, "Calculating P-Values" (2014). A with Honors Projects.

### CHAPTER 13. Experimental Design and Analysis of Variance

CHAPTER 13 Experimental Design and Analysis of Variance CONTENTS STATISTICS IN PRACTICE: BURKE MARKETING SERVICES, INC. 13.1 AN INTRODUCTION TO EXPERIMENTAL DESIGN AND ANALYSIS OF VARIANCE Data Collection

### Northumberland Knowledge

Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

### Social Studies 201 Notes for November 19, 2003

1 Social Studies 201 Notes for November 19, 2003 Determining sample size for estimation of a population proportion Section 8.6.2, p. 541. As indicated in the notes for November 17, when sample size is

### Chi Square Tests. Chapter 10. 10.1 Introduction

Contents 10 Chi Square Tests 703 10.1 Introduction............................ 703 10.2 The Chi Square Distribution.................. 704 10.3 Goodness of Fit Test....................... 709 10.4 Chi Square

### Math 251, Review Questions for Test 3 Rough Answers

Math 251, Review Questions for Test 3 Rough Answers 1. (Review of some terminology from Section 7.1) In a state with 459,341 voters, a poll of 2300 voters finds that 45 percent support the Republican candidate,

### MINITAB ASSISTANT WHITE PAPER

MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

### Chapter 5: Normal Probability Distributions - Solutions

Chapter 5: Normal Probability Distributions - Solutions Note: All areas and z-scores are approximate. Your answers may vary slightly. 5.2 Normal Distributions: Finding Probabilities If you are given that

### Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Practice problems for Homework 1 - confidence intervals and hypothesis testing. Read sections 10..3 and 10.3 of the text. Solve the practice problems below. Open the Homework Assignment 1 and solve the

### MAT 155. Key Concept. September 27, 2010. 155S5.5_3 Poisson Probability Distributions. Chapter 5 Probability Distributions

MAT 155 Dr. Claude Moore Cape Fear Community College Chapter 5 Probability Distributions 5 1 Review and Preview 5 2 Random Variables 5 3 Binomial Probability Distributions 5 4 Mean, Variance and Standard

### Means, standard deviations and. and standard errors

CHAPTER 4 Means, standard deviations and standard errors 4.1 Introduction Change of units 4.2 Mean, median and mode Coefficient of variation 4.3 Measures of variation 4.4 Calculating the mean and standard

### Regression Analysis: A Complete Example

Regression Analysis: A Complete Example This section works out an example that includes all the topics we have discussed so far in this chapter. A complete example of regression analysis. PhotoDisc, Inc./Getty

### Chapter 2. Hypothesis testing in one population

Chapter 2. Hypothesis testing in one population Contents Introduction, the null and alternative hypotheses Hypothesis testing process Type I and Type II errors, power Test statistic, level of significance

### Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur Module No. #01 Lecture No. #15 Special Distributions-VI Today, I am going to introduce

### 1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

1 Final Review 2 Review 2.1 CI 1-propZint Scenario 1 A TV manufacturer claims in its warranty brochure that in the past not more than 10 percent of its TV sets needed any repair during the first two years

### Math 108 Exam 3 Solutions Spring 00

Math 108 Exam 3 Solutions Spring 00 1. An ecologist studying acid rain takes measurements of the ph in 12 randomly selected Adirondack lakes. The results are as follows: 3.0 6.5 5.0 4.2 5.5 4.7 3.4 6.8

### Name: (b) Find the minimum sample size you should use in order for your estimate to be within 0.03 of p when the confidence level is 95%.

Chapter 7-8 Exam Name: Answer the questions in the spaces provided. If you run out of room, show your work on a separate paper clearly numbered and attached to this exam. Please indicate which program

### Lecture Notes Module 1

Lecture Notes Module 1 Study Populations A study population is a clearly defined collection of people, animals, plants, or objects. In psychological research, a study population usually consists of a specific

### Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

### Numerical Methods for Option Pricing

Chapter 9 Numerical Methods for Option Pricing Equation (8.26) provides a way to evaluate option prices. For some simple options, such as the European call and put options, one can integrate (8.26) directly

### CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Some Continuous Probability Distributions CHAPTER 6: Continuous Uniform Distribution: 6. Definition: The density function of the continuous random variable X on the interval [A, B] is B A A x B f(x; A,

### Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

Chapter 7 Notes - Inference for Single Samples You know already for a large sample, you can invoke the CLT so: X N(µ, ). Also for a large sample, you can replace an unknown σ by s. You know how to do a

### Exact Confidence Intervals

Math 541: Statistical Theory II Instructor: Songfeng Zheng Exact Confidence Intervals Confidence intervals provide an alternative to using an estimator ˆθ when we wish to estimate an unknown parameter

### A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

CHAPTER 5. A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 5.1 Concepts When a number of animals or plots are exposed to a certain treatment, we usually estimate the effect of the treatment

### Permutation Tests for Comparing Two Populations

Permutation Tests for Comparing Two Populations Ferry Butar Butar, Ph.D. Jae-Wan Park Abstract Permutation tests for comparing two populations could be widely used in practice because of flexibility of

### Simple Regression Theory II 2010 Samuel L. Baker

SIMPLE REGRESSION THEORY II 1 Simple Regression Theory II 2010 Samuel L. Baker Assessing how good the regression equation is likely to be Assignment 1A gets into drawing inferences about how close the

### Statistics 104: Section 6!

Page 1 Statistics 104: Section 6! TF: Deirdre (say: Dear-dra) Bloome Email: dbloome@fas.harvard.edu Section Times Thursday 2pm-3pm in SC 109, Thursday 5pm-6pm in SC 705 Office Hours: Thursday 6pm-7pm SC

### Probability Calculator

Chapter 95 Introduction Most statisticians have a set of probability tables that they refer to in doing their statistical wor. This procedure provides you with a set of electronic statistical tables that

### CALCULATIONS & STATISTICS

CALCULATIONS & STATISTICS CALCULATION OF SCORES Conversion of 1-5 scale to 0-100 scores When you look at your report, you will notice that the scores are reported on a 0-100 scale, even though respondents

### Chapter 4 Lecture Notes

Chapter 4 Lecture Notes Random Variables October 27, 2015 1 Section 4.1 Random Variables A random variable is typically a real-valued function defined on the sample space of some experiment. For instance,

### Annex 6 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION. (Version 01.

Page 1 BEST PRACTICE EXAMPLES FOCUSING ON SAMPLE SIZE AND RELIABILITY CALCULATIONS AND SAMPLING FOR VALIDATION/VERIFICATION (Version 01.1) I. Introduction 1. The clean development mechanism (CDM) Executive

### MATH4427 Notebook 2 Spring 2016. 2 MATH4427 Notebook 2 3. 2.1 Definitions and Examples... 3. 2.2 Performance Measures for Estimators...

MATH4427 Notebook 2 Spring 2016 prepared by Professor Jenny Baglivo c Copyright 2009-2016 by Jenny A. Baglivo. All Rights Reserved. Contents 2 MATH4427 Notebook 2 3 2.1 Definitions and Examples...................................

### Chapter 7 - Practice Problems 1

Chapter 7 - Practice Problems 1 SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Provide an appropriate response. 1) Define a point estimate. What is the

### WHERE DOES THE 10% CONDITION COME FROM?

1 WHERE DOES THE 10% CONDITION COME FROM? The text has mentioned The 10% Condition (at least) twice so far: p. 407 Bernoulli trials must be independent. If that assumption is violated, it is still okay

### SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions

SOLUTIONS: 4.1 Probability Distributions and 4.2 Binomial Distributions 1. The following table contains a probability distribution for a random variable X. a. Find the expected value (mean) of X. x 1 2

### Chapter 4. iclicker Question 4.4 Pre-lecture. Part 2. Binomial Distribution. J.C. Wang. iclicker Question 4.4 Pre-lecture

Chapter 4 Part 2. Binomial Distribution J.C. Wang iclicker Question 4.4 Pre-lecture iclicker Question 4.4 Pre-lecture Outline Computing Binomial Probabilities Properties of a Binomial Distribution Computing

Name: University of Chicago Graduate School of Business Business 41000: Business Statistics Special Notes: 1. This is a closed-book exam. You may use an 8 11 piece of paper for the formulas. 2. Throughout

### MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

STT315 Practice Ch 5-7 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Solve the problem. 1) The length of time a traffic signal stays green (nicknamed

### General and statistical principles for certification of RM ISO Guide 35 and Guide 34

General and statistical principles for certification of RM ISO Guide 35 and Guide 34 / REDELAC International Seminar on RM / PT 17 November 2010 Dan Tholen,, M.S. Topics Role of reference materials in

### Lean Six Sigma Analyze Phase Introduction. TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY

TECH 50800 QUALITY and PRODUCTIVITY in INDUSTRY and TECHNOLOGY Before we begin: Turn on the sound on your computer. There is audio to accompany this presentation. Audio will accompany most of the online

### AP Statistics 2002 Scoring Guidelines

AP Statistics 2002 Scoring Guidelines The materials included in these files are intended for use by AP teachers for course and exam preparation in the classroom; permission for any other use must be sought

### THE PROCESS CAPABILITY ANALYSIS - A TOOL FOR PROCESS PERFORMANCE MEASURES AND METRICS - A CASE STUDY

International Journal for Quality Research 8(3) 399-416 ISSN 1800-6450 Yerriswamy Wooluru 1 Swamy D.R. P. Nagesh THE PROCESS CAPABILITY ANALYSIS - A TOOL FOR PROCESS PERFORMANCE MEASURES AND METRICS -

### Chapter 4. Probability Distributions

Chapter 4 Probability Distributions Lesson 4-1/4-2 Random Variable Probability Distributions This chapter will deal the construction of probability distribution. By combining the methods of descriptive

### The Binomial Probability Distribution

The Binomial Probability Distribution MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2015 Objectives After this lesson we will be able to: determine whether a probability

### Introduction to Analysis of Variance (ANOVA) Limitations of the t-test

Introduction to Analysis of Variance (ANOVA) The Structural Model, The Summary Table, and the One- Way ANOVA Limitations of the t-test Although the t-test is commonly used, it has limitations Can only

### 2 Binomial, Poisson, Normal Distribution

2 Binomial, Poisson, Normal Distribution Binomial Distribution ): We are interested in the number of times an event A occurs in n independent trials. In each trial the event A has the same probability

### Engineering Problem Solving and Excel. EGN 1006 Introduction to Engineering

Engineering Problem Solving and Excel EGN 1006 Introduction to Engineering Mathematical Solution Procedures Commonly Used in Engineering Analysis Data Analysis Techniques (Statistics) Curve Fitting techniques

### Using simulation to calculate the NPV of a project

Using simulation to calculate the NPV of a project Marius Holtan Onward Inc. 5/31/2002 Monte Carlo simulation is fast becoming the technology of choice for evaluating and analyzing assets, be it pure financial

### Recall this chart that showed how most of our course would be organized:

Chapter 4 One-Way ANOVA Recall this chart that showed how most of our course would be organized: Explanatory Variable(s) Response Variable Methods Categorical Categorical Contingency Tables Categorical

### Sampling Strategies for Error Rate Estimation and Quality Control

Project Number: JPA0703 Sampling Strategies for Error Rate Estimation and Quality Control A Major Qualifying Project Report Submitted to the faculty of the Worcester Polytechnic Institute in partial fulfillment

### Practice Problems and Exams

Practice Problems and Exams 1 The Islamic University of Gaza Faculty of Commerce Department of Economics and Political Sciences An Introduction to Statistics Course (ECOE 1302) Spring Semester 2009-2010

### Statistical estimation using confidence intervals

0894PP_ch06 15/3/02 11:02 am Page 135 6 Statistical estimation using confidence intervals In Chapter 2, the concept of the central nature and variability of data and the methods by which these two phenomena

### An Introduction to Basic Statistics and Probability

An Introduction to Basic Statistics and Probability Shenek Heyward NCSU An Introduction to Basic Statistics and Probability p. 1/4 Outline Basic probability concepts Conditional probability Discrete Random

### Chapter 7. One-way ANOVA

Chapter 7 One-way ANOVA One-way ANOVA examines equality of population means for a quantitative outcome and a single categorical explanatory variable with any number of levels. The t-test of Chapter 6 looks

### Introduction to Hypothesis Testing

I. Terms, Concepts. Introduction to Hypothesis Testing A. In general, we do not know the true value of population parameters - they must be estimated. However, we do have hypotheses about what the true

### AP Physics 1 and 2 Lab Investigations

AP Physics 1 and 2 Lab Investigations Student Guide to Data Analysis New York, NY. College Board, Advanced Placement, Advanced Placement Program, AP, AP Central, and the acorn logo are registered trademarks