Continuous Probability Distributions (120 Points)

Economics : Statistics for Economics Problem Set 5 - Suggested Solutions University of Notre Dame Instructor: Julio Garín Spring 22 Continuous Probability Distributions 2 Points. A management consulting firm conducted a study of service time at the drive-up window at McDonalds. They found that the average time between placing an order and receiving the order was 2.7 minutes. Note that X expµ, µ 2 so that fx µ e x µ with µ 2.7. a What is the variance of the waiting time? V arx σ 2 µ 2 2.7 2 7.7 b What is the probability that a customer s waiting time is more than 5 minutes? P x 5 P x 5 F 5 5 e 5 2.7.655 2.7 e x 2.7 dx c What is the probability that a customer s waiting time is less than 2.7 minutes? Another way is to use the definition of CDF for an exponential distribution that is given by F x e x µ : P x 2.7 F 2.7 e 2.7 2.7 e.62 2. Eric s weekly expenditures at North Dining Hall are normally distributed with a mean of $ and a standard deviation of $. Eric wants to budget an amount each week he can spend at the cafeteria. a How much should Eric budget if he wants to spend less than the budgeted amount 9 percent of the time? Let X be the amount spent by Eric, X N,. P x budget.9 P z budget µ σ From the table, Φ.25.9 budget µ σ.25, which solving for budget gives us 92.5. The latter is the value such that 9% of the time Eric will spend less than that.

. Let z N,. Calculate, a P z. P z P x P x Φ Φ.5.57.4 b P 2.5 z. c P.75 z.4. P 2.5 z P x P x 2.5 Φ Φ 2.5.5.62.494 P.75 z.4 P x.4 P x.75 d Find z such that P z z.695 Φ.4 Φ.75.492.4.9 P z z P z z Φz.695 Φz.5 From the table: Φ.5.5 z.5. 4. Your classmates Luke and Shawn are decent golfers but they are always bragging about their ability to hit the monster drive. Just last week, Shawn claimed that his drives routinely go 295 yards. The average drive for a professional player on the PGA tour travels 272 yards with a standard deviation of yards. We have that X N 272, 64 a If the distance of a professional drive is normally distributed, what fraction of drives exceed 295 yards? What can you said about Luke and Shawn s statements? P x 295 P x 295 295 272 P z Φ2..2 If Shawn s statement is true, he would be in the top.2% of PGA golfers in terms of average drive. Shawn and Luke are probably exaggerating about their skills. 2

b Again, assuming normality, how long would a drive have to be in the top 5 percent of drives hit on the professional tour? We want to find the value α such that P x α.5. P x α P x α.5 P z α 272 Therefore, P z α 272.95 α 272 Φ From the table, Φ.64.95 α 272.64 α 25.2. 5. Delta airlines quotes a flight time of 2 hours, 5 minutes for its flights from Cincinnati to Tampa. Suppose we believe that actual flight times are uniformly distributed between 2 and 2. hours 2 hours 2 minutes. Let X be the time flight for this route. a Graph the PDF of X. If X is uniformly distributed, its PDF is given by fx. Hence, in this particular b a case, { fx 2 2 x 4 otherwise fx /2 2 4 x b What is the probability that the flight will be no more than 5 minutes late? P x 2 2 dx 2 2.5

or, using the definition of the Uniform CDF: F x x a b a 2 4 2.5 c What is the expected flight time and what is the standard deviation of flight times? Recall that the expected value is defined as: Ex µ xfxdx So we have, Ex 4 2 x x 2 d x 4 42 2 2 Another way will be to use the definition we saw in class, Similarly, for the variance Ex a + b 2 2 + 4 2 b a2 V arx 2 4 22 2. σ. 5.77 6. Most publications that describe colleges and universities report the 25th and 75th percentile of SAT scores for the entering students. Suppose that the SAT scores for entering students at ND are normally distributed with a mean of 44 and a standard deviation of 2. a What are the 25th and 75th percentile SAT scores? Let X be the SAT score, then X N 44, 2 2 P x Π 75.75 P z Π 75 44 2 Π From the table, Φ.67.75. Hence, 75 44 2.67 and solving for the percentile we have Π 75 52.4 52. Since the normal distribution is symmetric, the 25th and 75th percentiles have to be the same distance from the mean. Therefore, Π 25 44 52 44 6. 4

b The 25th and 75th percentile of SAT scores for incoming freshman at UVA are 22 and 44, respectively. If SAT scores for these students are normally distributed, what is the mean and standard deviation of SAT scores at UVA? We know that the 25th and 75th percentiles are the same distance from the mean, therefore their average should be equal to the expected value. That is, 22 + 44 µ UV A 2 Now we are ready to calculate the standard deviation. P x 44.75 P.75 z 44 σ UV A From the table, Φ.67.75, 44 σ UV A.67. Solving the latter for the standard deviation, we have that σ UV A 64.2. Therefore, if X is the r.v. that represents the SAT scores for incoming freshman students at UVA we can describe it as X N, 64.2 2. 7. The proportion of time Y that an industrial robot is in operation during a 4-hour week is a random variable with probability density function { 2y for y fy elsewhere a Find EY and VarY. EY yfy dy y2y dy 2 Similarly, 2 V ary EY 2 [EX] 2 2 2 y 2 y dy 2 4 2 2 2 2 4 4 9 5

b For the robot under study, the profit X for a week is given by X 2Y 6. Find EX and VarX. EX E2Y 6 E2Y E6 2EY E6 2 2 6 22 V arx V ar2y 6 V ar2y V ar6 2 2 V ary 4 2 9 c Find an interval in which the profit should lie for at least 75% of the weeks that the robot is in use. Here we can use Chebyshev s Theorem: P x µ kσ P µ kσ x µ + kσ k 2 And, here we would like an interval with 75% of probability of occurring, so that k 2.75 which implies that k 2. Now that we know the number of standard deviations, we can calculate the numeric bounds for the interval. P µ 2σ x µ + 2σ P 2.94 x 67.64.75 Therefore, X 2.94, 67.64 at least 75% of the weeks in which the robot is in use.. The SAT and ACT college entrance exams are taken by thousands of students each year. The mathematics portions of each of these exams produce scores that are approximately normally distributed. In recent years SAT mathematics exam scores have averaged 4 with standard deviation. The average and standard deviation for ACT mathematics scores are and 6, respectively. a An engineering school sets 55 as the minimum SAT math score for new students. What percentage of students will score below 55 in a typical year? 6

Let X be the SAT score. Hence, X N 4, P x 55 P z P Z.7 Φ.7.75 55 4 Therefore, 75.% of students will score below 55 in a typical year. b What score should the engineering school set as a comparable standard on the ACT math test? Let X be the ACT score so that X N, 6 We want to find α such that P x α.75 and We know that.75 z α 6 Φ.7 This implies that α 6.7 which, after solving for α, gives us α 22.2. 9. As a measure of intelligence, mice are timed when going through a maze to reach a reward of food. The time in seconds required for any mouse is a random variable Y with a density function given by fy { b y 2 y b elsewhere where b is the minimum possible time needed to traverse the maze a Show that fy has the properties of a denstiy function. i. fy y. Since b is time, it is always greater than, so b/y 2 >. ii. fy dy So, b dy b y2 y b Find F y. b F y y b y b ft dt b b t 2 dt b y t b b y + b b b y 7

c Find P Y > b + c for a positive constant c. P Y > b + c P Y b + c F b + c We have then that F y b/y, so F b + c b/b + c. Hence, P Y > b + c F b + c b b + c b b + c d If c and d are both positive constants such that d > c, find P Y > b + d Y > b + c. Here we should use the definition of conditional probability, i.e.. P A B P A B P B P [Y > b + d Y > b + c] P Y > b + d Y > b + c P Y > b + c P Y > b + d P Y > b + c Where the last step follows from the fact that d > c so that the only way of obtaining, simultaneously, Y > b + d and Y > b + c is if Y > b + d. P Y < b + d P Y > b + d Y > b + c P Y > b + c F Y < b + d F Y > b + c b b+d b b+c b + c b + d. Suppose that X has a density function { kx x x fy elsewhere a Find the value of k that makes fx a probability density function. kx x dx k x dx x 2 dx [ k 2 ] k 2 6 k For fx to be a proper PDF, it must be that k 6. fx dx. That is, /6k, so

b Find P.4 X. P.4 x 6x x 2 dx.4 [ 6 2.42 ].4 6..64 c Find P X <.4 X.. P X <.4 X. d Find the 95th percentile of X. P X.4 P X. Π95 P.4 X P X..64. 6x x 2 dx.64.96.9 P x Π 95.95 6x x 2 dx 6 2 Π 95 2 Π 95 Π 95 2 2Π 95.95 Therefore, we have to solve the equation of third degree Π 95 2 2Π 95.95 whose only value between [, ] is equal to.6465. Therefore, the 95th percentile is given by Π 95.6465 e Find a value of x so that P X < x.95. Since Y is continuous, P X < x P X x, so x.6465 from part d. f Compare the values obtained in part d e. Explain the relationship between these two values. The values are exactly the same because in a continuous distribution, the probability of obtaining a particular number is equal to zero the mass of the distribution at any given point is zero. 9

. Show that the maximum value of the normal density with parameters µ and σ is /σ 2π and occurs when x µ. Hint: You are just maximizing a function. fx σ x µ 2 2π e σ 2 To make the problem a little easier, first take the natural log of the function sometimes is easier to work with a monotonically increasing transformation of the original function: ln [fx] ln ln σ 2π σ 2π x µ2 σ 2 x2 2xµ + µ 2 σ 2 Now maximize the function by taking the first derivative w.r.t. x and setting this equal to zero: ln [fx] x 2x 2µ σ2 We know this will gives as a maximum since the second order condition is negative, i.e. 2 ln[fx] 2 x 2 2/σ 2 < ; therefore, solving the first order condition w.r.t. x give us x µ at the maximum, which completes the proof. 2. Explain intuitively: a What happens to the first and second moments of a variable when every observation its doubled? By doubling each observation we are also increasing the dispersion by the same amount, therefore the standard deviation doubles or, in other words, the variance increase by four. Similarly, the mean is multiplied by two. The new variable is Y 2X. b What happens to the first and second moments of a variable when we add a to each observation? Adding to each observation shift the distribution, without affecting its dispersion. Therefore, the mean willincrease by but the variance remains the same. In this case the new variable Y + X. c What happens to the first and second moments of a variable when, after multiplying each observation by two, we add a to each value? By multiplying by two we had that the mean is doubled while the variance is multiplied by four and, by adding, only the mean increases by the same amount. Now the new variable can be written as Y + 2X. Recall that lna b lna + lnb and lne x x.