Exact Confidence Intervals



Similar documents
Maximum Likelihood Estimation

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Lecture 8. Confidence intervals and the central limit theorem

Department of Mathematics, Indian Institute of Technology, Kharagpur Assignment 2-3, Probability and Statistics, March Due:-March 25, 2015.

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

1 Sufficient statistics

3.4 The Normal Distribution

1 Prior Probability and Posterior Probability

Practice problems for Homework 11 - Point Estimation

4. Continuous Random Variables, the Pareto and Normal Distributions

12.5: CHI-SQUARE GOODNESS OF FIT TESTS

Confidence Intervals for Exponential Reliability


Confidence Intervals for Cp

Math 461 Fall 2006 Test 2 Solutions

Tests of Hypotheses Using Statistics

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

1 Inner Products and Norms on Real Vector Spaces

Dongfeng Li. Autumn 2010

Chapter 7 Notes - Inference for Single Samples. You know already for a large sample, you can invoke the CLT so:

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

Final Mathematics 5010, Section 1, Fall 2004 Instructor: D.A. Levin

Estimation and Confidence Intervals

MATH 10: Elementary Statistics and Probability Chapter 7: The Central Limit Theorem

Stat 5102 Notes: Nonparametric Tests and. confidence interval

Chi Square Tests. Chapter Introduction

Section 5.1 Continuous Random Variables: Introduction

Errata and updates for ASM Exam C/Exam 4 Manual (Sixteenth Edition) sorted by page

Brownian Motion and Stochastic Flow Systems. J.M Harrison

Stats Review Chapters 9-10

STAT 830 Convergence in Distribution

The Standard Normal distribution

Chapter 2. Hypothesis testing in one population

Joint Exam 1/P Sample Exam 1

TImath.com. F Distributions. Statistics

Chapter 4. Probability and Probability Distributions

Basics of Statistical Machine Learning

CHAPTER 6: Continuous Uniform Distribution: 6.1. Definition: The density function of the continuous random variable X on the interval [A, B] is.

Principle of Data Reduction

A LOGNORMAL MODEL FOR INSURANCE CLAIMS DATA

Chapter 4 - Lecture 1 Probability Density Functions and Cumul. Distribution Functions

Unit 26 Estimation with Confidence Intervals

Normal distribution. ) 2 /2σ. 2π σ

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

Chapter 3 RANDOM VARIATE GENERATION

Notes on Continuous Random Variables

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

SAMPLE SIZE CONSIDERATIONS

Normal Distribution. Definition A continuous random variable has a normal distribution if its probability density. f ( y ) = 1.

Chapter 5: Normal Probability Distributions - Solutions

Hypothesis Testing for Beginners

5. Continuous Random Variables

5.1 Identifying the Target Parameter

Practice with Proofs

Standard Deviation Estimator

Week 2: Exponential Functions

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

Lecture 8: More Continuous Random Variables

MATH 425, PRACTICE FINAL EXAM SOLUTIONS.

The Normal Distribution. Alan T. Arnholt Department of Mathematical Sciences Appalachian State University

Confidence Intervals for the Difference Between Two Means

Statistics 100A Homework 8 Solutions

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

Multivariate normal distribution and testing for means (see MKB Ch 3)

How To Solve A Sequential Mca Problem

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

The Normal distribution

Master s Theory Exam Spring 2006

VISUALIZATION OF DENSITY FUNCTIONS WITH GEOGEBRA

1.1 Introduction, and Review of Probability Theory Random Variable, Range, Types of Random Variables CDF, PDF, Quantiles...

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Is a Brownian motion skew?

Confidence Intervals for Cpk

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Calculating P-Values. Parkland College. Isela Guerra Parkland College. Recommended Citation

Key Concept. Density Curve

INSURANCE RISK THEORY (Problems)

Reliability and Risk Analysis. Analysis of Model Tasks from Selected Application Areas Using a Computer (software R, Excel)

Properties of Future Lifetime Distributions and Estimation

Math 425 (Fall 08) Solutions Midterm 2 November 6, 2008

2WB05 Simulation Lecture 8: Generating random variables

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 18. A Brief Introduction to Continuous Probability

HYPOTHESIS TESTING (ONE SAMPLE) - CHAPTER 7 1. used confidence intervals to answer questions such as...

Math 151. Rumbos Spring Solutions to Assignment #22

CHAPTER II THE LIMIT OF A SEQUENCE OF NUMBERS DEFINITION OF THE NUMBER e.

Efficiency and the Cramér-Rao Inequality

Quadratic forms Cochran s theorem, degrees of freedom, and all that

Department of Civil Engineering-I.I.T. Delhi CEL 899: Environmental Risk Assessment Statistics and Probability Example Part 1

Schooling, Political Participation, and the Economy. (Online Supplementary Appendix: Not for Publication)

Nonparametric adaptive age replacement with a one-cycle criterion

Gamma Distribution Fitting

Continuous Random Variables

Probability Theory. Florian Herzog. A random variable is neither random nor variable. Gian-Carlo Rota, M.I.T..

Sales forecasting # 2

No: Bilkent University. Monotonic Extension. Farhad Husseinov. Discussion Papers. Department of Economics

6.4 Normal Distribution

Aggregate Loss Models

Confidence Intervals for One Standard Deviation Using Standard Deviation

Transcription:

Math 541: Statistical Theory II Instructor: Songfeng Zheng Exact Confidence Intervals Confidence intervals provide an alternative to using an estimator ˆθ when we wish to estimate an unknown parameter θ. We can find an interval A, B) that we think has high probability of containing θ. The length of such an interval gives us an idea of how closely we can estimate θ. In some situations, we can find the mathematical formula for the sampling distribution of the estimator, and we can further make use of this sampling distribution to get a confidence interval for the parameter. The confidence interval obtained in this case are called exact confidence intervals. To find an exact confidence interval, one need to know the distribution of the population to find out the sampling distribution of the statistic used to estimate the parameter. Once you know the sampling distribution of the statistic, you can construct the interval. Suppose we have a random sample X 1,, X n from a population distribution, and the parameter of interest is θ. Given a value α 0, 1), we want to construct a 1 α confidence interval. Usually α takes values 0.01, 0.0, 0.05. The general tricks to construct an exact confidence interval for θ is: 1. Find a variable that is a function of the data and of the parameter. Call this function h, and denote it as hx 1,, X n, θ).. The distribution of this newly created variable should not depend on the parameter. 3. Using the distribution of hx 1,, X n, θ), let a and b be the values of h such that the probability that h is between these values is 1 α, i.e. P a h b). Then we manipulate the relation so that we can find a lower and upper bound within which the parameter could be contained, i.e. P a hx 1,, X n, θ) b) = 1 α = P LX 1,, X n ) θ UX 1,, X n )) = 1 α Several Notes: A function that satisfies conditions 1) and ) is called a pivot. For example, if we have a random sample X 1,, X n from a normal distribution Nµ, σ ) where µ is unknown 1

but σ is known. We define function h to be hx 1,, X n, µ) = X µ σ/ n, then the function h depends on both the data and the unknown parameter µ, and the distribution of h is N0, 1) which does not depend on µ, therefore it is a pivot. Please note that pivot is not a statistic, because pivot contains the unknown parameter, but statistic cannot contain any unknown parameter. We use statistic to estimate a parameter, therefore a statistic must be computable from the sample data, and this is why statistic cannot contain unknown parameter. On the other hand, in constructing confidence interval, we want to manipulate the pivot to get an interval about the unknown parameter, so a pivot must contain the unknown parameter. In the third step above, when choosing a and b, such that P a h b), we want the interval length b a as small as possible. The shorter the interval, the more precise it is. Usually when the distribution of h is symmetric, obviously the interval [a, b] should be symmetric as well. Example 1: Suppose X 1,, X n from a normal distribution Nµ, σ ) where µ is unknown but σ is known. Find a 1 α confidence interval for µ. Solution: We usually use X = X 1+ +X n to estimate the parameter µ, and, as the above n shows, we define the pivot as and therefore where zα) is defined as ) α P z hx 1,, X n, µ) = X µ σ/ n hx 1,, X n, µ) N0, 1) X µ 1 σ/ n z α ) ) zα) φx)dx = α and φx) is the density function of standard normal distribution. Because normal distribution is symmetric about 0, we have ) α z = z 1 α ) and z 1 α ) X µ 1 σ/ n z α ) X z 1 α ) σ n µ X + z 1 α ) σ n

3 so, P { X z 1 α ) σ n µ X + z 1 α ) } σ n That is, the 1 α confidence interval for µ is [ X z 1 α ) σ n, X + z 1 α ) ] σ n Example : Suppose X 1,, X n from a normal distribution Nµ, σ ) where µ is known and σ is unknown. Find a 1 α confidence intervals for σ. Solution: We use σ = 1 n ni=1 X i µ) to estimate σ, and we define the pivot to be h = σ It was shown that h follows chi-square distribution with n degrees of freedom. Let χ nα/) and χ n1 α/) be the α/) 100-th and 1 α/) 100-th percentiles, respectively. Then P χ nα/) χ n1 α/) σ therefore P χ n1 α/) σ χ nα/) So the 1 α confidence interval for σ is χ n1 α/), χ nα/) Note this interval is not symmetric, because the chi-square distribution is not symmetric. From the above two examples, we can see that the general procedure for getting a pivot in exact confidence interval is as following: find an estimator for θ; build a connection between the estimator and the parameter, usually this will give us some functions involving both the parameter and the estimator; among the many candidates obtained in last step, we choose the one which will give us standard distribution as the pivot.

4 Example 3: Suppose X 1,, X n from a normal distribution Nµ, σ ) where both µ and σ are unknown. Find a 1 α confidence intervals for µ and σ. Solution: The MLE or Method of moment estimate for µ and σ are ˆµ = X, σ = 1 n X i X) n i=1 Define function nx µ) h 1 = S where S = 1 n X i X) n 1 i=1 then h 1 is a pivot and h 1 t n 1, and t n 1 stands for t distribution with n 1 degrees of freedom. Let t n 1 α/) and t n 1 1 α/) be the α/) 100-th and 1 α/) 100-th percentiles, respectively. Then P t n 1 α/) h 1 t n 1 1 α/)) Since t distribution is symmetric about 0, t n 1 α/) = t n 1 1 α/), therefore the above inequality reads ) nx µ) P t n 1 1 α/) t n 1 1 α/) S The inequality can be manipulated to yield X t n 1 1 α/) n S µ X + t n 1 1 α/) n S ) P Therefore the 1 α confidence interval for µ is [X t n 1 1 α/) S n, X + t n 1 1 α/) S n ] and the probability of µ lies in the interval is 1 α. This interval is symmetric about X. Now let us turn to a confidence interval for σ. We define h = σ then h is a pivot and h χ n 1, where χ n 1 is the chi-square distribution with n 1 degrees of freedom. Let χ n 1α/) and χ n 11 α/) be the α/) 100-th and 1 α/) 100-th percentiles, respectively. Then P χ n 1α/) χ n 11 α/) σ

5 therefore, after manipulation, we have P χ n 11 α/) σ χ n 1α/) So the 1 α confidence interval for σ is χ n 11 α/), χ n 1α/) Of course, this can also be written as [ n 1)S χ n 11 α/) ] n 1)S, χ n 1α/) Example 4: confidence interval for the parameter λ of an exponential. A theoretical model suggests that the time to breakdown of an insulating fluid between electrodes at a particular voltage has an exponential distribution with parameter λ. A random sample of n = 10 breakdown times yields the following sample data in minutes): 41.53, 18.73,.99, 30.34, 1.33, 117.5, 73.0, 3.63, 4.00, 6.78. We want to obtain a 95% confidence interval for λ and the average breakdown time µ = 1/λ. Solution: First let us prove that if X follows an exponential distribution with parameter λ, then Y = λx follows an exponential distribution with parameter 1/, i.e. χ. The density function for X is fx λ) = λe λx if x > 0 and 0 otherwise. It is easy to see the density function for Y is gy) = 1 e y/ for y > 0, and gy) = 0 otherwise. Therefore Y has an exponential distribution with parameter 1/, i.e. chi-square distribution with degree of freedom. Now, let us first find a pivot, define n n hx 1,, X n, λ) = λ X i = Y i i=1 i=1 and each Y i = λx i follows χ distribution, and they are independent. Therefore h follows χ n distribution. Let χ nα/) and χ n1 α/) be the α/) 100-th and 1 α/) 100-th percentiles, respectively. Then ) n P χ nα/) λ X i χ n1 α/) therefore, after manipulation, we have ) χ P n α/) n λ χ n1 α/) i=1 X i n i=1 X i i=1

6 So the 1 α confidence interval for σ is [ ] χ n α/) n, χ n1 α/) i=1 X i n i=1 X i In this problem, n = 0, α = 0.05, look up the table, we have χ 00.975) = 34.17, and χ 00.05) = 9.59, and 10 i=1 x i = 550.87. Inserting these numbers and we have the 95% confidence interval for λ is 0.00871, 0.03101). If we want to get a 95% confidence interval for the mean µ = 1/λ, we have ni=1 X i P χ n1 α/) µ ni=1 ) X i χ nα/) and in this problem, the interval is calculated to be 34.4, 114.87). As we can see in the definition of confidence interval, the general procedure to get a confidence interval, and from the above examples, confidence intervals are of the form [LX 1,, X n ), UX 1,, X n )], where L and U are two statistics, therefore are random variables since X 1,, X n are random variables). It follows that the confidences intervals are random intervals. Suppose now we are interested in the 1 α confidence intervals for a parameter θ. In different experiments, we will observe different values for the random variables X 1,, X n, and therefore we will have different confidence intervals in different experiments. If we repeat the experiment over and over, among all the resulting confidence intervals, there will be around 100 1 α) percent contain the value of the parameter of interest θ. This is the explanation of the meaning of confidence interval. Exercises: Page 415: 3, 4, 7 Hint to problem 3 on page 415: First express the length of the interval, L, and then find the distribution of L, then express EL ) in terms of σ. The answers to these questions are expression of σ.