SECOND PART, LECTURE 4: CONFIDENCE INTERVALS

Similar documents
Chapter 4: Statistical Hypothesis Testing

Hypothesis Testing for Beginners

Confidence Intervals for Spearman s Rank Correlation

Sample Size and Power in Clinical Trials

Non Parametric Inference

The Variability of P-Values. Summary

Hypothesis testing. c 2014, Jeffrey S. Simonoff 1

Hypothesis testing - Steps

CONTENTS OF DAY 2. II. Why Random Sampling is Important 9 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

Confidence Intervals for One Standard Deviation Using Standard Deviation

BA 275 Review Problems - Week 6 (10/30/06-11/3/06) CD Lessons: 53, 54, 55, 56 Textbook: pp , ,

The Basics of Graphical Models

Online 12 - Sections 9.1 and 9.2-Doug Ensley

An Introduction to Basic Statistics and Probability

Stat 411/511 THE RANDOMIZATION TEST. Charlotte Wickham. stat511.cwick.co.nz. Oct

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Introduction to Hypothesis Testing OPRE 6301

Basic Concepts in Research and Data Analysis

Lecture 2. Marginal Functions, Average Functions, Elasticity, the Marginal Principle, and Constrained Optimization

Maximum Likelihood Estimation

Mind on Statistics. Chapter 12

Comparison of frequentist and Bayesian inference. Class 20, 18.05, Spring 2014 Jeremy Orloff and Jonathan Bloom

Good luck! BUSINESS STATISTICS FINAL EXAM INSTRUCTIONS. Name:

Chapter 2. Hypothesis testing in one population

Practice problems for Homework 12 - confidence intervals and hypothesis testing. Open the Homework Assignment 12 and solve the problems.

Basics of Statistical Machine Learning

HYPOTHESIS TESTING: POWER OF THE TEST

Confidence Intervals for Cpk

Arkansas Tech University MATH 4033: Elementary Modern Algebra Dr. Marcel B. Finan

Point and Interval Estimates

Exact Nonparametric Tests for Comparing Means - A Personal Summary

1 Prior Probability and Posterior Probability

5.1 Identifying the Target Parameter

Multivariate Normal Distribution

SAS Certificate Applied Statistics and SAS Programming

Inference on the parameters of the Weibull distribution using records

Principle of Data Reduction

Hypothesis Testing. Hypothesis Testing

Statistical tests for SPSS

Confidence Intervals for Cp

Discrete Mathematics and Probability Theory Fall 2009 Satish Rao, David Tse Note 10

Master s Theory Exam Spring 2006

Lecture 7: Continuous Random Variables

Math/Stats 425 Introduction to Probability. 1. Uncertainty and the axioms of probability

Experimental Design. Power and Sample Size Determination. Proportions. Proportions. Confidence Interval for p. The Binomial Test

MA651 Topology. Lecture 6. Separation Axioms.

Using Excel for inferential statistics

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

Simple Linear Regression Inference

Exact Confidence Intervals

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Basic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 12: June 22, Abstract. Review session.

Interpreting GRE Scores: Reliability and Validity

Dongfeng Li. Autumn 2010

Statistical Rules of Thumb

Stochastic Inventory Control

Multivariate normal distribution and testing for means (see MKB Ch 3)

Odds ratio, Odds ratio test for independence, chi-squared statistic.

C. The null hypothesis is not rejected when the alternative hypothesis is true. A. population parameters.

In the general population of 0 to 4-year-olds, the annual incidence of asthma is 1.4%

Stat 5102 Notes: Nonparametric Tests and. confidence interval

2.1 Complexity Classes

Theory versus Experiment. Prof. Jorgen D Hondt Vrije Universiteit Brussel jodhondt@vub.ac.be

SIMULATION STUDIES IN STATISTICS WHAT IS A SIMULATION STUDY, AND WHY DO ONE? What is a (Monte Carlo) simulation study, and why do one?

Fairfield Public Schools

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

STAT 350 Practice Final Exam Solution (Spring 2015)

1 Sufficient statistics

Bayesian Analysis for the Social Sciences

Spatial Statistics Chapter 3 Basics of areal data and areal data modeling

E3: PROBABILITY AND STATISTICS lecture notes

Confidence Intervals for Exponential Reliability

Notes on metric spaces

Understanding Basic Calculus

Introduction. Hypothesis Testing. Hypothesis Testing. Significance Testing

Point Biserial Correlation Tests

What are confidence intervals and p-values?

Non-Inferiority Tests for One Mean

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

From the help desk: Bootstrapped standard errors

Fuzzy Probability Distributions in Bayesian Analysis

Linear Maps. Isaiah Lankham, Bruno Nachtergaele, Anne Schilling (February 5, 2007)

Unit 26 Estimation with Confidence Intervals

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

CHAPTER 2 Estimating Probabilities

LEARNING OBJECTIVES FOR THIS CHAPTER

QUANTITATIVE METHODS BIOLOGY FINAL HONOUR SCHOOL NON-PARAMETRIC TESTS

A POPULATION MEAN, CONFIDENCE INTERVALS AND HYPOTHESIS TESTING

How To Check For Differences In The One Way Anova

Continued Fractions and the Euclidean Algorithm

Introduction to Hypothesis Testing

23. RATIONAL EXPONENTS

Planning sample size for randomized evaluations

INDIRECT INFERENCE (prepared for: The New Palgrave Dictionary of Economics, Second Edition)

Econometrics Simple Linear Regression

A Basic Introduction to Missing Data

Colored Hats and Logic Puzzles

Transcription:

Massimo Guidolin Massimo.Guidolin@unibocconi.it Dept. of Finance STATISTICS/ECONOMETRICS PREP COURSE PROF. MASSIMO GUIDOLIN SECOND PART, LECTURE 4: CONFIDENCE INTERVALS Lecture 4: Confidence Intervals Prof. Guidolin

OVERVIEW 1) General notion sample (random) intervals 2) Coverage and confidence of intervals 3) Finding confidence intervals by inversion of (non-) rejection regions of tests 4) A few examples a.a. 13/14 p. 2 Lecture 4: Confidence Intervals Prof. Guidolin

FROM POINT TO SET (INTERVAL) INFERENCES Point estimation of a parameter boils down inference to a guess of a single value as the value of In most situations, it seems more plausible to provide a guess in the form of an interval Inference in a set problem is the statement that " C" where C and C = C(x) is determined by the value of the data If is real-valued, then we usually prefer the set estimate C to be an interval Definition: An interval estimate of a real-valued parameter is any pair of functions, L(X 1,..., X n ) and U(X 1,..., X n ) of a sample that satisfy L(X) U(X); if x is observed, the inference L(x) U(x) is made and the random interval [L(x), U(x)] is called an interval estimator Although in the majority of cases we will work with finite values for a.a. 13/14 p. 3 Lecture 4: Confidence Intervals Prof. Guidolin

FROM POINT TO SET (INTERVAL) INFERENCES L and U, there is sometimes interest in one-sided interval estimates In the definition, using an open or closed interval, [L(x), U(x)], is immaterial What is the gain from using interval estimators? On the one hand, we have given up precision, so there must be some gain The answer is that we benefit from some confidence or assurance as to our assertions concerning an estimate Let s stop and think: we know that the sample mean is MLE and as such unbiased, consistent, asymptotically efficient, etc. Great, but what is Pr(sample mean = )? The answer is Pr(sample mean = ) = 0! Therefore one needs to broaden the interval in order to obtain a stronger endorsement, e.g., in the case of an IID N(, 2 ) population a.a. 13/14 p. 4 Lecture 4: Confidence Intervals Prof. Guidolin

COVERAGE AND CONFIDENCE OF INTERVALS Sets or intervals? In general one estimates confidence sets, but in finance most of the time one looks for intervals, CIs What confidence or assurance? Definition [COVERAGE]: For an interval estimator [L(X), U(X)] of a parameter, the coverage probability is the probability that the random interval [L(X), U(X)] covers the true parameter,, or Pr( [L(X), U(X)]) Definition [CONFIDENCE]: For an interval estimator [L(X), U(X)] of a parameter, the confidence coefficient is the infimum of the coverage probabilities, inf Pr( [L(X), U(X)]) CIs are for the random sample, not for the parameter : when we write statements such as Pr( [L(X), U(X)]), these probability statements refer to X, not Re-write Pr( [L(X), U(X)]) as Pr(L(X) ) together with Pr(U(X) ) a.a. 13/14 p. 5 Lecture 4: Confidence Intervals Prof. Guidolin

METHODS TO FIND CONFIDENCE INTERVALS Since we do not know the true value of, we can only guarantee a coverage probability equal to the infimum, the confidence coefficient Now the good news: There is a strong correspondence between hypothesis testing and interval estimation: in general, every confidence set corresponds to a test and vice versa So you almost, almost, do not need to separately study this... Example: Let X 1,..., X n be IID N(, 2 ) and consider testing H 0 : = o versus H 1 : o with known. For a fixed level, a reasonable test (in fact, the MPU) has rejection region which implies that the null is not rejected for sample points with Since the test has size, this means that Pr(H o is not rejected = 0 ) = 1 -. Combining this with the characterization, we can write a.a. 13/14 p. 6 Lecture 4: Confidence Intervals Prof. Guidolin

METHODS TO FIND CONFIDENCE INTERVALS This interval is obtained by inverting the (complement of the) rejection region of the level test There is no guarantee that the confidence set obtained by test inversion will be an interval In most cases, however, one-sided tests give one-sided intervals, twosided tests give two-sided intervals, strange-shaped rejection regions give strange-shaped confidence sets Example: As another example, we show how do you go about inverting an LRT. Suppose we want a CI for the mean,, of an exponential( ) population, with PDF f(x; )=(1/ )exp(- x/ ). We can obtain such an interval by inverting a size test of H o : = 0 versus H 1 : 0. If we take a random sample X 1,..., X n, the LRT statistic is given by For fixed 0, the non-rejection region is given by a.a. 13/14 p. 7 Lecture 4: Confidence Intervals Prof. Guidolin

METHODS TO FIND CONFIDENCE INTERVALS where k* is a constant chosen to satisfy Pr{x not reject} = 1- Inverting this region gives the 1- set The expression defining C(x) depends on x only through the sample mean, so the CI can be expressed in the form At this point L and U can be determined by imposing that Pr{x not reject} = 1- and for which the solution is numerical a.a. 13/14 p. 8 Lecture 4: Confidence Intervals Prof. Guidolin

TERMINOLOGY ISSUES AND BAYESIAN CIS Notice that it is the CI that covers the parameter, not the opposite; in particular, our claim is not really about the probability of the parameter falling inside a CI This is a key remark because the random quantity, the sample statistic, is the interval, not the parameter For instance, suppose a 90% CI for some parameter is [0.24, 0.45] It is tempting to say (and many experimenters do) that "the probability is 90% that is in the interval [0.24, 0.45] However, such a statement is invalid since the parameter is assumed fixed and such it cannot go anywhere, it cannot fall or rise... Formally, the interval [0.24, 0.45] is one of the possible realized values of the random CI for and, since the parameter does not move, is in the realized interval [0.24, 0.45] with probability 0 or 1 That the realized interval [0.24, 0.45] has a 90% chance of coverage, we only mean that we know that 90% of the sample points of the a.a. 13/14 p. 9 Lecture 4: Confidence Intervals Prof. Guidolin

TERMINOLOGY ISSUES AND BAYESIAN CIS random interval cover the true but unknown parameter Coverage probability reflects the uncertainty in the sampling procedure and a classical assertion of (1- )% coverage means that in a long sequence of identical trials, (1- )% of the realized confidence sets will cover the true parameter In contrast, the Bayesian setup allows us to say that is inside [0.24, 0.45] with some probability, not 0 or 1 This is because, under the Bayesian model, is a random variable with a probability distribution To keep the distinction between Bayesian and classical sets clear, Bayesian sets are referred to as credible sets rather than CIs Going Bayesian or not? There is no one best way to do statistics; some problems are best solved with classical statistics and some (...) with Bayesian statistics a.a. 13/14 p. 10 Lecture 4: Confidence Intervals Prof. Guidolin

USEFUL NOTIONS REVIEWED IN THIS LECTURE Let me give you a list to follow up to: Definition of random confidence interval/set Coverage and confidence of a confidence interval Terminology issue what a CI really means a.a. 13/14 p. 11 Lecture 4: Confidence Intervals Prof. Guidolin