Order Statistics. Lecture 15: Order Statistics. Notation Detour. Order Statistics, cont.



Similar documents
Transformations and Expectations of random variables

Introduction to Probability

Math 431 An Introduction to Probability. Final Exam Solutions

MAS108 Probability I

Lecture 6: Discrete & Continuous Probability and Random Variables

Approximation of Aggregate Losses Using Simulation

Random variables, probability distributions, binomial random variable

Chapter 4 - Lecture 1 Probability Density Functions and Cumul. Distribution Functions

The Exponential Distribution

Lecture Notes 1. Brief Review of Basic Probability

Statistics 100A Homework 4 Solutions

Graphs. Exploratory data analysis. Graphs. Standard forms. A graph is a suitable way of representing data if:

Chapter 9 Monté Carlo Simulation

Random variables P(X = 3) = P(X = 3) = 1 8, P(X = 1) = P(X = 1) = 3 8.

( ) is proportional to ( 10 + x)!2. Calculate the

Lecture 8. Confidence intervals and the central limit theorem

Examples: Joint Densities and Joint Mass Functions Example 1: X and Y are jointly continuous with joint pdf

A Uniform Asymptotic Estimate for Discounted Aggregate Claims with Subexponential Tails

Lecture 3: Continuous distributions, expected value & mean, variance, the normal distribution

0 x = 0.30 x = 1.10 x = 3.05 x = 4.15 x = x = 12. f(x) =

Aggregate Loss Models

What is Statistics? Lecture 1. Introduction and probability review. Idea of parametric inference

University of California, Los Angeles Department of Statistics. Random variables

Background Information on Exponentials and Logarithms

UNIT I: RANDOM VARIABLES PART- A -TWO MARKS

. (3.3) n Note that supremum (3.2) must occur at one of the observed values x i or to the left of x i.

Maximum Likelihood Estimation

Review of Random Variables

Joint Exam 1/P Sample Exam 1

Overview of Monte Carlo Simulation, Probability Review and Introduction to Matlab

Generating Random Numbers Variance Reduction Quasi-Monte Carlo. Simulation Methods. Leonid Kogan. MIT, Sloan , Fall 2010

Definition: Suppose that two random variables, either continuous or discrete, X and Y have joint density

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

Lecture 8: More Continuous Random Variables

Dongfeng Li. Autumn 2010

INSURANCE RISK THEORY (Problems)

M1 in Economics and Economics and Statistics Applied multivariate Analysis - Big data analytics Worksheet 1 - Bootstrap

ECE302 Spring 2006 HW5 Solutions February 21,

Questions and Answers

A note on exact moments of order statistics from exponentiated log-logistic distribution

b g is the future lifetime random variable.

Binomial Random Variables

Core Maths C2. Revision Notes

**BEGINNING OF EXAMINATION** The annual number of claims for an insured has probability function: , 0 < q < 1.

STAT 830 Convergence in Distribution

Master s Theory Exam Spring 2006

Probability, Mean and Median

6.041/6.431 Spring 2008 Quiz 2 Wednesday, April 16, 7:30-9:30 PM. SOLUTIONS

The Kelly Betting System for Favorable Games.

Exponential Distribution

Practice problems for Homework 11 - Point Estimation

COMPLETE MARKETS DO NOT ALLOW FREE CASH FLOW STREAMS

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Substitute 4 for x in the function, Simplify.

RNA-seq. Quantification and Differential Expression. Genomics: Lecture #12

Finance 400 A. Penati - G. Pennacchi Market Micro-Structure: Notes on the Kyle Model

Stats on the TI 83 and TI 84 Calculator

Principle of Data Reduction

Example: 1. You have observed that the number of hits to your web site follow a Poisson distribution at a rate of 2 per day.

E190Q Lecture 5 Autonomous Robot Navigation

Probability and Statistics Prof. Dr. Somesh Kumar Department of Mathematics Indian Institute of Technology, Kharagpur

STA 256: Statistics and Probability I

5. Continuous Random Variables

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

MATH4427 Notebook 2 Spring MATH4427 Notebook Definitions and Examples Performance Measures for Estimators...

Lecture 7: Continuous Random Variables

seven Statistical Analysis with Excel chapter OVERVIEW CHAPTER

Math 151. Rumbos Spring Solutions to Assignment #22

Nonparametric adaptive age replacement with a one-cycle criterion

Tail inequalities for order statistics of log-concave vectors and applications

NIST GCR Probabilistic Models for Directionless Wind Speeds in Hurricanes

Permanents, Order Statistics, Outliers, and Robustness

Statistics for Analysis of Experimental Data

Lecture 4: BK inequality 27th August and 6th September, 2007

Chapter 4: Statistical Hypothesis Testing

3. The Economics of Insurance

An Introduction to Extreme Value Theory

Probability Theory. Florian Herzog. A random variable is neither random nor variable. Gian-Carlo Rota, M.I.T..

Implicit Differentiation

3.4. The Binomial Probability Distribution. Copyright Cengage Learning. All rights reserved.

SF2940: Probability theory Lecture 8: Multivariate Normal Distribution

For a partition B 1,..., B n, where B i B j = for i. A = (A B 1 ) (A B 2 ),..., (A B n ) and thus. P (A) = P (A B i ) = P (A B i )P (B i )

MBA 611 STATISTICS AND QUANTITATIVE METHODS

MATH 10: Elementary Statistics and Probability Chapter 5: Continuous Random Variables

Semi-Supervised Support Vector Machines and Application to Spam Filtering

Sections 2.11 and 5.8

Math/Stats 342: Solutions to Homework

1 Prior Probability and Posterior Probability

Lecture Notes ApSc 3115/6115: Engineering Analysis III

Continuous Random Variables

8 9 +

The Monte Carlo Framework, Examples from Finance and Generating Correlated Random Variables

2WB05 Simulation Lecture 8: Generating random variables

STT315 Chapter 4 Random Variables & Probability Distributions KM. Chapter 4.5, 6, 8 Probability Distributions for Continuous Random Variables

PSTAT 120B Probability and Statistics

Transcription:

Lecture 5: Statistics 4 Let X, X 2, X 3, X 4, X 5 be iid random variables with a distribution F with a range of a, b). We can relabel these X s such that their labels correspond to arranging them in increasing order so that X ) X 2) X 3) X 4) X 5) Colin Rundel X) X2) X3) X4) X5) March 4, 22 a X5 X X4 X2 X3 b In the case where the distribution F is continuous we can make the stronger statement that X ) < X 2) < X 3) < X 4) < X 5) Since PX i X j ) for all i j for continuous random variables. Statistics 4 Colin Rundel) Lecture 5 March 4, 22 / 24, cont. For X, X 2,..., X n iid random variables X k is the kth smallest X, usually called the kth order statistic. X ) is therefore the smallest X and X ) minx,..., X n ) Notation Detour For a continuous random variable we can see that f )ɛ P X + ɛ) PX [, + ɛ]) lim f )ɛ lim PX [, + ɛ]) ɛ ɛ f ) lim ɛ PX [, + ɛ])/ɛ Similarly, X n) is the largest X and X n) max,..., X n ) P apple X apple + ) P X 2 [, + ]) f) f + ) + Statistics 4 Colin Rundel) Lecture 5 March 4, 22 2 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 3 / 24

Density of the maimum Density of the minimum For X, X 2,..., X n iid continuous random variables with pdf f and cdf F the density of the maimum is For X, X 2,..., X n iid continuous random variables with pdf f and cdf F the density of the minimum is PX n) [, + ɛ]) Pone of the X s [, + ɛ] and all others < ) n PX i [, + ɛ] and all others < ) i npx [, + ɛ] and all others < ) npx [, + ɛ])pall others < ) npx [, + ɛ])px 2 < ) PX n < ) nf )ɛf ) n PX ) [, + ɛ]) Pone of the X s [, + ɛ] and all others > ) n PX i [, + ɛ] and all others > ) i npx [, + ɛ] and all others > ) npx [, + ɛ])pall others > ) npx [, + ɛ])px 2 > ) PX n > ) nf )ɛ F )) n f n) ) nf )F ) n f ) ) nf ) F )) n Statistics 4 Colin Rundel) Lecture 5 March 4, 22 4 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 5 / 24 Density of the kth Order Statistic Cumulative Distribution of the min and ma For X, X 2,..., X n iid continuous random variables with pdf f and cdf F the density of the kth order statistic is PX k) [, + ɛ]) Pone of the X s [, + ɛ] and eactly k of the others < ) n PX i [, + ɛ] and eactly k of the others < ) i npx [, + ɛ] and eactly k of the others < ) npx [, + ɛ])peactly k of the others < ) ) n npx [, + ɛ]) )PX < ) k PX > ) n k k ) n f k) ) nf ) F ) k F )) n k k Statistics 4 Colin Rundel) Lecture 5 March 4, 22 6 / 24 For X, X 2,..., X n iid continuous random variables with pdf f and cdf F the density of the kth order statistic is F ) ) PX ) < ) PX ) > ) PX >,..., X n > ) PX > ) PX n > ) F )) n F n) ) PX n) < ) PX n) > ) ) PX <,..., X n < ) PX < ) PX n < ) n nf )ɛ F ) k F ) F n )) n k k f ) ) d d F ))n n F )) n df ) nf ) F )) n d f n) ) d d F )n nf ) n df ) nf )F ) n d Statistics 4 Colin Rundel) Lecture 5 March 4, 22 7 / 24

Order Statistic of Standard Uniforms Beta Distribution Let X, X 2,..., X n iid Unif, ) then the density of Xn) is given by ) n f k) ) nf ) F ) k F )) n k k n n k ) k ) n k if < < otherwise This is an eample of the Beta distribution where r k and s n k +. X k) Betak, n k + ) The Beta distribution is a continuous distribution defined on the range, ) where the density is given by f ) Br, s) r ) s where Br, s) is called the Beta function and it is a normalizing constant which ensures the density integrates to. Br, s) f )d Br, s) Br, s) r ) s d r ) s d r ) s d Statistics 4 Colin Rundel) Lecture 5 March 4, 22 8 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 9 / 24 Beta Function Beta Function - Epectation The connection between the Beta distribution and the kth order statistic of n standard Uniform random variables allows us to simplify the Beta function. Br, s) r ) s d Bk, n k + ) n ) n k k )!n k + )! nn )! r )!n k)! n! r )!s )! Γr)Γs) r + s )! Γr + s) Let X Betar, s) then EX ) Br, s) r ) s d, r+) ) s d Br, s) Br +, s) Br, s) r!s )! r + s )! r + s)! r )!s )! r! r + s )! r )! r + s)! r r + s Statistics 4 Colin Rundel) Lecture 5 March 4, 22 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 / 24

Beta Function - Variance Beta Distribution - Summary Let X Betar, s) then If X Betar, s) then EX 2 ) 2 Br, s) r ) s d Br + 2, s) Br, s) r + )r r + s + )r + s) VarX ) EX 2 ) EX ) 2 r + )!s )! r + s + )! r + )r r + s + )r + s) r 2 r + s) 2 r + )rr + s) r 2 r + s + ) r + s + )r + s) 2 rs r + s + )r + s) 2 r + s )! r )!s )! f ) Br, s) r ) s F ) Br, s) B r, s) Br, s) r ) s d B r, s) Br, s) r ) s d r ) s d EX ) r r + s rs VarX ) r + s) 2 r + s + ) r )!s )! r + s )! Γr)Γs) Γr + s) Statistics 4 Colin Rundel) Lecture 5 March 4, 22 2 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 3 / 24 Minimum of Eponentials Maimum of Eponentials Let X, X 2,..., X n iid Epλ), we previously derived a more general result where the X s were not identically distributed and showed that minx,..., X n ) Epλ + + λ n ) Epnλ) in this more restricted case. Lets confirm that result using our new more general methods f ) ) nf ) F )) n n λe λ) [ e λ ] ) n iid Let X, X 2,..., X n Epλ) then the density of Xn) is given by f n) ) nf )F ) n n λe λ) e λ) n Which we can t do much with, instead we can try the cdf of the maimum. nλe λ e λ ] ) n nλ e λ ] nλe nλ ) n Which is the density for Epnλ). Statistics 4 Colin Rundel) Lecture 5 March 4, 22 4 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 5 / 24

Maimum of Eponentials, cont. Limit Distributions of Maima and Minima Let X, X 2,..., X n iid Epλ) then the cdf of Xn) is given by Previous we have shown that F n) ) F ) n e λ) n ne λ n F n) ) ep ne λ ) ) n lim F n)) lim n n ep ne λ ) This result is not unique to the eponential distribution... When n tends to infinity we get F ) ) PX ) < ) F )) n F n) ) PX n) < ) F ) n lim F )) lim F n n ))n lim F n)) lim F n n )n if F ) if F ) < if F ) if F ) > Statistics 4 Colin Rundel) Lecture 5 March 4, 22 6 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 7 / 24 Limit Distributions of Maima and Minima, cont. Maimum of Eponentials, cont. These results show that the limit distributions are degenerate as they only take values of or. To avoid the degeneracy we would like to use a simple transform that such that the limit distributions are not degenerate. Let s consider simple linear transformations lim F n)a n + b n ) lim F a n + b n ) n F ) n n lim F )c n + d n ) lim F c n + d n )) n F ) n n Xn) a n F n) a n + b n ) PX n) < a n + b n ) P b n ) < ) X) c n F ) c n + d n ) PX ) < c n + d n ) P < d n Let X, X 2,..., X n iid Epλ) and an logn)/λ, b n /λ then the cdf of X n) is given by F n) a n + b n ) F logn) + )/λ) n e λlogn)+)/λ) n e logn) e ) n e /n ) n lim F n)a n + b n ) ep e ) n This is known as the standard Gumbel distribution. Statistics 4 Colin Rundel) Lecture 5 March 4, 22 8 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 9 / 24

Gumbel Distribution Let X Gumbel, ) then Maimum of Eponentials, cont. Let X, X 2,..., X n iid Epλ) and an logn)/λ, b n /λ then if n is large we can use the Standard Gumbel to calculate properties of X n). Standard Gumbel Distribution - pdf Standard Gumbel Distribution - cdf f)...2.3 F)..2.4.6.8. MedianX n) ) m n) PX n) < m n) ) /2 ) Xn) a n P < m G /2 b n PX n) < a n + b n m G ) /2-5 5 F ) ep e ) f ) e ep e ) -5 5 EX ) π/ 6 MedianX ) loglog2)) m n) a n + b n m G λ log n log log 2 λ Statistics 4 Colin Rundel) Lecture 5 March 4, 22 2 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 2 / 24 Maimum of Eponentials, Eample Maimum of Uniforms In 29 Usain Bolt broke the world record in the meters with a time of 9.58 seconds in Berlin, Germany. If we imagine that the running speed in m/s of competitive sprinters is given by an Eponential distribution with λ. How many sprinters would need to run to have a 5/5 chance of beating Usian Bolt s record having a faster running speed)? Let X Ep) then we need to find n such that MedianX n) ) /9.58 Let X, X 2,..., X n iid Unif, ) and an, b n /n then the cdf of X n) is given by F n) ) F ) n n F n) a n + b n ) F a n + b n ) n lim F n)a n + b n ) e n + /n) n This is eample of the Reverse Weibul distribution. Statistics 4 Colin Rundel) Lecture 5 March 4, 22 22 / 24 Statistics 4 Colin Rundel) Lecture 5 March 4, 22 23 / 24

Maimum of Paretos This is a distribution we have not seen yet, but is useful for describing many physical processes. It s key feature is that it has long tails meaning it goes to slower than a distribution like the normal. Let X, X 2,..., X iid n Paretoα, k) and a n, b n kn /α then the cdf of X is ) k α if k, F X ) otherwise The cdf of X n) is then given by F n) ) F ) n F n) a n + b n ) F a n + b n ) n lim F n)a n + b n ) e α n ) α ) n k k kn /α ) α ) n α n ) n This is eample of the Fréchet distribution. Statistics 4 Colin Rundel) Lecture 5 March 4, 22 24 / 24