Basic Measure Theory. 1.1 Classes of Sets



Similar documents
How To Find Out How To Calculate A Premeasure On A Set Of Two-Dimensional Algebra

Extension of measure

Mathematics for Econometrics, Fourth Edition

CHAPTER II THE LIMIT OF A SEQUENCE OF NUMBERS DEFINITION OF THE NUMBER e.

Basic Concepts of Point Set Topology Notes for OU course Math 4853 Spring 2011

Mathematical Methods of Engineering Analysis

SOLUTIONS TO ASSIGNMENT 1 MATH 576

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 5 9/17/2008 RANDOM VARIABLES

Lecture Notes on Measure Theory and Functional Analysis

MA651 Topology. Lecture 6. Separation Axioms.

1. Prove that the empty set is a subset of every set.

FUNCTIONAL ANALYSIS LECTURE NOTES: QUOTIENT SPACES

Metric Spaces Joseph Muscat 2003 (Last revised May 2009)

University of Miskolc

DISINTEGRATION OF MEASURES

An example of a computable

Metric Spaces. Chapter 1

The Henstock-Kurzweil-Stieltjes type integral for real functions on a fractal subset of the real line

1 if 1 x 0 1 if 0 x 1

The Ideal Class Group

Probability: Theory and Examples. Rick Durrett. Edition 4.1, April 21, 2013

Elements of probability theory

Lebesgue Measure on R n

No: Bilkent University. Monotonic Extension. Farhad Husseinov. Discussion Papers. Department of Economics

E3: PROBABILITY AND STATISTICS lecture notes

MEASURE AND INTEGRATION. Dietmar A. Salamon ETH Zürich

SOLUTIONS TO EXERCISES FOR. MATHEMATICS 205A Part 3. Spaces with special properties

Mathematics Course 111: Algebra I Part IV: Vector Spaces

CHAPTER 1 BASIC TOPOLOGY

Follow links for Class Use and other Permissions. For more information send to:

1 Sets and Set Notation.

LECTURE NOTES IN MEASURE THEORY. Christer Borell Matematik Chalmers och Göteborgs universitet Göteborg (Version: January 12)

Basic Probability Concepts

INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS

I. GROUPS: BASIC DEFINITIONS AND EXAMPLES

Metric Spaces. Chapter Metrics

Gambling Systems and Multiplication-Invariant Measures

Introduction to Topology

PUTNAM TRAINING POLYNOMIALS. Exercises 1. Find a polynomial with integral coefficients whose zeros include

and s n (x) f(x) for all x and s.t. s n is measurable if f is. REAL ANALYSIS Measures. A (positive) measure on a measurable space

ALMOST COMMON PRIORS 1. INTRODUCTION

9 More on differentiation

Dedekind s forgotten axiom and why we should teach it (and why we shouldn t teach mathematical induction in our calculus classes)

Point Set Topology. A. Topological Spaces and Continuous Maps

So let us begin our quest to find the holy grail of real analysis.

INTEGRAL OPERATORS ON THE PRODUCT OF C(K) SPACES

The sample space for a pair of die rolls is the set. The sample space for a random number between 0 and 1 is the interval [0, 1].

Probability Theory. Florian Herzog. A random variable is neither random nor variable. Gian-Carlo Rota, M.I.T..

Notes on metric spaces

EMBEDDING COUNTABLE PARTIAL ORDERINGS IN THE DEGREES

2.3 Convex Constrained Optimization Problems

Undergraduate Notes in Mathematics. Arkansas Tech University Department of Mathematics

Logic, Algebra and Truth Degrees Siena. A characterization of rst order rational Pavelka's logic

GENERIC COMPUTABILITY, TURING DEGREES, AND ASYMPTOTIC DENSITY

BANACH AND HILBERT SPACE REVIEW

RANDOM INTERVAL HOMEOMORPHISMS. MICHA L MISIUREWICZ Indiana University Purdue University Indianapolis

Convex analysis and profit/cost/support functions

The Banach-Tarski Paradox

Stationary random graphs on Z with prescribed iid degrees and finite mean connections

k, then n = p2α 1 1 pα k

1 Norms and Vector Spaces

Separation Properties for Locally Convex Cones

Chapter 4, Arithmetic in F [x] Polynomial arithmetic and the division algorithm.

MATH 304 Linear Algebra Lecture 9: Subspaces of vector spaces (continued). Span. Spanning set.

How To Prove The Dirichlet Unit Theorem

Duality of linear conic problems

Continued Fractions and the Euclidean Algorithm

Mathematics for Computer Science/Software Engineering. Notes for the course MSM1F3 Dr. R. A. Wilson

INTRODUCTORY SET THEORY

How To Know If A Domain Is Unique In An Octempo (Euclidean) Or Not (Ecl)

THE BANACH CONTRACTION PRINCIPLE. Contents

n k=1 k=0 1/k! = e. Example 6.4. The series 1/k 2 converges in R. Indeed, if s n = n then k=1 1/k, then s 2n s n = 1 n

God created the integers and the rest is the work of man. (Leopold Kronecker, in an after-dinner speech at a conference, Berlin, 1886)

CS 103X: Discrete Structures Homework Assignment 3 Solutions

Rate of growth of D-frequently hypercyclic functions

DEGREES OF ORDERS ON TORSION-FREE ABELIAN GROUPS

Quotient Rings and Field Extensions

WHAT ARE MATHEMATICAL PROOFS AND WHY THEY ARE IMPORTANT?

Research Article Stability Analysis for Higher-Order Adjacent Derivative in Parametrized Vector Optimization

TOPOLOGY: THE JOURNEY INTO SEPARATION AXIOMS

1 VECTOR SPACES AND SUBSPACES

ON FIBER DIAMETERS OF CONTINUOUS MAPS

INTRODUCTION TO TOPOLOGY

1 Local Brouwer degree

4. CLASSES OF RINGS 4.1. Classes of Rings class operator A-closed Example 1: product Example 2:

Fixed Point Theorems

3. Mathematical Induction

LEARNING OBJECTIVES FOR THIS CHAPTER

ON COMPLETELY CONTINUOUS INTEGRATION OPERATORS OF A VECTOR MEASURE. 1. Introduction

IRREDUCIBLE OPERATOR SEMIGROUPS SUCH THAT AB AND BA ARE PROPORTIONAL. 1. Introduction

FIRST YEAR CALCULUS. Chapter 7 CONTINUITY. It is a parabola, and we can draw this parabola without lifting our pencil from the paper.

11 Ideals Revisiting Z

1. Let X and Y be normed spaces and let T B(X, Y ).

Lecture 13: Martingales

A UNIQUENESS RESULT FOR THE CONTINUITY EQUATION IN TWO DIMENSIONS. Dedicated to Constantine Dafermos on the occasion of his 70 th birthday

Shape Optimization Problems over Classes of Convex Domains

F. ABTAHI and M. ZARRIN. (Communicated by J. Goldstein)

Sums of Independent Random Variables

Tail inequalities for order statistics of log-concave vectors and applications

Mathematics Review for MS Finance Students

Transcription:

1 Basic Measure Theory In this chapter, we introduce the classes of sets that allow for a systematic treatment of events and random observations in the framework of probability theory. Furthermore, we construct measures, in particular probability measures, on such classes of sets. Finally, we define random variables as measurable maps. 1.1 Classes of Sets In the following, let Ω be a nonempty set and let A 2 Ω (set of all subsets of Ω) be a class of subsets of Ω.Later,Ω will be interpreted as the space of elementary events and A will be the system of observable events. In this section, we introduce names for classes of subsets of Ω that are stable under certain set operations and we establish simple relations between such classes. Definition 1.1. A class of sets A is called -closed (closed under intersections) or a π-system if A B Awhenever A, B A, σ- -closed (closed under countable 1 intersections) if A n A for any choice of countably many sets A 1,A 2,... A, -closed (closed under unions) if A B Awhenever A, B A, σ- -closed (closed under countable unions) if A n A for any choice of countably many sets A 1,A 2,... A, \-closed (closed under differences) if A \ B Awhenever A, B A,and closed under complements if A c := Ω \ A Afor any set A A. 1 By countable we always mean either finite or countably infinite.

2 1 Basic Measure Theory Definition 1.2 (σ-algebra). A class of sets A 2 Ω is called a σ-algebra if it fulfils the following three conditions: (i) Ω A. (ii) A is closed under complements. (iii) A is closed under countable unions. Sometimes a σ-algebra is also named a σ-field. As we will see, we can define probabilities on σ-algebras in a consistent way. Hence these are the natural classes of sets to be considered as events in probability theory. Theorem 1.3. If A is closed under complements, then we have the equivalences A is -closed A is -closed, A is σ- -closed A is σ- -closed. Proof. The two statements are immediate consequences of de Morgan s rule (reminder: ( A i ) c = A c i ). For example, let A be σ- -closed and let A 1,A 2,... A.Hence ( c A n = An) c A. Thus A is σ- -closed. The other cases can be proved similarly. Theorem 1.4. Assume that A is \-closed. Then the following statements hold: (i) A is -closed. (ii) If in addition A is σ- -closed, then A is σ- -closed. (iii) Any countable (respectively finite) union of sets in A can be expressed as a countable (respectively finite) disjoint union of sets in A. Proof. (i) Assume that A, B A. Hence also A B = A \ (A \ B) A. (ii) Assume that A 1,A 2,... A.Hence A n = (A 1 A n )= A 1 \ (A 1 \ A n )=A 1 \ (A 1 \ A n ) A. (iii) n=2 n=2 n=2 Assume that A 1,A 2,... A.Hencearepresentationof A n as a countable disjoint union of sets in A is A n = A 1 (A 2 \ A 1 ) ((A 3 \ A 1 ) \ A 2 ) (((A 4 \ A 1 ) \ A 2 ) \ A 3 )...

1.1 Classes of Sets 3 Remark 1.5. Sometimes the disjoint union of sets is denoted by the symbol. Note that this is not a new operation but only stresses the fact that the sets involved are mutually disjoint. Definition 1.6. A class of sets A 2 Ω is called an algebra if the following three conditions are fulfilled: (i) Ω A. (ii) A is \-closed. (iii) A is -closed. If A is an algebra, then obviously = Ω \ Ω is in A. However,ingeneral,this property is weaker than (i) in Definition 1.6. Theorem 1.7. A class of sets A 2 Ω is an algebra if and only if the following three properties hold: (i) Ω A. (ii) A is closed under complements. (iii) A is closed under intersections. Proof. This is left as an exercise. Definition 1.8. A class of sets A 2 Ω is called a ring if the following three conditions hold: (i) A. (ii) A is \-closed. (iii) A is -closed. A ring is called a σ-ring if it is also σ- -closed. Definition 1.9. A class of sets A 2 Ω is called a semiring if (i) A, (ii) for any two sets A, B Athe difference set B \ A is a finite union of mutually disjoint sets in A, (iii) A is -closed.

4 1 Basic Measure Theory Definition 1.10. A class of sets A 2 Ω is called a λ-system (or Dynkin s λ-system) if (i) Ω A, (ii) for any two sets A, B Awith A B,thedifferencesetB \ A is in A, and (iii) A n A for any choice of countably many pairwise disjoint sets A 1,A 2,... A. Example 1.11. (i) For any nonempty set Ω, the classes A = {,Ω} and A =2 Ω are the trivial examples of algebras, σ-algebras and λ-systems. On the other hand, A = { } and A =2 Ω are the trivial examples of semirings, rings and σ-rings. (ii) (iii) Let Ω = R. ThenA = {A R : A is countable} is a σ-ring. A = {(a, b] : a, b R, a b} is a semiring on Ω = R (but is not a ring). (iv) The class of finite unions of bounded intervals is a ring on Ω = R (but is not an algebra). (v) The class of finite unions of arbitrary (also unbounded) intervals is an algebra on Ω = R (but is not a σ-algebra). (vi) Let E be a finite nonempty set and let Ω := E N be the set of all E-valued sequences ω =(ω n ) n N.Foranyω 1,...,ω n E, let [ω 1,...,ω n ] := {ω Ω : ω i = ω i for all i =1,...,n} be the set of all sequences whose first n values are ω 1,...,ω n.leta 0 = { }. For n N, define A n := {[ω 1,...,ω n ]: ω 1,...,ω n E}. (1.1) Hence A := n=0 A n is a semiring but is not a ring (if #E >1). (vii) Let Ω be an arbitrary nonempty set. Then A := {A Ω : A or A c is finite} is an algebra. However, if #Ω =, thena is not a σ-algebra. (viii) Let Ω be an arbitrary nonempty set. Then A := {A Ω : A or A c is countable} is a σ-algebra. (ix) Every σ-algebra is a λ-system. (x) Let Ω = {1, 2, 3, 4} and A = {, {1, 2}, {1, 4}, {2, 3}, {3, 4}, {1, 2, 3, 4} }. Hence A is a λ-system but is not an algebra.

1.1 Classes of Sets 5 Theorem 1.12 (Relations between classes of sets). (i) Every σ-algebra also is a λ-system, an algebra and a σ-ring. (ii) Every σ-ring is a ring, and every ring is a semiring. (iii) Every algebra is a ring. An algebra on a finite set Ω is a σ-algebra. Proof. (i) This is obvious. (ii) Let A be a ring. By Theorem 1.4, A is closed under intersections and is hence a semiring. (iii) Let A be an algebra. Then = Ω \Ω A,andhenceA is a ring. If in addition Ω is finite, then A is finite. Hence any countable union of sets in A is a finite union of sets. Definition 1.13 (liminf and limsup). Let A 1,A 2,...be subsets of Ω. The sets lim inf A n := A m and lim sup A n := A m n n m=n m=n are called limes inferior and limes superior,respectively,ofthesequence(a n ) n N. Remark 1.14. (i) lim inf and lim sup can be rewritten as lim inf A n = { ω Ω :#{n N : ω A n } < }, n lim sup A n = { ω Ω :#{n N : ω A n } = }. n In other words, limes inferior is the event where eventually all of the A n occur. On the other hand, limes superior is the event where infinitely many of the A n occur. In particular, A := lim inf n A n A := lim sup n A n. (ii) We define the indicator function on the set A by 1 A (x) := { 1, if x A, 0, if x A. (1.2) With this notation, 1 A = lim inf 1 A n n and 1 A =limsup1 An. n (iii) If A 2 Ω is a σ-algebra and if A n Afor every n N, thena Aand A A. Proof. This is left as an exercise.

6 1 Basic Measure Theory Theorem 1.15 (Intersection of classes of sets). Let I be an arbitrary index set, and assume that A i is a σ-algebra for every i I.Hencetheintersection A I := { A Ω : A A i for every i I } = i I A i is a σ-algebra. The analogous statement holds for rings, σ-rings, algebras and λ- systems. However, it fails for semirings. Proof. We give the proof for σ-algebras only. To this end, we check (i) (iii) of Definition 1.2. (i) Clearly, Ω A i for every i I,andhenceΩ A. (ii) Assume A A.HenceA A i for any i I. Thus also A c A i for any i I.WeconcludethatA c A. (iii) Assume A 1,A 2,... A.HenceA n A i for every n N and i I. Thus A := A n A i for every i I.WeconcludeA A. Counterexample for semirings: Let Ω = {1, 2, 3, 4}, A 1 = {,Ω,{1}, {2, 3}, {4}} and A 2 = {,Ω,{1}, {2}, {3, 4}}. ThenA 1 and A 2 are semirings but A 1 A 2 = {,Ω,{1}} is not. Theorem 1.16 (Generated σ-algebra). Let E 2 Ω. Then there exists a smallest σ-algebra σ(e) with E σ(e): σ(e) := A 2 Ω is a σ-algebra A E σ(e) is called the σ-algebra generated by E. E is called a generator of σ(e). Similarly, we define δ(e) as the λ-system generated by E. Proof. A =2 Ω is a σ-algebra with E A.Hencetheintersectionisnonempty.By Theorem 1.15, σ(e) is a σ-algebra. Clearly, it is the smallest σ-algebra that contains E. Forλ-systems the proof is similar. Remark 1.17. The following three statements hold: (i) E σ(e). (ii) If E 1 E 2,thenσ(E 1 ) σ(e 2 ). (iii) A is a σ-algebra if and only if σ(a) =A. A. The same statements hold for λ-systems. Furthermore, δ(e) σ(e).

Theorem 1.18 ( -closed λ-system). Let D 2 Ω be a λ-system. Then D is a π-system D is a σ-algebra. 1.1 Classes of Sets 7 Proof. = This is obvious. = We check (i) (iii) of Definition 1.2. (i) Clearly, Ω D. (ii) (Closedness under complements) Let A D. Since Ω Dand by property (ii) of the λ-system, we get that A c = Ω \ A D. (iii) (σ- -closedness) Let A, B D. By assumption, A B D,andhencetrivially A B A. ThusA \ B = A \ (A B) D.ThisimpliesthatD is \-closed. Now let A 1,A 2,... D. By Theorem 1.4(iii), there exist mutually disjoint sets B 1,B 2,... Dwith A n = B n D. σ- -stable algebra Ω A ring σ-algebra σ-ring σ- -stable -stable Ω A -stable λ-system semiring Fig. 1.1. Inclusions between classes of sets A 2 Ω. Theorem 1.19 (Dynkin s π-λ theorem). If E 2 Ω is a π-system, then σ(e) =δ(e). Proof. This follows from Remark 1.17. We have to show that δ(e) is a σ-algebra. By Theorem 1.18, it is enough to show that δ(e) is a π-system. For any B δ(e) define D B := {A δ(e) :A B δ(e)}.

8 1 Basic Measure Theory In order to show that δ(e) is a π-system, it is enough to show that δ(e) D B for any B δ(e). (1.3) In order to show that D E is a λ-system for any E δ(e), we check (i) (iii) of Definition 1.10: (i) Clearly, Ω E = E δ(e); henceω D E. (ii) For anya, B D E with A B,wehave(B \ A) E =(B E) \ (A E) δ(e). (iii) Assume that A 1,A 2,... D E are mutually disjoint. Hence ( ) A n E = (A n E) δ(e). By assumption, A E Eif A E;thusE D E if E E. By Remark 1.17(ii), we conclude that δ(e) D E for any E E.HencewegetthatB E δ(e) for any B δ(e) and E E.ThisimpliesthatE D B for any B δ(e). ThusE D B for any B δ(e), andhence(1.3)follows. We are particularly interested in σ-algebras that are generated by topologies. The most prominent role is played by the Euclidean space R n,howeverwewillalso consider the (infinite-dimensional) space C([0, 1]) of continuous functions [0, 1] R. OnC([0, 1]) the norm f = sup x [0,1] f(x) induces a topology. For the convenience of the reader, we recall the definition of a topology. Definition 1.20 (Topology). Let Ω be an arbitrary set. A class of sets τ Ω is called a topology on Ω if it has the following three properties: (i),ω τ. (ii) A B τ for any A, B τ. (iii) ( A F A) τ for any F τ. The pair (Ω,τ) is called a topological space. The sets A τ are called open, and the sets A Ω with A c τ are called closed. In contrast with σ-algebras, topologies are closed under finite intersections only, but they are also closed under arbitrary unions. Let d be a metric on Ω,anddenotetheopenballwithradiusr>0 centred at x Ω by B r (x) ={y Ω : d(x, y) <r}. Then the usual class of open sets is the topology { } τ = B r(x) : F Ω (0, ). (x,r) F

1.1 Classes of Sets 9 Definition 1.21 (Borel σ-algebra). Let (Ω,τ) be a topological space. The σ- algebra B(Ω) := B(Ω,τ) :=σ(τ) that is generated by the open sets is called the Borel σ-algebra on Ω.Theelements A B(Ω,τ) are called Borel sets or Borel measurable sets. Remark 1.22. In many cases, we are interested in B(R n ), where R n is equipped with the Euclidean distance d(x, y) = x y 2 = n (x i y i ) 2. (i) There are subsets of R n that are not Borel sets. These sets are not easy to construct like, for example, Vitali sets that can be found in calculus books (see also [35, Theorem 3.4.4]). Here we do not want to stress this point but state that, vaguely speaking, all sets that can be constructed explicitly are Borel sets. (ii) If C R n is a closed set, then C c τ is in B(R n ) and hence C is a Borel set. In particular, {x} B(R n ) for every x R n. (iii) B(R n ) is not a topology. To show this, let V R n such that V B(R n ).If B(R n ) were a topology, then it would be closed under arbitrary unions. As {x} B(R n ) for all x R n,wewouldgetthecontradictionv = x V {x} B(Rn ). In most cases the class of open sets that generates the Borel σ-algebra is too big to work with efficiently. Hence we aim at finding smaller (in particular, countable) classes of sets that generate the Borel σ-algebra and that are more amenable. In some of the examples, the elements of the generating class are simpler sets such as rectangles or compact sets. We introduce the following notation. We denote by Q the set of rational numbers and by Q + the set of strictly positive rational numbers. For a, b R n,wewrite a<b if a i <b i for all i =1,...,n. (1.4) For a<b,wedefinetheopenrectangle as the Cartesian product (a, b) := n (a i,b i ):=(a 1,b 1 ) (a 2,b 2 ) (a n,b n ). (1.5) Analogously, we define [a, b], (a, b] and [a, b). Furthermore,wedefine(,b):= n (,b i ),anduseananalogousdefinitionfor(,b] and so on. We introduce the following classes of sets:

10 1 Basic Measure Theory E 1 := {A R n : A is open}, E 2 := {A R n : A is closed}, E 3 := {A R n : A is compact}, E 4 := {B r (x) : x Q n,r Q + }, E 5 := {(a, b) : a, b Q n,a<b}, E 6 := {[a, b) : a, b Q n,a<b}, E 7 := {(a, b] : a, b Q n,a<b}, E 8 := {[a, b] : a, b Q n,a<b}, E 9 := {(,b): b Q n }, E 10 := {(,b]: b Q n }, E 11 := {(a, ) : a Q n }, E 12 := {[a, ) : a Q n }. Theorem 1.23. The Borel σ-algebra B(R n ) is generated by any of the classes of sets E 1,...,E 12, that is, B(R n )=σ(e i ) for any i =1,...,12. Proof. We only show some of the identities. (1) By definition, B(R n )=σ(e 1 ). (2) Let A E 1.ThenA c E 2,andhenceA =(A c ) c σ(e 2 ). It follows that E 1 σ(e 2 ). By Remark 1.17, this implies σ(e 1 ) σ(e 2 ). Similarly, we obtain σ(e 2 ) σ(e 1 ) and hence equality. (3) Any compact set is closed; hence σ(e 3 ) σ(e 2 ). Now let A E 2. The sets A K := A [ K, K] n, K N, arecompact;hencethecountableuniona = K=1 A K is in σ(e 3 ). It follows that E 2 σ(e 3 ) and thus σ(e 2 )=σ(e 3 ). (4) Clearly, E 4 E 1 ;henceσ(e 4 ) σ(e 1 ).NowletA R n be an open set. For any x A, definer(x) = min(1, sup{r > 0 : B r (x) A}). Notethat R(x) > 0, asa is open. Let r(x) (R(x)/2,R(x)) Q. Foranyy A and x (B R(y)/3 (y)) Q n,wehaver(x) R(y) x y 2 > 2 3R(y), andhence r(x) > 1 3 R(y) and thus y B r(x)(x). It follows that A = x A Q B n r(x) (x) is a countable union of sets from E 4 and is hence in σ(e 4 ).Wehaveshownthat E 1 σ(e 4 ). By Remark 1.17, this implies σ(e 1 ) σ(e 4 ). (5 12) Exhaustion arguments similar to that in (4) also work for rectangles. If in (4) we take open rectangles instead of open balls B r (x),wegetb(r n )=σ(e 5 ).For example, we have n [a i,b i )= n k=1 ( a i 1 ) k,b i σ(e 5 ). The other inclusions E i σ(e j ) can be shown similarly. Remark 1.24. Any of the classes E 1, E 2, E 3, E 5,...,E 12 (but not E 4 )isaπ-system. Hence, the Borel σ-algebra equals the generated λ-system: B(R n )=δ(e i ) for i = 1, 2, 3, 5,...,12. In addition, the classes E 4,...,E 12 are countable. This is a crucial property that will be needed later.

1.1 Classes of Sets 11 Definition 1.25 (Trace of a class of sets). Let A 2 Ω be an arbitrary class of subsets of Ω and let A 2 Ω \{ }.Theclass A A := {A B : B A} 2 A (1.6) is called the trace of A on A or the restriction of A to A. Theorem 1.26. Let A Ω be a nonempty set and let A be a σ-algebra on Ω or any of the classes of Definitions 1.6 1.10. Then A A is a class of sets of the same type as A;however,onA instead of Ω. Proof. This is left as an exercise. Exercise 1.1.1. Let A be a semiring. Show that any countable (respectively finite) union of sets in A can be written as a countable (respectively finite) disjoint union of sets in A. Exercise 1.1.2. Give a counterexample that shows that, in general, the union A A of two σ-algebras need not be a σ-algebra. Exercise 1.1.3. Let (Ω 1,d 1 ) and (Ω 2,d 2 ) be metric spaces and let f : Ω 1 Ω 2 be an arbitrary map. Denote by U f = { x Ω 1 : f is discontinuous at x } the set of points of discontinuity of f. ShowthatU f B(Ω 1 ). Hint: First show that for any ε>0 and δ>0 the set U δ,ε f := { x Ω 1 : there are y, z B ε (x) with d 2 (f(y),f(z)) >δ } is open (where B ε (x) ={y Ω 1 : d 1 (x, y) <ε}). Then construct U f from such U δ,ε f. Exercise 1.1.4. Let Ω be an uncountably infinite set and A = σ({ω} : ω Ω). Show that A = { A Ω : A is countable or A c is countable }. Exercise 1.1.5. Let A be a ring on the set Ω. ShowthatA is an Abelian algebraic ring with multiplication andaddition.

12 1 Basic Measure Theory 1.2 Set Functions Definition 1.27. Let A 2 Ω and let µ : A [0, ] be a set function. We say that µ is (i) monotone if µ(a) µ(b) for any two sets A, B Awith A B, ( n ) (ii) additive if µ A i = n µ(a i ) for any choice of finitely many mutually disjoint sets A 1,...,A n Awith n A i A, ( ) (iii) σ-additive if µ A i = µ(a i ) for any choice of countably many mutually disjoint sets A 1,A 2,... Awith A i A, (iv) subadditive if for any choice of finitely many sets A, A 1,...,A n A n A i,wehaveµ(a) n µ(a i ),and A with (v) σ-subadditive if for any choice of countably many sets A, A 1,A 2,... Awith A A i,wehaveµ(a) µ(a i ). Definition 1.28. Let A be a semiring and let µ : A [0, ] be a set function with µ( ) =0. µ is called a content if µ is additive, premeasure if µ is σ-additive, measure if µ is a premeasure and A is a σ-algebra, and probability measure if µ is a measure and µ(ω) =1. Definition 1.29. Let A be a semiring. A content µ on A is called (i) finite if µ(a) < for every A Aand (ii) σ-finite if there exists a sequence of sets Ω 1,Ω 2,... Asuch that Ω = and such that µ(ω n ) < for all n N. Ω n Example 1.30 (Contents, measures). (i) Let ω Ω and δ ω (A) =1 A (ω) (see (1.2)). Then δ ω is a probability measure on any σ-algebra A 2 Ω. δ ω is called the Dirac measure for the point ω.

1.2 Set Functions 13 (ii) Let Ω be a finite nonempty set. By µ(a) := #A #Ω for A Ω, we define a probability measure on A =2 Ω. This µ is called the uniform distribution on Ω. For this distribution, we introduce the symbol U Ω := µ. The resulting triple (Ω,A, U Ω ) is called a Laplace space. (iii) Let Ω be countably infinite and let A := {A Ω :#A< or #A c < }. Then A is an algebra. The set function µ on A defined by { 0, if A is finite, µ(a) =, if A c is finite, is a content but is not a premeasure. Indeed, µ ( ω Ω {ω}) = µ(ω) =, but µ ({ω}) =0. ω Ω (iv) Let (µ n ) n N be a sequence of measures (premeasures, contents) and let (α n ) n N be a sequence of nonnegative numbers. Then also µ := α nµ n is a measure (premeasure, content). (v) Let Ω be an (at most) countable nonempty set and let A =2 Ω.Further,let (p ω ) ω Ω be nonnegative numbers. Then A µ(a) := ω A p ω defines a σ-finite measure on 2 Ω.Wecallp =(p ω ) ω Ω the weight function of µ. Thenumberp ω is called the weight of µ at point ω. (vi) If in (v) the sum ω Ω p ω equals one, then µ is a probability measure. In this case, we interpret p ω as the probability of the elementary event ω. Thevector p =(p ω ) ω Ω is called a probability vector. (vii) If in (v) p ω =1for every ω Ω,thenµ is called counting measure on Ω.If Ω is finite, then so is µ. (viii) Let A be the ring of finite unions of intervals (a, b] R.Fora 1 <b 1 <a 2 < b 2 <...<b n and A = n (a i,b i ],define µ(a) = n (b i a i ). Then µ is a σ-finite content on A (even a premeasure) since µ(( n, n]) = 2n < for all n N. ( n, n] =R and

14 1 Basic Measure Theory (ix) Let f : R [0, ) be continuous. In a similar way to (viii), we define µ f (A) = n bi a i f(x) dx. Then µ f is a σ-finite content on A (even a premeasure). The function f is called the density of µ and plays a role similar to the weight function p in (v). Lemma 1.31 (Properties of contents). Let A be a semiring and let µ be a content on A.Thenthefollowingstatementshold. (i) If A is a ring, then µ(a B) +µ(a B) =µ(a) +µ(b) for any two sets A, B A. (ii) µ is monotone. If A is a ring, then µ(b) =µ(a)+µ(b \ A) for any two sets A, B Awith A B. (iii) µ is subadditive. If µ is σ-additive, then µ is also σ-subadditive. (iv) If A is a ring, then ( ) µ(a n ) µ A n for any choice of countably many mutually disjoint sets A 1,A 2,... Awith A n A. Proof. (i) Note that A B = A (B \ A) and B =(A B) (B \ A). Asµ is additive, we obtain µ(a B) =µ(a)+µ(b \ A) and µ(b) =µ(a B)+µ(B \ A). This implies (i). (ii) Let A B. SinceA B = A, weobtainµ(b) = µ(a (B \ A)) = µ(a)+µ(b \ A) if B \ A A. In particular, this is true if A is a ring. If A is only a semiring, then there exists an n N and mutually disjoint sets C 1,...,C n A such that B \ A = n C i.henceµ(b) =µ(a)+ n µ(c i) µ(a) and thus µ is monotone. (iii) Let n N and A, A 1,...,A n Awith A n A i.defineb 1 = A 1 and B k = A k \ k 1 A i = (A k \ (A k A i )) for k =2,...,n. k 1 By the definition of a semiring, any A k \ (A k A i ) is a finite disjoint union of sets in A. Hence there exists a c k N and sets C k,1,...,c k,ck A such that ck C k,i = B k A k. Similarly, there exist d k N and D k,1,...,d k,dk A such that A k \ B k = d k D k,i. Sinceµ is additive, we have µ(a k )= c k µ(c k,i )+ d k µ(d k,i ) c k µ(c k,i ).

Again due to additivity and monotonicity, we get ( n ) c k µ(a) =µ (C k,i A) = n k=1 c k k=1 µ(c k,i ) n n µ(a k ). k=1 c k k=1 1.2 Set Functions 15 µ(c k,i A) Hence µ is subadditive. By a similar argument, σ-subadditivity follows from σ- additivity. (iv) Let A be a ring and let A = A n A.Sinceµ is additive (and thus monotone), we have by (ii) ( m m ) µ(a n )=µ A n µ(a) for any m N. It follows that µ(a n ) µ(a). Remark 1.32. The inequality in (iv) can be strict (see Example 1.30(iii)). In other words, there are contents that are not premeasures. Theorem 1.33 (Inclusion-exclusion formula). Let A be a ring and let µ be a content on A. Letn N and A 1,...,A n A.Thenthefollowinginclusionandexclusion formulas hold: n µ(a 1... A n )= ( 1) k 1 µ(a i1... A ik ), µ(a 1... A n )= k=1 n ( 1) k 1 k=1 {i 1,...,i k } {1,...,n} {i 1,...,i k } {1,...,n} Here summation is over all subsets of {1,...,n} with k elements. µ(a i1... A ik ). Proof. This is left as an exercise. Hint: Use induction on n. The next goal is to characterise σ-subadditivity by a certain continuity property (Theorem 1.36). To this end, we agree on the following conventions. Definition 1.34. Let A, A 1,A 2,...be sets. We write A n A and say that (A n ) n N increases to A if A 1 A 2... and A n = A,and A n A and say that (A n ) n N decreases to A if A 1 A 2 A 3... and A n = A.

16 1 Basic Measure Theory Definition 1.35 (Continuity of contents). Let µ be a content on the ring A. (i) µ is called lower semicontinuous if µ(a n ) n µ(a) for any A Aand any sequence (A n ) n N in A with A n A. (ii) µ is called upper semicontinuous if µ(a n ) n µ(a) for any A Aand any sequence (A n ) n N in A with µ(a n ) < for some (and then eventually all) n N and A n A. (iii) µ is called -continuous if (ii) holds for A =. In the definition of upper semicontinuity, we needed the assumption µ(a n ) < since otherwise we would not even have -continuity for an example as simple as the counting measure µ on (N, 2 N ).Indeed, A n := {n, n +1,...} but µ(a n )= for all n N. Theorem 1.36 (Continuity and premeasure). Let µ be a content on the ring A. Consider the following five properties. (i) µ is σ-additive (and hence a premeasure). (ii) µ is σ-subadditive. (iii) µ is lower semicontinuous. (iv) µ is -continuous. (v) µ is upper semicontinuous. Then the following implications hold: (i) (ii) (iii)= (iv) (v). If µ is finite, then we also have (iv) = (iii). Proof. (i) = (ii) Let A, A 1,A 2,... Awith A A i.defineb 1 = A 1 and B n = A n \ n 1 A i Afor n =2, 3,...ThenA = (A B n).sinceµ is monotone and σ-additive, we infer Hence µ is σ-subadditive. µ(a) = µ(a B n ) µ(a n ). (ii) = (i) This follows from Lemma 1.31(iv). (i) = (iii) Let µ be a premeasure and A A.Let(A n ) n N be a sequence in A such that A n A and let A 0 =. Then µ(a) = µ(a i \ A i 1 ) = lim n n µ(a i \ A i 1 ) = lim n µ(a n).

1.2 Set Functions 17 (iii) = (i) Assume now that (iii) holds. Let B 1,B 2,... A be mutually disjoint, and assume that B = B n A.DefineA n = n B i for all n N.Then it follows from (iii) that µ(b) = lim n µ(a n)= Hence µ is σ-additive and therefore a premeasure. µ(b i ). (iv) = (v) Let A, A 1,A 2,... Awith A n A and µ(a 1 ) <. Define B n = A n \ A Afor all n N. ThenB n. This implies µ(a n ) µ(a) = µ(b n ) n 0. (v) = (iv) This is evident. (iii) = (iv) Let A 1,A 2,... Awith A n and µ(a 1 ) <. ThenA 1 \ A n Afor any n N and A 1 \ A n A 1.Hence µ(a 1 ) = lim n µ(a 1 \ A n )=µ(a 1 ) lim n µ(a n). Since µ(a 1 ) <, wehave lim n µ(a n)=0. (iv) = (iii) (for finite µ) Assume that µ(a) < for every A Aand that µ is -continuous. Let A, A 1,A 2,... Awith A n A.ThenwehaveA \ A n and µ(a) µ(a n )=µ(a \ A n ) n 0. Hence (iii) follows. Example 1.37. (Compare Example 1.30(iii).) Let Ω be a countable set, and define A = {A Ω :#A< or #A c < }, { 0, if A is finite, µ(a) =, if A is infinite. Then µ is an -continuous content but not a premeasure.

18 1 Basic Measure Theory Definition 1.38. (i) A pair (Ω,A) consisting of a nonempty set Ω and a σ-algebra A 2 Ω is called a measurable space. The sets A Aare called measurable sets.ifω is at most countably infinite and if A =2 Ω,thenthemeasurablespace (Ω,2 Ω ) is called discrete. (ii) A triple (Ω,A,µ) is called a measure space if (Ω,A) is a measurable space and if µ is a measure on A. (iii) If in addition µ(ω) =1,then(Ω,A,µ) is called a probability space. In this case, the sets A Aare called events. (iv) The set of all finite measures on (Ω,A) is denoted by M f (Ω) :=M f (Ω,A). The subset of probability measures is denoted by M 1 (Ω) :=M 1 (Ω,A). Finally, the set of σ-finite measures on (Ω,A) is denoted by M σ (Ω,A). Exercise 1.2.1. Let A = {(a, b] Q : a, b R, a b}. Defineµ : A [0, ) by µ ( (a, b] Q ) = b a. ShowthatA is a semiring and µ is a content on A that is lower and upper semicontinuous but is not σ-additive. 1.3 The Measure Extension Theorem In this section, we construct measures µ on σ-algebras. The starting point will be to define the values of µ on a smaller class of sets; that is, on a semiring. Under a mild consistency condition, the resulting set function can be extended to the whole σ-algebra. Before we develop the complete theory, we begin with two examples. Example 1.39 (Lebesgue measure). Let n N and let A = {(a, b] : a, b R n,a<b} be the semiring of half open rectangles (a, b] R n (see (1.5)). The n-dimensional volume of such a rectangle is µ((a, b]) = n (b i a i ). Can we extend the set function µ to a (uniquely determined) measure on the Borel σ-algebra B(R n )=σ(a)? We will see that this is indeed possible. The resulting measure is called Lebesgue measure (or sometimes Lebesgue-Borel measure) λ on ( R n, B(R n ) ). Example 1.40 (Product measure, Bernoulli measure). We construct a measure for an infinitely often repeated random experiment with finitely many possible outcomes. Let E be the set of possible outcomes. For e E, letp e 0 be the probability that e occurs. Hence e E p e =1.Forafixedrealisationoftherepeated

1.3 The Measure Extension Theorem 19 experiment, let ω 1,ω 2,... E be the observed outcomes. Hence the space of all possible outcomes of the repeated experiment is Ω = E N. As in Example 1.11(vi), we define the set of all sequences whose first n values are ω 1,...,ω n : [ω 1,...,ω n ] := {ω Ω : ω i = ω i for any i =1,...,n}. (1.7) Let A 0 = { }. For n N, definetheclassofcylindersetsthatdependonlyonthe first n coordinates and let A := n=0 A n. A n := {[ω 1,...,ω n ]: ω 1,...,ω n E}, (1.8) We interpret [ω 1,...,ω n ] as the event where the outcome of the first experiment is ω 1,theoutcomeofthesecondexperimentisω 2 and finally the outcome of the nth experiment is ω n.theoutcomesoftheotherexperimentsdonotplayarolefor the occurrence of this event. As the individual experiments ought to be independent, we should have for any choice ω 1,...,ω n E that the probability of the event [ω 1,...,ω n ] is the product of the probabilities of the individual events; that is, µ([ω 1,...,ω n ]) = n p ωi. This formula defines a content µ on the semiring A, andouraimistoextendµ in a unique way to a probability measure on the σ-algebra σ(a) that is generated by A. Before we do so, we make the following definition. Define the (ultra-)metric d on Ω by { 2 inf{n N: ω n ω d(ω, ω n }, if ω ω, )= (1.9) 0, if ω = ω. Hence (Ω,d) is a compact metric space. Clearly, [ω 1,...,ω n ]=B 2 n(ω) ={ω Ω : d(ω, ω ) < 2 n }. The complement of [ω 1,...,ω n ] is an open set, as it is the union of (#E) n 1 open balls [ω 1,...,ω n ] c = [ω 1,...,ω n]. (ω 1,...,ω n ) (ω 1,...,ω n ) Since Ω is compact, the closed subset [ω 1,...,ω n ] is compact. As in Theorem 1.23, it can be shown that σ(a) =B(Ω,d). Exercise: Prove the statements made above. The main result of this chapter is Carathéodory s measure extension theorem. Theorem 1.41 (Carathéodory). Let A 2 Ω be a ring and let µ be a σ-finite premeasure on A.There existsa unique measure µ on σ(a) such that µ(a) =µ(a) for all A A. Furthermore, µ is σ-finite.

20 1 Basic Measure Theory We prepare for the proof of this theorem with a couple of lemmas. In fact, we will show a slightly stronger statement in Theorem 1.53. Lemma 1.42 (Uniqueness by an -closed generator). Let (Ω,A,µ) be a σ-finite measure space and let E A be a π-system that generates A. Assumethatthere exist sets E 1,E 2,... Esuch that E n Ω and µ(e n ) < for all n N. Thenµ is uniquely determined by the values µ(e), E E. If µ is a probability measure, the existence of the sequence (E n ) n N is not needed. Proof. Let ν be a (possibly different) σ-finite measure on (Ω,A) such that µ(e) =ν(e) for every E E. Let E Ewith µ(e) <. Consider the class of sets D E = { A A: µ(a E) =ν(a E) }. In order to show that D E is a λ-system, we check the properties of Definition 1.10: (i) Clearly, Ω D E. (ii) Let A, B D E with A B.Then µ ((A \ B) E) =µ(a E) µ(b E) Hence A \ B D E. = ν(a E) ν(b E) =ν ((A \ B) E). (iii) Let A 1,A 2,... D E be mutually disjoint and A = µ(a E) = Hence A D E. µ(a n E) = A n.then ν(a n E) =ν(a E). Clearly, E D E ;henceδ(e) D E.SinceE is a π-system, Theorem 1.19 yields Hence D E = A. A D E δ(e) =σ(e) =A. This implies µ(a E) =ν(a E) for any A Aand E Ewith µ(e) <. Now let E 1,E 2,... Ebe a sequence such that E n Ω and µ(e n ) < for all n N. Sinceµ and ν are lower semicontinuous, for all A A,wehave µ(a) = lim n µ(a E n) = lim n ν(a E n)=ν(a).

1.3 The Measure Extension Theorem 21 The additional statement is trivial as Ẽ := E {Ω} is a π-system that generates A, andthevalue µ(ω) =1is given. Hence one can choose the constant sequence E n = Ω, n N. However,notethatitisnotenoughtoassumethatµ is finite. In this case, in general, the total mass µ(ω) is not uniquely determined by the values µ(e), E E; see Example 1.45(ii). Example 1.43. Let Ω = Z and E = { E n : n Z } where E n =(,n] Z.Then E is a π-system and σ(e) =2 Ω.Henceafinitemeasureµ on (Ω,2 Ω ) is uniquely determined by the values µ(e n ), n Z. However, a σ-finite measure on Z is not uniquely determined by the values on E: Let µ be the counting measure on Z and let ν =2µ. Henceµ(E) = = ν(e) for all E E. In order to distinguish µ and ν one needs a generator that contains sets of finite measure (of µ). Do the sets F n =[ n, n] Z, n N do the trick? Indeed, for any σ-finite measure µ, wehaveµ( F n ) < for all n N. However,thesets F n do not generate 2 Ω (but which σ-algebra?). We get things to work out better if we modify the definition: F n =[ n/2, (n+1)/2] Z.Nowσ({F n,n N}) =2 Ω,and hence E = {F n,n N} is a π-system that generates 2 Ω and such that µ(f n ) < for all n N. TheconditionsofthetheoremarefulfilledasF n Ω. ( Example 1.44 (Distribution function). A probability measure µ on the space R n, B(R n ) ) is uniquely determined by the values µ((,b]) (where (,b]= n (,b i ], b R n ). In fact, these sets form a π-system that generates B(R n ) (see Theorem 1.23). In particular, a probability measure µ on R is uniquely determined by its distribution function F : R [0, 1], x µ((,x]). Example 1.45. (i) Let Ω = {1, 2, 3, 4} and E = { { 1, 2}, {2, 3} }. Clearly, σ(e) = 2 Ω but E is not a π-system. In fact, here a probability measure µ is not uniquely determined by the values, say µ({1, 2}) =µ({2, 3}) = 1 2.Wejustgivetwodifferent possibilities: µ = 1 2 δ 1 + 1 2 δ 3 and µ = 1 2 δ 2 + 1 2 δ 4. (ii) Let Ω = {1, 2} and E = {{1}}.ThenE is a π-system that generates 2 Ω.Hence a probability measure µ is uniquely determined by the value µ({1}). However,a finite measure is not determined by its value on {1}, asµ = 0 and ν = δ 2 are different finite measures that agree on E.

22 1 Basic Measure Theory Definition 1.46 (Outer measure). Asetfunctionµ :2 Ω [0, ] is called an outer measure if (i) µ ( ) =0,and (ii) µ is monotone, (iii) µ is σ-subadditive. Lemma 1.47. Let A 2 Ω be an arbitrary class of sets with Aand let µ be a monotone set function on A with µ( ) =0.ForA Ω, definethesetofcountable coverings of F with sets F A: { U(A) = F A: F is at most countable and A } F. Define { } µ (A) :=inf µ(f ): F U(A), F F F F where inf =. Thenµ is an outer measure. If in addition µ is σ-subadditive, then µ (A) =µ(a) for all A A. Proof. We check properties (i) (iii) of an outer measure. (i) Since A,wehave{ } U( ); henceµ ( ) =0. (ii) If A B,thenU(A) U(B); henceµ (A) µ (B). (iii) Let A n Ω for any n N and let A A n.weshowthatµ (A) µ (A n ). Without loss of generality, assume µ (A n ) < and hence U(A n ) for all n N. Fixε>0. Foreveryn N, chooseacovering F n U(A n ) such that F F n µ(f ) µ (A n )+ε 2 n. Then F := F n U(A) and µ (A) µ(f ) F F F F n µ(f ) µ (A n )+ε. Let A A.Since{A} U(A), wehaveµ (A) µ(a). Ifµ is σ-subadditive, then for any F U(A), wehave F F µ(f ) µ(a); henceµ (A) µ(a). Definition 1.48 (µ -measurable sets). Let µ be an outer measure. A set A 2 Ω is called µ -measurable if µ (A E)+µ (A c E) =µ (E) for any E 2 Ω. (1.10) We write M(µ )={A 2 Ω : A is µ -measurable}.

1.3 The Measure Extension Theorem 23 Lemma 1.49. A M(µ ) if and only if µ (A E)+µ (A c E) µ (E) for any E 2 Ω. Proof. As µ is subadditive, the other inequality is trivial. Lemma 1.50. M(µ ) is an algebra. Proof. We check properties (i) (iii) of an algebra from Theorem 1.7. (i) Ω M(µ ) is evident. (ii) (Closedness under complements) By definition, A M(µ ) A c M(µ ). (iii) (π-system) Let A, B M(µ ) and E 2 Ω.Then µ ((A B) E)+µ ((A B) c E) = µ (A B E)+µ ( (A c B E) (A c B c E) (A B c E) ) µ (A B E)+µ (A c B E) = µ (B E)+µ (B c E) = µ (E). + µ (A c B c E)+µ (A B c E) Here we used A M(µ ) in the last but one equality and B M(µ ) in the last equality. Lemma 1.51. An outer measure µ is σ-additive on M(µ ). Proof. Let A, B M(µ ) with A B =. Then µ (A B) =µ (A (A B)) + µ (A c (A B)) = µ (A)+µ (B). Inductively, we get (finite) additivity. By definition, µ is σ-subadditive; hence we conclude by Theorem 1.36 that µ is also σ-additive. Lemma 1.52. If µ is an outer measure, then M(µ ) is a σ-algebra. In particular, µ is a measure on M(µ ). Proof. By Lemma 1.50, M(µ ) is an algebra and hence a π-system. By Theorem 1.18, it is sufficient to show that M(µ ) is a λ-system. Hence, let A 1,A 2,... M(µ ) be mutually disjoint, and define A := have to show A M(µ ); that is, A n.we µ (A E)+µ (A c E) µ (E) for any E 2 Ω. (1.11)

24 1 Basic Measure Theory Let B n = n A i for all n N. Foralln N, wehave µ (E B n+1 ) = µ ( ) (E B n+1 ) B n + µ ( (E B n+1 ) Bn c ) = µ (E B n )+µ (E A n+1 ). Inductively, we get µ (E B n )= n µ (E A i ).Themonotonicityofµ now implies that µ (E) = µ (E B n )+µ (E B c n) µ (E B n )+µ (E A c ) = n µ (E A i )+µ (E A c ). Letting n and using the σ-subadditivity of µ,weconclude µ (E) µ (E A i )+µ (E A c ) µ (E A)+µ (E A c ). Hence (1.11) holds and the proof is complete. We come to an extension theorem for measures that makes slightly weaker assumptions than Carathéodory s theorem (Theorem 1.41). Theorem 1.53 (Extension theorem for measures). Let A be a semiring and let µ : A [0, ] be an additive, σ-subadditive and σ-finite set function with µ( ) =0. Then there is a unique σ-finite measure µ : σ(a) [0, ] such that µ(a) = µ(a) for all A A. Proof. As A is a π-system, uniqueness follows by Lemma 1.42. In order to establish the existence of µ,wedefineasinlemma1.47 { } µ (A) :=inf µ(f ): F U(A) for any A 2 Ω. F F By Lemma 1.31(ii), µ is monotone. Hence µ is an outer measure by Lemma 1.47 and µ (A) =µ(a) for any A A.WehavetoshowthatM(µ ) σ(a). Since M(µ ) is a σ-algebra (Lemma 1.52), it is enough to show A M(µ ). To this end, let A Aand E 2 Ω with µ (E) <. Fixε>0. Thenthereisa sequence E 1,E 2,... Asuch that E E n and µ(e n ) µ (E)+ε.

1.3 The Measure Extension Theorem 25 Define B n := E n A A.SinceA is a semiring, for every n N there is an m n N and sets Cn,...,C 1 m n n Asuch that E n \ A = E n \ B n = m n Cn.Hence k k=1 E A B n, E A c m n k=1 C k n and E n = B n m n k=1 C k n. µ is σ-subadditive and by assumption µ is additive. From µ µ (we will see A that even equality holds), we infer that ( mn ) µ (E A)+µ (E A c ) µ (B n )+ µ µ(cn) k = = µ(b n )+ k=1 m n µ(cn) k k=1 ( ) m n µ(b n )+ µ(cn) k k=1 µ(e n ) µ (E)+ε. Hence µ (E A) +µ (E A c ) µ (E) and thus A M(µ ), which implies A M(µ ).Nowdefine µ : σ(a) [0, ], A µ (A). ByLemma1.51, µ is a measure and µ is σ-finite since µ is σ-finite. Example 1.54 (Lebesgue measure, continuation of Example 1.39). We aim at extending the volume µ((a, b]) = n (b i a i ) that was defined on the class of rectangles A = {(a, b] : a, b R n,a < b} to the Borel σ-algebra B(R n ).In order to check the assumptions of Theorem 1.53, we only have to check that µ is σ-subadditive. To this end, let (a, b], (a(1),b(1)], (a(2),b(2)],... Awith We show that (a, b] µ((a, b]) (a(k),b(k)]. k=1 µ ( (a(k),b(k)] ). (1.12) k=1 For this purpose we use a compactness argument to reduce (1.12) to finite additivity. Fix ε>0. Foranyk N, chooseb ε (k) >b(k) such that µ ( (a(k),b ε (k)] ) µ ( (a(k),b(k)] ) + ε 2 k 1.

26 1 Basic Measure Theory Further choose a ε compact and (a, b) such that µ((a ε,b]) µ((a, b]) ε 2. Now [a ε,b] is (a(k),b ε (k)) k=1 (a(k),b(k)] (a, b] [a ε,b], k=1 whence there exists a K 0 such that K 0 k=1 (a(k),b ε(k)) (a ε,b]. Asµ is (finitely) subadditive (see Lemma 1.31(iii)), we obtain µ((a, b]) ε 2 + µ((a ε,b]) ε K 0 2 + µ((a(k),b ε (k)]) k=1 ε K 0 2 + ( ε 2 k 1 + µ((a(k),b(k)]) ) ε + k=1 k=1 Letting ε 0 yields (1.12); hence µ is σ-subadditive. µ((a(k),b(k)]). Combining the last example with Theorem 1.53, we have shown the following theorem. Theorem 1.55 (Lebesgue measure). There exists a uniquely determined measure λ n on ( R n, B(R n ) ) with the property that λ n ((a, b]) = n (b i a i ) for all a, b R n with a<b. λ n is called the Lebesgue measure on ( R n, B(R n ) ) or Lebesgue-Borel measure. Example 1.56 (Lebesgue-Stieltjes measure). Let Ω = R and A = {(a, b] : a, b R, a b}. A is a semiring and σ(a) =B(R), whereb(r) is the Borel σ-algebra on R. Furthermore,letF : R R be monotone increasing and right continuous. We define a set function µ F : A [0, ), (a, b] F (b) F (a). Clearly, µ F ( ) =0and µ F is additive. Let (a, b], (a(1),b(1)], (a(2),b(2)],... Asuch that (a, b] (a(n),b(n)]. Fix ε>0 and choose a ε (a, b) such that F (a ε ) F (a) <ε/2. This is possible, as F is right continuous. For any k N, chooseb ε (k) >b(k) such that F (b ε (k)) F (b(k)) <ε2 k 1. As in Example 1.54, it can be shown that µ F ((a, b]) ε + k=1 µ F ((a(k),b(k)]). This implies that µ F is σ-subadditive. By Theorem 1.53, we can extend µ F uniquely to a σ-finite measure µ F on B(R).

1.3 The Measure Extension Theorem 27 Definition 1.57 (Lebesgue-Stieltjes measure). The measure µ F on ( R, B(R) ) defined by µ F ((a, b]) = F (b) F (a) for all a, b R with a<b is called the Lebesgue-Stieltjes measure with distribution function F. Example 1.58. Important special cases for the Lebesgue-Stieltjes measure are the following: (i) If F (x) =x,thenµ F = λ 1 is the Lebesgue measure on R. (ii) Let f : R [0, ) be continuous and let F (x) = x 0 f(t) dt for all x R. Then µ F is the extension of the premeasure with density f that was defined in Example 1.30(ix). (iii) Let x 1,x 2,... R and α n 0 for all n N such that α n <. Then F = α n 1 [xn, ) is the distribution function of the finite measure µ F = α nδ xn. (iv) Let x 1,x 2,... R such that µ = δ x n is a σ-finite measure. Then µ is a Lebesgue-Stieltjes measure if and only if the sequence (x n ) n N does not have a limit point. Indeed, if (x n ) n N does not have a limit point, then by the Bolzano- Weierstraß theorem, #{n N : x n [ K, K]} < for every K>0. Ifwelet F (x) =#{n N : x n [0,x]} for x 0 and F (x) = #{n N : x n [x, 0)}, then µ = µ F.Ontheotherhand,ifµ is a Lebesgue-Stieltjes measure, this is µ = µ F for some F,then#{n N : x n ( K, K]} = F (K) F ( K) < for all K>0; hence(x n ) n N does not have a limit point. (v) If lim x ( F (x) F ( x) ) =1,thenµF is a probability measure. We will now have a closer look at the case where µ F is a probability measure. Definition 1.59 (Distribution function). Arightcontinuousmonotoneincreasing function F : R [0, 1] with F ( ) := lim F (x) = 0 and F ( ) := x F (x) =1is called a (proper) probability distribution function (p.d.f.). If we lim x only have F ( ) 1 instead of F ( ) =1,thenF is called a (possibly) defective p.d.f. If µ is a (sub-)probability measure on ( R, B(R) ),thenf µ : x µ((,x]) is called the distribution function of µ. Clearly, F µ is right continuous and F ( ) =0,sinceµ is upper semicontinuous and finite (Theorem 1.36). Since µ is lower semicontinuous, we have F ( ) =µ(r); hence F µ is indeed a (possibly defective) distribution function if µ is a (sub-)probability measure. The argument of Example 1.56 yields the following theorem.

28 1 Basic Measure Theory Theorem 1.60. The map µ F µ is a bijection from the set of probability measures on ( R, B(R) ) to the set of probability distribution functions, respectively from the set of sub-probability measures to the set of defective distribution functions. We have established that every finite measure on ( R, B(R) ) is a Lebesgue-Stieltjes measure for some function F.For σ-finite measures the corresponding statement does not hold in this generality as we saw in Example 1.58(iv). We come now to a theorem that combines Theorem 1.55 with the idea of Lebesgue- Stieltjes measures. Later we will see that the following theorem is valid in greater generality. In particular, the assumption that the factors are of Lebesgue-Stieltjes type can be dropped. Theorem 1.61 (Finite products of measures). Let n N and let µ 1,...,µ n be finite measures or, more generally, Lebesgue-Stieltjes measures on ( R, B(R) ). Then there exists a unique σ-finite measure µ on ( R n, B(R n ) ) such that µ((a, b]) = n µ i ((a i,b i ]) for all a, b R n with a<b. We call µ =: n µ i the product measure of the measures µ 1,...,µ n. Proof. The proof is the same as for Theorem 1.55. One has to check that the intervals (a, b ε ] and so on can be chosen such that µ((a, b ε ]) <µ((a, b]) + ε.hereweemploy the right continuity of the increasing function F i that belongs to µ i. The details are left as an exercise. Remark 1.62. Later we will see in Theorem 14.14 that the statement holds even for arbitrary σ-finite measures µ 1,...,µ n on arbitrary (even different) measurable spaces. One can even construct infinite products if all factors are probability spaces (Theorem 14.36). Example 1.63 (Infinite product measure, continuation of Example 1.40). Let E be a finite set and let Ω = E N be the space of E-valued sequences. Further, let (p e ) e E be a probability vector. Define a content µ on A = {[ω 1,...,ω n ] : ω 1,...,ω n E, n N} by µ([ω 1,...,ω n ]) = n p ωi. We aim at extending µ to a measure on σ(a). Inordertochecktheassumptionsof Theorem 1.53, we have to show that µ is σ-subadditive. As in the preceding example, we use a compactness argument.

1.3 The Measure Extension Theorem 29 Let A, A 1,A 2,... Aand A A n. We are done if we can show that there exists an N N such that N A A n. (1.13) Indeed, due to the (finite) subadditivity of µ (see Lemma 1.31(iii)), this implies µ(a) N µ(a n ) µ(a n );henceµ is σ-subadditive. We now give two different proofs for (1.13). 1. Proof. The metric d from (1.9) induces the product topology on Ω; hence,as remarked in Example 1.40, (Ω,d) is a compact metric space. Every A Ais closed and thus compact. Since every A n is also open, A can be covered by finitely many A n ;hence(1.13)holds. 2. Proof. We now show by elementary means the validity of (1.13). The procedure imitates the proof that Ω is compact. Let B n := A\ n A i. We assume B n for all n N in order to get a contradiction. By Dirichlet s pigeonhole principle (recall that E is finite), we can choose ω 1 E such that [ω 1 ] B n for infinitely many n N. SinceB 1 B 2...,weobtain [ω 1 ] B n for all n N. Successively choose ω 2,ω 3,... E in such a way that [ω 1,...,ω k ] B n for all k, n N. B n is a disjoint union of certain sets C n,1,...,c n,mn A. Hence, for every n N there is an i n {1,...,m n } such that [ω 1,...,ω k ] C n,in for infinitely many k N. Since[ω 1 ] [ω 1,ω 2 ]...,weobtain [ω 1,...,ω k ] C n,in for all k, n N. For fixed n N and large k, we have [ω 1,...,ω k ] C n,in. Hence ω = (ω 1,ω 2,...) C n,in B n. This implies B n, contradictingtheassumption. Combining the last example with Theorem 1.53, we have shown the following theorem.

30 1 Basic Measure Theory Theorem 1.64 (Product measure, Bernoulli measure). Let E be a finite nonempty set and Ω = E N.Let(p e ) e E be a probability vector. Then there exists a unique probability measure µ on σ(a) =B(Ω) such that µ([ω 1,...,ω n ]) = n p ωi for all ω 1,...,ω n E and n N. µ is called the product measure or Bernoulli measure on Ω with weights (p e ) e E. We write ( e E p eδ e ) N := µ. Theσ-algebra (2 E ) N := σ(a) is called the product σ-algebra on Ω. We will study product measures in a systematic way in Chapter 14. The measure extension theorem yields an abstract statement of existence and uniqueness for measures on σ(a) that were first defined on a semiring A only. The following theorem, however, shows that the measure of a set from σ(a) can be well approximated by finite and countable operations with sets from A. Denote by A B := (A \ B) (B \ A) for A, B Ω (1.14) the symmetric difference of the two sets A and B. Theorem 1.65 (Approximation theorem for measures). Let A 2 Ω be a semiring and let µ be a measure on σ(a) that is σ-finite on A. (i) For any A σ(a) and ε>0, there ( exist mutually disjoint sets A 1,A 2,... A such that A ) A n and µ A n \ A <ε. (ii) For any A σ(a) with µ(a) < and any ε>0, ( there exists an) n N and mutually disjoint sets A 1,...,A n Asuch that µ A n A k <ε. (iii) For any A M(µ ), there are sets A,A + σ(a) with A A A + and µ(a + \ A )=0. Remark 1.66. (iii) implies that (i) and (ii) also hold for A M(µ ) (with µ instead of µ). If A is an algebra, then in (ii) for any A σ(a), weevenhave inf B A µ(a B) =0. Proof. (ii) As µ and the outer measure µ coincide on σ(a) and since µ(a) is finite on σ(a), by the very definition of µ (see Lemma 1.47) there exists a covering B 1,B 2,... Aof A such that µ(a) µ(b i ) ε/2. k=1

1.3 The Measure Extension Theorem 31 Let n N with i=n+1 sets C, D, E, wehave µ(b i ) < ε 2 (such an n exists since µ(a) < ). For any three C D =(D \ C) (C \ D) (D \ C) (C \ (D E)) E (C (D E)) E. Choosing C = A, D = n B i and E = i=n+1 B i, this yields ( ) ( ) ( n ) µ A B i µ A B i + µ B i i=n+1 ( ) µ B i µ(a)+ ε 2 ε. As A is a semiring, there exist a k N and A 1,...,A k Asuch that n n B i = B 1 i=2 i 1 (B i \ B j ) =: j=1 k A i. (i) Let A σ(a) and E n Ω, E n σ(a) with µ(e n ) < for any n N. For every n N, chooseacovering(b n,m ) m N of A E n with µ(a E n ) µ(b n,m ) 2 n ε. m=1 (This is possible due to the definition of the outer measure µ,whichcoincideswith µ on A.) Let B n,m = A n for certain A n A, n N (Exercise 1.1.1). Then m, ( ) ( ) µ A n \ A = µ B n,m \ A m=1 ( ( µ Bn,m \ (A E n ) )) m=1 (( ) ) µ(b n,m ) µ(a E n ) ε. m=1

32 1 Basic Measure Theory (iii) Let A M(µ ) and (E n ) n N as above. For any m, n N, choosea n,m σ(a) such that A n,m A E n and µ (A n,m ) µ (A E n )+ 2 n m. Define A m := A n,m σ(a). ThenA m A and µ (A m \ A) 1 m. Define A + := m=1 A m.thenσ(a) A + A and µ (A + \ A) =0. Similarly, choose (A ) c σ(a) with (A ) c A c and µ ((A ) c \ A c )=0.ThenA + A A and µ(a + \ A )=µ (A + \ A )=µ (A + \ A)+µ (A \ A )=0. Remark 1.67 (Regularity of measures). (Compare with Theorem 13.6.) Let λ n be the Lebesgue measure on ( R n, B(R n ) ).LetA be the semiring of rectangles of the form (a, b] R n ;henceb(r n )=σ(a) by Theorem 1.23. By the approximation theorem, for any A B(R n ) and ε>0 there exist countably many A 1,A 2,... A with ( ) λ n A i \ A <ε/2. For any A i, there exists an open rectangle B i A i with λ n (B i \ A i ) <ε2 i 1 (upper semicontinuity of λ n ). Hence U = B i is an open set U A with λ n (U \ A) <ε. This property of λ n is called outer regularity. If λ n (A) is finite, then for any ε>0 there exists a compact K A such that λ n (A \ K) <ε. This property of λ n is called inner regularity. Indeed,letN > 0 be such that λ n (A) λ n (A [ N,N] n ) <ε/2.chooseanopensetu (A [ N,N] n ) c such that λ n (U \ (A [ N,N] n ) c ) <ε/2, andletk := [ N,N] n \ U A. Definition 1.68 (Null set). Let (Ω,A,µ) be a measure space. (i) A set A A is called a µ-null set, or briefly a null set, if µ(a) =0.ByN µ we denote the class of all subsets of µ-null sets. (ii) Let E(ω) be a property that a point ω Ω can have or not have. We say that E holds µ-almost everywhere (a.e.) or for almost all (a.a.) ω if there exists a null set N such that E(ω) holds for every ω Ω \ N.IfA Aand if there exists a null set N such that E(ω) holds for every ω A \ N, thenwesaythate holds almost everywhere on A. If µ = P is a probability measure, then we say that E holds P -almost surely (a.s.), respectively almost surely on A. (iii) Let A, B A,andassumethatthereisanullsetN such that A B N. Then we write A = B (mod µ).