MATH31011/MATH41011/MATH61011: FOURIER ANALYSIS AND LEBESGUE INTEGRATION. Chapter 4: Fourier Series and $L^2([-\pi,\pi],\mu)$

Square Integrable Functions

Definition. Let $f : [-\pi,\pi] \to \mathbb{R}$ be measurable. We say that $f$ is square integrable if $|f|^2$ is integrable, i.e., if
\[ \int_{-\pi}^{\pi} |f|^2 \, d\mu < +\infty. \]
For future use, we shall write
\[ \|f\|_2 = \left( \frac{1}{\pi} \int_{-\pi}^{\pi} |f|^2 \, d\mu \right)^{1/2} \]
and we shall call this the $L^2$-norm of $f$ (pronounced "L-two", not "L-squared").

Remarks.
(i) Of course, as $f$ is real valued, $|f|^2 = f^2$, so we could just write $\int f^2 \, d\mu$. However, the definition can be extended to complex valued functions, when we really do want $|f|^2$.
(ii) The factor $1/\pi$ is not essential but is introduced to be convenient for the calculations with Fourier series which occur later in the chapter.

We'd like to use $\|\cdot\|_2$ to give a distance (more formally, a metric) on the set of square integrable functions by
\[ \mathrm{dist}(f,g) = \|f-g\|_2 = \left( \frac{1}{\pi} \int_{-\pi}^{\pi} |f-g|^2 \, d\mu \right)^{1/2}. \]
However, if $f = g$ $\mu$-a.e. then $\int |f-g|^2 \, d\mu = 0$, so we would have distinct functions lying distance zero apart. To avoid this, we consider equivalence classes of square integrable functions under the equivalence relation
\[ f \sim g \iff f = g \ \ \mu\text{-a.e.} \]
Thus an equivalence class has the form
\[ [f] = \{ g : [-\pi,\pi] \to \mathbb{R} \ : \ f = g \ \ \mu\text{-a.e.} \}, \]
for some square integrable function $f : [-\pi,\pi] \to \mathbb{R}$.
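As a numerical illustration of the $L^2$-norm just defined, here is a minimal sketch. It assumes the function is continuous, so that the Lebesgue integral agrees with the Riemann integral and can be approximated by a midpoint rule; the name `l2_norm` and the number of sample points are illustrative choices, not part of the notes.

```python
import math

def l2_norm(f, n=20000):
    """Approximate ||f||_2 = sqrt((1/pi) * integral_{-pi}^{pi} f(x)^2 dx).

    Assumption: f is continuous, so a midpoint rule on n equal
    subintervals of [-pi, pi] approximates the integral well.
    """
    h = 2 * math.pi / n
    total = sum(f(-math.pi + (i + 0.5) * h) ** 2 for i in range(n))
    return math.sqrt(total * h / math.pi)

norm_sin = l2_norm(math.sin)        # exact value: 1
norm_one = l2_norm(lambda x: 1.0)   # exact value: sqrt(2)
norm_x = l2_norm(lambda x: x)       # exact value: sqrt(2 * pi^2 / 3)
```

The value $\|1\|_2 = \sqrt{2}$ rather than $1$ is a consequence of the normalising factor $1/\pi$; this is why the constant function $1/\sqrt{2}$ appears later in the chapter.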

Definition. We write $L^2([-\pi,\pi],\mu,\mathbb{R})$ for the set of equivalence classes of square integrable functions on $[-\pi,\pi]$.

After having made this definition, we shall proceed more informally and write $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ (rather than the pedantic $[f] \in L^2([-\pi,\pi],\mu,\mathbb{R})$) whenever $f : [-\pi,\pi] \to \mathbb{R}$ is square integrable. However, bear in mind that two functions which are equal almost everywhere are now considered to be the same. We may also write $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ when $f : [-\pi,\pi] \to \mathbb{R} \cup \{\pm\infty\}$ is such that $\int |f|^2 \, d\mu < +\infty$, but note that such an $f$ must be finite $\mu$-a.e. and hence equal $\mu$-a.e. to a function with values in $\mathbb{R}$.

Now we shall show that $L^2([-\pi,\pi],\mu,\mathbb{R})$ is a vector space and that $d(f,g) = \|f-g\|_2$ is a metric on $L^2([-\pi,\pi],\mu,\mathbb{R})$. First we need a technical result.

Lemma 4.1. If $f, g \in L^2([-\pi,\pi],\mu,\mathbb{R})$ then
\[ \frac{1}{\pi} \int_{-\pi}^{\pi} fg \, d\mu \le \frac{\|f\|_2^2 + \|g\|_2^2}{2}, \]
with equality if and only if $f = g$ $\mu$-a.e. (i.e. if $f$ and $g$ are the same element of $L^2([-\pi,\pi],\mu,\mathbb{R})$).

Proof. We have
\[ 0 \le (f(x) - g(x))^2 = f(x)^2 - 2f(x)g(x) + g(x)^2, \]
so
\[ f(x)g(x) \le \frac{f(x)^2 + g(x)^2}{2}. \tag{$*$} \]
Note that $fg$ is integrable since, as above but for $(|f| - |g|)^2$, we have $|fg| \le (f^2 + g^2)/2$. Multiplying ($*$) by $1/\pi$ and integrating gives the first statement. Clearly, we get equality if and only if $\int (f-g)^2 \, d\mu = 0$. By Proposition 3.1, this holds if and only if $(f-g)^2 = 0$ $\mu$-a.e., i.e., $f = g$ $\mu$-a.e.

Corollary. If $f, g \in L^2([-\pi,\pi],\mu,\mathbb{R})$ then
\[ \frac{1}{\pi} \int_{-\pi}^{\pi} |fg| \, d\mu \le \frac{\|f\|_2^2 + \|g\|_2^2}{2}, \]
with equality if and only if $|f| = |g|$ $\mu$-a.e.

Lemma 4.2. If $f, g \in L^2([-\pi,\pi],\mu,\mathbb{R})$ then
(i) Hölder Inequality:
\[ \frac{1}{\pi} \int_{-\pi}^{\pi} |fg| \, d\mu \le \|f\|_2 \|g\|_2, \]
with equality if and only if, for some $c \in \mathbb{R}$, $|f| = c|g|$ $\mu$-a.e. or $\|f\|_2 \|g\|_2 = 0$.
(ii) Cauchy-Schwarz Inequality:
\[ \left| \frac{1}{\pi} \int_{-\pi}^{\pi} fg \, d\mu \right| \le \|f\|_2 \|g\|_2, \]
with equality if and only if, for some $c \in \mathbb{R}$, $f = cg$ $\mu$-a.e. or $\|f\|_2 \|g\|_2 = 0$.

Proof. Exercise.
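The inequalities of Lemma 4.1 and Lemma 4.2 can be sanity-checked numerically. The sketch below is only an illustration under assumptions: the two test functions are arbitrary continuous choices, the integrals are approximated by a midpoint rule, and the helper names `inner` and `norm` are hypothetical.

```python
import math

def inner(f, g, n=20000):
    """Midpoint-rule approximation of (1/pi) * integral_{-pi}^{pi} f(x) g(x) dx."""
    h = 2 * math.pi / n
    xs = (-math.pi + (i + 0.5) * h for i in range(n))
    return sum(f(x) * g(x) for x in xs) * h / math.pi

def norm(f):
    """Approximate L^2-norm, using the same normalisation 1/pi."""
    return math.sqrt(inner(f, f))

f = lambda x: x
g = lambda x: math.exp(x / 2)

lhs = inner(f, g)                                # (1/pi) * integral of f g
lemma_41_bound = (norm(f) ** 2 + norm(g) ** 2) / 2
cauchy_schwarz_bound = norm(f) * norm(g)

# Equality case of Cauchy-Schwarz: g2 = c f with c = -3.
g2 = lambda x: -3.0 * x
equality_gap = norm(f) * norm(g2) - abs(inner(f, g2))
```

In this example the Cauchy-Schwarz bound is the sharper of the two, and the gap vanishes (up to rounding) when one function is a constant multiple of the other, as Lemma 4.2(ii) predicts.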

Lemma 4.3. If $f, g \in L^2([-\pi,\pi],\mu,\mathbb{R})$ then so is $f + g$, and
\[ \|f+g\|_2 \le \|f\|_2 + \|g\|_2. \]

Proof. Clearly $f + g$ is measurable. Also
\[
\begin{aligned}
\frac{1}{\pi} \int_{-\pi}^{\pi} (f+g)^2 \, d\mu &= \frac{1}{\pi} \int_{-\pi}^{\pi} (f^2 + 2fg + g^2) \, d\mu \\
&\le \frac{1}{\pi} \int_{-\pi}^{\pi} (f^2 + 2|fg| + g^2) \, d\mu \\
&\le \|f\|_2^2 + 2\|f\|_2 \|g\|_2 + \|g\|_2^2 = (\|f\|_2 + \|g\|_2)^2
\end{aligned}
\]
(where we have used Hölder's inequality for the second inequality). Hence $f + g \in L^2([-\pi,\pi],\mu,\mathbb{R})$ and taking square roots gives the inequality (called Minkowski's Inequality).

Corollary. $L^2([-\pi,\pi],\mu,\mathbb{R})$ is a vector space over $\mathbb{R}$.

Proof. It is trivial that if $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ and $c \in \mathbb{R}$ then $cf \in L^2([-\pi,\pi],\mu,\mathbb{R})$ (and $\|cf\|_2 = |c| \, \|f\|_2$). By the above lemma, if $f, g \in L^2([-\pi,\pi],\mu,\mathbb{R})$ then $f + g \in L^2([-\pi,\pi],\mu,\mathbb{R})$.

Definition. A metric on a set $X$ is a function $d : X \times X \to \mathbb{R}$ such that, for $x, y, z \in X$,
(1) $d(x,y) \ge 0$ and $d(x,y) = 0$ if and only if $x = y$;
(2) $d(x,y) = d(y,x)$;
(3) $d(x,z) \le d(x,y) + d(y,z)$.

Theorem 4.4. $d(f,g) = \|f-g\|_2$ is a metric on $L^2([-\pi,\pi],\mu,\mathbb{R})$.

Proof. It is immediate from its definition that $\|f-g\|_2 \ge 0$. Suppose that
\[ \|f-g\|_2^2 = \frac{1}{\pi} \int_{-\pi}^{\pi} |f-g|^2 \, d\mu = 0. \]
Applying Proposition 3.1, we see that $|f-g|^2 = 0$ $\mu$-a.e., i.e., $f = g$ $\mu$-a.e., so $f$ and $g$ represent the same element of $L^2([-\pi,\pi],\mu,\mathbb{R})$. This shows (1). Condition (2) follows immediately from the definition. For (3), assume that $f, g, h \in L^2([-\pi,\pi],\mu,\mathbb{R})$. By Minkowski's Inequality,
\[ d(f,h) = \|f-h\|_2 = \|(f-g) + (g-h)\|_2 \le \|f-g\|_2 + \|g-h\|_2 = d(f,g) + d(g,h), \]
as required.

The next important result says that square integrable functions may be approximated arbitrarily well by continuous functions with respect to $\|\cdot\|_2$.

Theorem 4.5. Continuous functions are $\|\cdot\|_2$-dense in $L^2([-\pi,\pi],\mu,\mathbb{R})$. In other words, given $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ and $\epsilon > 0$, we can find a continuous function $g : [-\pi,\pi] \to \mathbb{R}$ such that $\|f-g\|_2 < \epsilon$.

The proof is given in an appendix and is not examinable. The following slight strengthening will be needed later on.

Corollary 4.6. Given $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ and $\epsilon > 0$, we can find a continuous function $g : [-\pi,\pi] \to \mathbb{R}$ such that $g(-\pi) = g(\pi)$ and $\|f-g\|_2 < \epsilon$.

Proof. Exercise. (Hint: use Theorem 4.5 to find a continuous function $h : [-\pi,\pi] \to \mathbb{R}$ such that $\|f-h\|_2 < \epsilon/2$. Modify $h$ on $[-\pi,-\pi+\delta] \cup [\pi-\delta,\pi]$, for an appropriately small $\delta > 0$, to obtain a continuous function $g : [-\pi,\pi] \to \mathbb{R}$ with the required properties.)

Definition. We say that a sequence of functions $f_n \in L^2([-\pi,\pi],\mu,\mathbb{R})$ converges to $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ if
\[ \lim_{n\to\infty} \|f_n - f\|_2 = 0. \]

Definition. Recall that a metric space $(X,d)$ is said to be complete if every Cauchy sequence converges to a point in $X$. (A sequence $x_n \in X$ is a Cauchy sequence if, for all $\epsilon > 0$, there exists $N \ge 1$ such that if $n, m \ge N$ then $d(x_n, x_m) < \epsilon$.)

Theorem 4.7. $L^2([-\pi,\pi],\mu,\mathbb{R})$ is complete.

Proof. Let $f_n$, $n \ge 1$, be a Cauchy sequence in $L^2([-\pi,\pi],\mu,\mathbb{R})$ with respect to the metric $d(f,g) = \|f-g\|_2$ determined by the norm $\|\cdot\|_2$. By definition, this means that, given $\epsilon > 0$, we can find an integer $N \ge 1$ such that
\[ n, m \ge N \implies \|f_n - f_m\|_2 < \epsilon. \]
Applying this definition with $\epsilon = 2^{-i}$, $i = 1, 2, \ldots$, we can find an increasing sequence of positive integers $N_i$ such that
\[ n, m \ge N_i \implies \|f_n - f_m\|_2 < \frac{1}{2^i}. \]
Define functions $g_0 = 0$ and $g_i = f_{N_i}$, $i \ge 1$. Then
\[ \|g_{i+1} - g_i\|_2 = \|f_{N_{i+1}} - f_{N_i}\|_2 < \frac{1}{2^i} \]
for $i \ge 1$. Thus, by the Comparison Test, the series $\sum_{i=0}^{\infty} \|g_{i+1} - g_i\|_2$ converges. Let the sum be denoted by $S$.

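Now consider a new sequence of functions $h_n$, $n \ge 1$, defined by
\[ h_n(x) = \sum_{i=0}^{n-1} |g_{i+1}(x) - g_i(x)|. \]
For any fixed $x$, the sequence of numbers $h_n(x)$ is increasing, so we can define
\[ h(x) := \lim_{n\to\infty} h_n(x) \in \mathbb{R} \cup \{+\infty\}. \]
Thus we have a measurable function $h : [-\pi,\pi] \to \mathbb{R} \cup \{+\infty\}$. We want to show that $h \in L^2([-\pi,\pi],\mu,\mathbb{R})$. First note that, by Minkowski's Inequality,
\[ \|h_n\|_2 \le \sum_{i=0}^{n-1} \|g_{i+1} - g_i\|_2 \le S, \]
so that
\[ \int_{-\pi}^{\pi} h_n^2 \, d\mu = \pi \|h_n\|_2^2 \le \pi S^2. \]
Since $h_n^2$ is an increasing sequence of non-negative measurable functions converging pointwise to $h^2$, the Monotone Convergence Theorem tells us that
\[ \int_{-\pi}^{\pi} h^2 \, d\mu = \lim_{n\to\infty} \int_{-\pi}^{\pi} h_n^2 \, d\mu \le \pi S^2, \]
so $|h|^2 = h^2$ is integrable. Thus, $h \in L^2([-\pi,\pi],\mu,\mathbb{R})$, as we claimed.

Since $h^2$ is integrable it is finite $\mu$-a.e. Thus, $h$ is finite $\mu$-a.e. For each $x \in [-\pi,\pi]$ for which $h(x)$ is finite, the series of real numbers
\[ \sum_{i=0}^{\infty} (g_{i+1}(x) - g_i(x)) \]
converges absolutely and hence converges. We will denote its sum by $g(x)$. For $x$ with $h(x) = +\infty$, we set $g(x) = 0$. Note that
\[ \sum_{i=0}^{n-1} (g_{i+1}(x) - g_i(x)) = g_n(x) - g_0(x) = g_n(x). \]
Hence,
\[ \lim_{n\to\infty} g_n(x) = \lim_{n\to\infty} \sum_{i=0}^{n-1} (g_{i+1}(x) - g_i(x)) = g(x), \]
for $\mu$-a.e. $x$. Moreover, $|g_n(x)| \le h_n(x) \le h(x)$ for every $n$, and
\[ \lim_{n\to\infty} |g_n(x)| \le \lim_{n\to\infty} \sum_{i=0}^{n-1} |g_{i+1}(x) - g_i(x)| = \lim_{n\to\infty} h_n(x) = h(x), \]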

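for $\mu$-a.e. $x$. Thus $|g(x)| \le h(x)$ for $\mu$-a.e. $x$ and so $|g|^2$ is integrable, giving $g \in L^2([-\pi,\pi],\mu,\mathbb{R})$. We also observe that
\[ |g(x) - g_n(x)|^2 \le (|g(x)| + |g_n(x)|)^2 \le (2h(x))^2. \]
Since $\lim_{n\to\infty} |g(x) - g_n(x)|^2 = 0$ for $\mu$-a.e. $x$, the Dominated Convergence Theorem tells us that
\[ \lim_{n\to\infty} \int_{-\pi}^{\pi} |g - g_n|^2 \, d\mu = 0. \]
This implies that
\[ \lim_{n\to\infty} \|g - g_n\|_2 = 0. \]
Hence, given $\epsilon > 0$, we can choose an $i$ sufficiently large that $\|g - g_i\|_2 < \epsilon/2$ and $2^{-i} < \epsilon/2$. Recall that $g_i = f_{N_i}$. Thus, whenever $n \ge N_i$, we have
\[ \|g - f_n\|_2 \le \|g - g_i\|_2 + \|g_i - f_n\|_2 < \frac{\epsilon}{2} + \frac{\epsilon}{2} = \epsilon. \]
This shows that $\lim_{n\to\infty} \|g - f_n\|_2 = 0$, i.e., the sequence $f_n$ converges in the space $L^2([-\pi,\pi],\mu,\mathbb{R})$.

Inner Products and Hilbert Spaces

Definition. Let $V$ be a vector space over $\mathbb{R}$. A map $\langle \cdot, \cdot \rangle : V \times V \to \mathbb{R}$ is called an inner product if, for all $u, v, w \in V$ and $a, b \in \mathbb{R}$,
(1) $\langle u, v \rangle = \langle v, u \rangle$;
(2) $\langle au + bv, w \rangle = a \langle u, w \rangle + b \langle v, w \rangle$;
(3) $\langle u, u \rangle \ge 0$ and $\langle u, u \rangle = 0$ if and only if $u = 0$.

Lemma 4.8. The formula
\[ \langle f, g \rangle = \frac{1}{\pi} \int_{-\pi}^{\pi} fg \, d\mu \]
defines an inner product on the vector space $L^2([-\pi,\pi],\mu,\mathbb{R})$ and $\|f\|_2 = \langle f, f \rangle^{1/2}$.

Proof. Parts (1) and (2) of the definition of inner product are easy to check. Part (3) is equivalent to the statement that $\|f\|_2 = 0$ if and only if $f$ represents the $0$ element in $L^2([-\pi,\pi],\mu,\mathbb{R})$, which follows from Proposition 3.1.

Thus the metric on $L^2([-\pi,\pi],\mu,\mathbb{R})$ is obtained from this inner product by
\[ d(f,g) = \|f-g\|_2 = \langle f-g, f-g \rangle^{1/2}. \]
(In fact, any inner product defines a metric in this way.)

Definition. A vector space with an inner product which is complete with respect to the associated metric is called a Hilbert space.

Thus we have already proved:

Theorem 4.9. $L^2([-\pi,\pi],\mu,\mathbb{R})$ is a Hilbert space.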

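Orthogonality

Definition. Let $V$ be a vector space with an inner product $\langle \cdot, \cdot \rangle$. We shall write $\|\cdot\|$ for the associated norm $\|v\| = \langle v, v \rangle^{1/2}$. We say that a collection of vectors $\{v_n\}$ in $V$ is orthogonal if $\langle v_n, v_m \rangle = 0$ whenever $n \ne m$. We say they are orthonormal if, in addition, $\|v_n\| = 1$ for all $n$.

We will use a couple of standard results about orthogonal/orthonormal vectors.

Lemma 4.10. Let $\{v_k\}_{k=1}^n$ be a finite orthogonal family in a vector space $V$ with an inner product and let $c_1, \ldots, c_n \in \mathbb{R}$. Then
\[ \left\| \sum_{k=1}^n c_k v_k \right\|^2 = \sum_{k=1}^n c_k^2 \|v_k\|^2. \]

Proof. Note that, by the definition of inner product and orthogonality,
\[
\begin{aligned}
\|c_1 v_1 + c_2 v_2\|^2 &= \langle c_1 v_1 + c_2 v_2, c_1 v_1 + c_2 v_2 \rangle \\
&= c_1^2 \langle v_1, v_1 \rangle + 2 c_1 c_2 \langle v_1, v_2 \rangle + c_2^2 \langle v_2, v_2 \rangle \\
&= c_1^2 \|v_1\|^2 + c_2^2 \|v_2\|^2.
\end{aligned}
\]
It is left as an exercise to complete the proof by induction.

Lemma 4.11. Let $\{v_k\}_{k=1}^n$ be a finite orthonormal family in a vector space $V$ with an inner product. Then, for $w \in V$, the minimum value of
\[ \left\| w - \sum_{k=1}^n c_k v_k \right\| \]
over all choices of $c_1, \ldots, c_n \in \mathbb{R}$ occurs when $c_k = \langle w, v_k \rangle$.

Proof. Let $c_1, \ldots, c_n$ be arbitrary real numbers and set $a_k = \langle w, v_k \rangle$. Write
\[ u = \sum_{k=1}^n a_k v_k \quad \text{and} \quad v = \sum_{k=1}^n c_k v_k. \]
By the preceding lemma,
\[ \|u\|^2 = \sum_{k=1}^n a_k^2 \quad \text{and} \quad \|v\|^2 = \sum_{k=1}^n c_k^2. \]
Also
\[ \langle w, v \rangle = \left\langle w, \sum_{k=1}^n c_k v_k \right\rangle = \sum_{k=1}^n c_k \langle w, v_k \rangle = \sum_{k=1}^n c_k a_k. \]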

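Thus
\[
\begin{aligned}
\|w - v\|^2 &= \langle w - v, w - v \rangle = \|w\|^2 - 2\langle w, v \rangle + \|v\|^2 \\
&= \|w\|^2 - 2\sum_{k=1}^n c_k a_k + \sum_{k=1}^n c_k^2 \\
&= \|w\|^2 - \sum_{k=1}^n a_k^2 + \sum_{k=1}^n (a_k - c_k)^2 \\
&= \|w - u\|^2 + \sum_{k=1}^n (a_k - c_k)^2.
\end{aligned}
\]
It follows that
\[ \|w - v\| \ge \|w - u\|, \]
with equality if and only if $\sum_{k=1}^n (a_k - c_k)^2 = 0$, i.e. if and only if $c_k = a_k = \langle w, v_k \rangle$ for all $k = 1, \ldots, n$.

Lemma 4.12. The family of functions
\[ \mathcal{F} = \left\{ \frac{1}{\sqrt{2}}, \ \cos(nx), \ \sin(nx) \ : \ n \ge 1 \right\} \]
is orthonormal in $L^2([-\pi,\pi],\mu,\mathbb{R})$.

Proof. We have
\[ \int_{-\pi}^{\pi} e^{ikx} \, d\mu = \begin{cases} 0 & \text{if } k \ne 0, \\ 2\pi & \text{if } k = 0. \end{cases} \]
Use the formulae $\cos(nx) = (e^{inx} + e^{-inx})/2$ and $\sin(nx) = (e^{inx} - e^{-inx})/(2i)$ to obtain the result.

Fourier Series

As in Chapter 1, the Fourier series of an integrable function $f$ is
\[ \frac{a_0}{\sqrt{2}} \cdot \frac{1}{\sqrt{2}} + \sum_{n=1}^{\infty} (a_n \cos(nx) + b_n \sin(nx)), \]
where
\[ a_0 = \frac{1}{\pi} \int_{-\pi}^{\pi} f(x) \, d\mu \]
and, for $n \ge 1$,
\[ a_n = \frac{1}{\pi} \int_{-\pi}^{\pi} f(x) \cos(nx) \, d\mu, \qquad b_n = \frac{1}{\pi} \int_{-\pi}^{\pi} f(x) \sin(nx) \, d\mu, \]
where we have written the integrals with respect to $\mu$ as we do not assume $f$ is Riemann integrable. (We have written the first term $a_0/2$ as $(a_0/\sqrt{2})(1/\sqrt{2})$ for a reason.)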
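The orthonormality in Lemma 4.12 and the coefficient formulas above can be checked numerically for the first few members of the family. This is a minimal sketch under assumptions: integrals are approximated by a midpoint rule, the test function $f(x) = x$ is an arbitrary choice (its coefficients are $a_n = 0$ and $b_n = 2(-1)^{n+1}/n$), and the helper names are hypothetical.

```python
import math

def inner(f, g, n=4000):
    """Midpoint-rule approximation of (1/pi) * integral_{-pi}^{pi} f(x) g(x) dx."""
    h = 2 * math.pi / n
    xs = (-math.pi + (i + 0.5) * h for i in range(n))
    return sum(f(x) * g(x) for x in xs) * h / math.pi

def cos_n(n):
    return lambda x: math.cos(n * x)

def sin_n(n):
    return lambda x: math.sin(n * x)

# First seven members of the family: 1/sqrt(2), cos(nx), sin(nx) for n = 1, 2, 3.
const = lambda x: 1 / math.sqrt(2)
family = [const] + [fn(n) for n in (1, 2, 3) for fn in (cos_n, sin_n)]

# Gram matrix of pairwise inner products; orthonormality means it is the identity.
gram = [[inner(u, v) for v in family] for u in family]

# Fourier coefficients of the example f(x) = x.
f = lambda x: x
a = [inner(f, cos_n(n)) for n in range(0, 4)]   # a_0, a_1, a_2, a_3
b = [inner(f, sin_n(n)) for n in range(1, 4)]   # b_1, b_2, b_3
```

Replacing `const` by the constant function $1$ would give a diagonal entry of $2$ instead of $1$ in the Gram matrix, which is the reason for the factor $1/\sqrt{2}$.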

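In terms of the inner product, we have
\[ \left\langle f, \frac{1}{\sqrt{2}} \right\rangle = \frac{1}{\sqrt{2}} \cdot \frac{1}{\pi} \int_{-\pi}^{\pi} f(x) \, d\mu = \frac{a_0}{\sqrt{2}}, \]
and, for $n \ge 1$,
\[ \langle f, \cos(nx) \rangle = \frac{1}{\pi} \int_{-\pi}^{\pi} f(x) \cos(nx) \, d\mu = a_n, \qquad \langle f, \sin(nx) \rangle = \frac{1}{\pi} \int_{-\pi}^{\pi} f(x) \sin(nx) \, d\mu = b_n, \]
the Fourier coefficients of $f$. Thus the Fourier series may be expressed as
\[ \left\langle f, \frac{1}{\sqrt{2}} \right\rangle \frac{1}{\sqrt{2}} + \sum_{n=1}^{\infty} \langle f, \cos(nx) \rangle \cos(nx) + \sum_{n=1}^{\infty} \langle f, \sin(nx) \rangle \sin(nx). \]
Define
\[ \varphi_n(x) = \begin{cases} \sin(-nx) & \text{if } n < 0, \\ 1/\sqrt{2} & \text{if } n = 0, \\ \cos(nx) & \text{if } n > 0. \end{cases} \]
Then the Fourier series has the succinct expression
\[ \sum_{n=-\infty}^{\infty} \langle f, \varphi_n \rangle \varphi_n(x). \]
Also,
\[
\begin{aligned}
S_n(f,x) &= \left\langle f, \frac{1}{\sqrt{2}} \right\rangle \frac{1}{\sqrt{2}} + \sum_{k=1}^{n} \langle f, \cos(kx) \rangle \cos(kx) + \sum_{k=1}^{n} \langle f, \sin(kx) \rangle \sin(kx) \\
&= \sum_{k=-n}^{n} \langle f, \varphi_k \rangle \varphi_k(x)
\end{aligned}
\]
and
\[ \sigma_n(f,x) = \frac{1}{n} \left( S_0(f,x) + S_1(f,x) + \cdots + S_{n-1}(f,x) \right) = \sum_{k=-(n-1)}^{n-1} \frac{n - |k|}{n} \langle f, \varphi_k \rangle \varphi_k(x). \]

Theorem 4.13 (Riesz-Fischer Theorem). Let $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$. Then $S_n(f,\cdot)$ converges to $f$ in $L^2([-\pi,\pi],\mu,\mathbb{R})$, i.e.,
\[ \|S_n(f,\cdot) - f\|_2 = \left( \frac{1}{\pi} \int_{-\pi}^{\pi} |S_n(f,\cdot) - f|^2 \, d\mu \right)^{1/2} \to 0, \quad \text{as } n \to +\infty. \]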
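The two expressions for the Fejér mean $\sigma_n(f,x)$ given before the theorem, namely the average of the partial sums and the weighted sum with weights $(n-|k|)/n$, can be compared numerically. The sketch below assumes the example $f(x) = x$, for which all cosine coefficients vanish and $\langle f, \sin(kx) \rangle = 2(-1)^{k+1}/k$; the function names and sample points are illustrative only.

```python
import math

def b(k):
    """Sine coefficient of the example f(x) = x: b_k = 2 * (-1)^(k+1) / k."""
    return 2.0 * (-1) ** (k + 1) / k

def S(j, x):
    """Partial sum S_j(f, x) for f(x) = x; S_0 is identically zero here."""
    return sum(b(k) * math.sin(k * x) for k in range(1, j + 1))

def sigma_average(n, x):
    """sigma_n as the average (S_0 + ... + S_{n-1}) / n."""
    return sum(S(j, x) for j in range(n)) / n

def sigma_weighted(n, x):
    """sigma_n as the weighted sum with weights (n - |k|) / n."""
    return sum((n - k) / n * b(k) * math.sin(k * x) for k in range(1, n))

points = [-3.0, -1.0, 0.5, 2.0]
max_diff = max(
    abs(sigma_average(n, x) - sigma_weighted(n, x))
    for n in (1, 5, 20)
    for x in points
)
```

The weight $(n-|k|)/n$ simply counts how many of the partial sums $S_0, \ldots, S_{n-1}$ contain the term with index $k$.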

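Before we prove this, we recall Fejér's Theorem (Theorem 1.3) from Chapter 1. Here is a slightly specialized version: Suppose that $g : [-\pi,\pi] \to \mathbb{R}$ is continuous and $g(-\pi) = g(\pi)$. Then the sequence of functions $\sigma_n(g,\cdot)$ converges uniformly to $g$, as $n \to +\infty$.

If we define
\[ \|\sigma_n(g,\cdot) - g\|_\infty = \sup_{x \in [-\pi,\pi]} |\sigma_n(g,x) - g(x)| \]
then uniform convergence is equivalent to $\lim_{n\to\infty} \|\sigma_n(g,\cdot) - g\|_\infty = 0$. Also note that, since $[-\pi,\pi]$ has length $2\pi$,
\[ \|\sigma_n(g,\cdot) - g\|_2 \le \sqrt{2} \, \|\sigma_n(g,\cdot) - g\|_\infty. \]

Proof. Suppose $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ and let $\epsilon > 0$ be given. By Corollary 4.6, we can find a continuous function $g : [-\pi,\pi] \to \mathbb{R}$ with $g(-\pi) = g(\pi)$ such that $\|f - g\|_2 < \epsilon/2$. By Fejér's Theorem, we can choose $N$ sufficiently large that
\[ n \ge N \implies \|\sigma_n(g,\cdot) - g\|_\infty < \frac{\epsilon}{2\sqrt{2}}. \]
Thus, if $n \ge N$ then
\[ \|\sigma_n(g,\cdot) - g\|_2 \le \sqrt{2} \, \|\sigma_n(g,\cdot) - g\|_\infty < \frac{\epsilon}{2}. \]
Combining the two estimates, if $n \ge N$ then
\[ \|f - \sigma_n(g,\cdot)\|_2 \le \|f - g\|_2 + \|g - \sigma_n(g,\cdot)\|_2 < \frac{\epsilon}{2} + \frac{\epsilon}{2} = \epsilon. \]
We may write
\[ \sigma_n(g,x) = \sum_{k=-(n-1)}^{n-1} d_k \varphi_k(x), \]
for some $d_k \in \mathbb{R}$. By Lemma 4.11,
\[ \left\| f - \sum_{k=-(n-1)}^{n-1} \langle f, \varphi_k \rangle \varphi_k \right\|_2 \le \left\| f - \sum_{k=-(n-1)}^{n-1} d_k \varphi_k \right\|_2. \]
Thus, if $n \ge N$ then
\[ \|f - S_{n-1}(f,\cdot)\|_2 < \epsilon, \]
which gives the required convergence.

Having proved the theorem, we are now entitled to say that for $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$,
\[ f = \sum_{n=-\infty}^{\infty} \langle f, \varphi_n \rangle \varphi_n \tag{$**$} \]
in $L^2([-\pi,\pi],\mu,\mathbb{R})$.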
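To see the $L^2$ convergence of partial sums in a concrete case, the following sketch computes $\|f - S_n(f,\cdot)\|_2$ for the example $f(x) = x$ and compares it with the value $\left(\|f\|_2^2 - \sum_{k=1}^n b_k^2\right)^{1/2}$ that comes out of the computation in the proof of Lemma 4.11. The choice of $f$, the midpoint-rule integration and all function names are assumptions made for illustration.

```python
import math

def l2_norm(f, n=10000):
    """Midpoint-rule approximation of sqrt((1/pi) * integral_{-pi}^{pi} f(x)^2 dx)."""
    h = 2 * math.pi / n
    total = sum(f(-math.pi + (i + 0.5) * h) ** 2 for i in range(n))
    return math.sqrt(total * h / math.pi)

def b(k):
    """Sine coefficient of f(x) = x: b_k = 2 * (-1)^(k+1) / k."""
    return 2.0 * (-1) ** (k + 1) / k

def partial_sum(n):
    """Return the function S_n(f, .) for f(x) = x (cosine coefficients vanish)."""
    return lambda x: sum(b(k) * math.sin(k * x) for k in range(1, n + 1))

orders = [1, 2, 4, 8, 16, 32]
errors = []      # numerically integrated ||f - S_n||_2
predicted = []   # sqrt(||f||_2^2 - sum of b_k^2), with ||f||_2^2 = 2 pi^2 / 3
for n in orders:
    s_n = partial_sum(n)
    errors.append(l2_norm(lambda x: x - s_n(x)))
    tail = 2 * math.pi ** 2 / 3 - sum(b(k) ** 2 for k in range(1, n + 1))
    predicted.append(math.sqrt(tail))
```

The errors decrease towards $0$ as $n$ grows, although slowly for this example; note that $f(x) = x$ does not satisfy $f(-\pi) = f(\pi)$, so the convergence is in $\|\cdot\|_2$ and not uniform.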

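Theorem 4.14. $\mathcal{F} = \left\{ \frac{1}{\sqrt{2}}, \cos(nx), \sin(nx) : n \ge 1 \right\} = \{\varphi_n : n \in \mathbb{Z}\}$ is a (Schauder) basis for the vector space $L^2([-\pi,\pi],\mu,\mathbb{R})$. In other words, for each $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ there is a unique sequence $\{c_n, n \in \mathbb{Z}\}$ such that $f = \sum_{n=-\infty}^{\infty} c_n \varphi_n$ in $L^2([-\pi,\pi],\mu,\mathbb{R})$, that is,
\[ \lim_{n\to\infty} \left\| f - \sum_{k=-n}^{n} c_k \varphi_k \right\|_2 = 0. \]
Consequently, the Fourier series of a function is that function written in terms of the basis.

Proof. The existence of $\{c_n : n \in \mathbb{Z}\}$ follows from ($**$) above, i.e. from the Riesz-Fischer Theorem. It remains to show that the representation is unique. Suppose that for some $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$ we have $\{c_n, n \in \mathbb{Z}\}$ and $\{d_n, n \in \mathbb{Z}\}$ such that
\[ \lim_{n\to\infty} \left\| f - \sum_{k=-n}^{n} c_k \varphi_k \right\|_2 = 0, \qquad \lim_{n\to\infty} \left\| f - \sum_{k=-n}^{n} d_k \varphi_k \right\|_2 = 0. \]
Then
\[ \lim_{n\to\infty} \left\| \sum_{k=-n}^{n} d_k \varphi_k - \sum_{k=-n}^{n} c_k \varphi_k \right\|_2 = \lim_{n\to\infty} \left\| \sum_{k=-n}^{n} (d_k - c_k) \varphi_k \right\|_2 = 0. \]
Consider any $m \in \mathbb{Z}$ and choose $n \ge |m|$. Then
\[ \left\langle \varphi_m, \sum_{k=-n}^{n} (d_k - c_k) \varphi_k \right\rangle = \sum_{k=-n}^{n} (d_k - c_k) \langle \varphi_m, \varphi_k \rangle = d_m - c_m. \]
On the other hand,
\[ \left| \left\langle \varphi_m, \sum_{k=-n}^{n} (d_k - c_k) \varphi_k \right\rangle \right| \le \|\varphi_m\|_2 \left\| \sum_{k=-n}^{n} (d_k - c_k) \varphi_k \right\|_2 \to 0, \]
as $n \to +\infty$ (using the Cauchy-Schwarz Inequality). Taking these together, we see that $c_m = d_m$, for all $m \in \mathbb{Z}$, so the uniqueness of the representation follows.

Carleson's Theorem

The Riesz-Fischer Theorem was proved in 1907. As it deals with convergence with respect to $\|\cdot\|_2$, it leaves open the question of pointwise convergence of $S_n(f,x)$ to $f(x)$. We have already seen (Theorem 1.4) that even for continuous $f$, we do not necessarily have convergence at every point. But what about almost every point? It turns out that the answer to this is yes for square integrable functions. This was proved by Lennart Carleson in 1966 and is regarded as one of the high points of mathematical analysis in the twentieth century.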

Theorem 4.15 (Carleson's Theorem). Let $f \in L^2([-\pi,\pi],\mu,\mathbb{R})$. Then $S_n(f,x)$ converges to $f(x)$ for $\mu$-a.e. $x \in [-\pi,\pi]$, as $n \to +\infty$.

Remark. In contrast, the result is false if one only assumes that $f$ is integrable. Indeed, there is an example of Kolmogorov (1923) of an integrable function $f : [-\pi,\pi] \to \mathbb{R}$ for which $S_n(f,x)$ diverges for $\mu$-a.e. $x$, and a later example of his (1926) for which $S_n(f,x)$ does not converge at any point.