The Method of Least Squares. Lectures INF2320 p. 1/80



Similar documents
MATH 4552 Cubic equations and Cardano s formulae

Nonlinear Algebraic Equations. Lectures INF2320 p. 1/88

The Method of Partial Fractions Math 121 Calculus II Spring 2015

The Heat Equation. Lectures INF2320 p. 1/88

Math 132. Population Growth: the World

Second Order Linear Nonhomogeneous Differential Equations; Method of Undetermined Coefficients. y + p(t) y + q(t) y = g(t), g(t) 0.

Polynomials. Jackie Nicholas Jacquie Hargreaves Janet Hunter

Homework 2 Solutions

Inner Product Spaces

2008 AP Calculus AB Multiple Choice Exam

Univariate Regression

G. GRAPHING FUNCTIONS

Core Maths C1. Revision Notes

ab = c a If the coefficients a,b and c are real then either α and β are real or α and β are complex conjugates

Section 4.4. Using the Fundamental Theorem. Difference Equations to Differential Equations

CS 221. Tuesday 8 November 2011

Differentiation of vectors

Techniques of Integration

LINEAR INEQUALITIES. Mathematics is the art of saying many things in many different ways. MAXWELL

Homework 2 Solutions

1.7. Partial Fractions Rational Functions and Partial Fractions. A rational function is a quotient of two polynomials: R(x) = P (x) Q(x).

Math Assignment 6

Separable First Order Differential Equations

Mark Howell Gonzaga High School, Washington, D.C.

19.7. Applications of Differential Equations. Introduction. Prerequisites. Learning Outcomes. Learning Style

Method To Solve Linear, Polynomial, or Absolute Value Inequalities:

x 2 + y 2 = 1 y 1 = x 2 + 2x y = x 2 + 2x + 1

Quick Tour of Mathcad and Examples

Objectives. Materials

MATH 304 Linear Algebra Lecture 20: Inner product spaces. Orthogonal sets.

Integration. Topic: Trapezoidal Rule. Major: General Engineering. Author: Autar Kaw, Charlie Barker.

This unit will lay the groundwork for later units where the students will extend this knowledge to quadratic and exponential functions.

MA106 Linear Algebra lecture notes

Linear and quadratic Taylor polynomials for functions of several variables.

The KaleidaGraph Guide to Curve Fitting

Numerical Solution of Differential

Numerical Solution of Differential Equations

Solving Cubic Polynomials

Algebra. Exponents. Absolute Value. Simplify each of the following as much as possible. 2x y x + y y. xxx 3. x x x xx x. 1. Evaluate 5 and 123

4.9 Graph and Solve Quadratic

Trend and Seasonal Components

By Clicking on the Worksheet you are in an active Math Region. In order to insert a text region either go to INSERT -TEXT REGION or simply

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

General Theory of Differential Equations Sections 2.8, , 4.1

Eigenvalues, Eigenvectors, and Differential Equations

CHAPTER FIVE. Solutions for Section 5.1. Skill Refresher. Exercises

minimal polyonomial Example

Solutions to Homework 10

Regression III: Advanced Methods

Integrating algebraic fractions

Visualizing Differential Equations Slope Fields. by Lin McMullin

Linear algebra and the geometry of quadratic equations. Similarity transformations and orthogonal matrices

CHAPTER 5 Round-off errors

Rootfinding for Nonlinear Equations

Differentiation and Integration

EL-9650/9600c/9450/9400 Handbook Vol. 1

Examples of Functions

Kevin James. MTHSC 102 Section 1.5 Exponential Functions and Models

Readings this week. 1 Parametric Equations Supplement. 2 Section Sections Professor Christopher Hoffman Math 124

3 Contour integrals and Cauchy s Theorem

Definition 8.1 Two inequalities are equivalent if they have the same solution set. Add or Subtract the same value on both sides of the inequality.

Vieta s Formulas and the Identity Theorem

1. (from Stewart, page 586) Solve the initial value problem.

An Introduction to Applied Mathematics: An Iterative Process

F.IF.7b: Graph Root, Piecewise, Step, & Absolute Value Functions

Nonlinear Algebraic Equations Example

Tom wants to find two real numbers, a and b, that have a sum of 10 and have a product of 10. He makes this table.

Integrals of Rational Functions

HIBBING COMMUNITY COLLEGE COURSE OUTLINE

POLYNOMIAL FUNCTIONS

1 Review of Newton Polynomials

South Carolina College- and Career-Ready (SCCCR) Pre-Calculus

8. Linear least-squares

3.2 Sources, Sinks, Saddles, and Spirals

PYTHAGOREAN TRIPLES KEITH CONRAD

Equations, Inequalities & Partial Fractions

Solutions to old Exam 1 problems

Cooling and Euler's Method

4.3 Lagrange Approximation

CURVE FITTING LEAST SQUARES APPROXIMATION

Algebra I Credit Recovery

Tim Kerins. Leaving Certificate Honours Maths - Algebra. Tim Kerins. the date

G.A. Pavliotis. Department of Mathematics. Imperial College London

Georgia Department of Education Kathy Cox, State Superintendent of Schools 7/19/2005 All Rights Reserved 1

7.6 Approximation Errors and Simpson's Rule

5 Numerical Differentiation

Algebra Cheat Sheets

8.7 Exponential Growth and Decay

Examples of Tasks from CCSS Edition Course 3, Unit 5

Calculus with Parametric Curves

7.7 Solving Rational Equations

Numerical methods for American options

Airport Planning and Design. Excel Solver

AP Calculus AB First Semester Final Exam Practice Test Content covers chapters 1-3 Name: Date: Period:

Figure 2.1: Center of mass of four points.

Math 115 Spring 2011 Written Homework 5 Solutions

A three point formula for finding roots of equations by the method of least squares

Mathematics Pre-Test Sample Questions A. { 11, 7} B. { 7,0,7} C. { 7, 7} D. { 11, 11}

Mathematics Online Instructional Materials Correlation to the 2009 Algebra I Standards of Learning and Curriculum Framework

Transcription:

The Method of Least Squares Lectures INF2320 p. 1/80

Lectures INF2320 p. 2/80 The method of least squares We study the following problem: Given n points (t i,y i ) for i = 1,...,n in the (t,y)-plane. How can we determine a function p(t) such that p(t i ) y i, for i = 1,...,n? (1)

420 415 4 p(t) 405 400 395 390 1 2 3 4 5 6 7 8 9 t Figure 1: A set of discrete data marked by small circles is approximated with a linear function p = p(t) represented by the solid line. Lectures INF2320 p. 3/80

420 415 4 p(t) 405 400 395 390 1 2 3 4 5 6 7 8 9 t Figure 2: A set of discrete data marked by small circles is approximated with a quadratic function p = p(t) represented by the solid curve. Lectures INF2320 p. 4/80

Lectures INF2320 p. 5/80 The method of least square Above we saw a discrete data set being approximated by a continuous function We can also approximate continuous functions by simpler functions, see Figure 3 and Figure 4

Lectures INF2320 p. 6/80 8 7 6 5 4 3 2 1 0 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 t Figure 3: A function y = y(t) and a linear approximation p = p(t).

Lectures INF2320 p. 7/80 30 25 20 15 5 0 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 t Figure 4: A function y = y(t) and a quadratic approximation p = p(t).

Lectures INF2320 p. 8/80 World mean temperature deviations Calendar year Computational year Temperature deviation t i y i 1991 1 0.29 1992 2 0.14 1993 3 0.19 1994 4 0.26 1995 5 0.28 1996 6 0.22 1997 7 0.43 1998 8 0.59 1999 9 0.33 2000 0.29 Table 1: The global annual mean temperature deviation measured in C for years 1991-2000.

0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 Figure 5: The global annual mean temperature deviation measurements for the period 1991-2000. Lectures INF2320 p. 9/80

Lectures INF2320 p. /80 Approximating by a constant We will study how this set of data can be approximated by simple functions First, how can this data set be approximated by a constant function p(t) = α? The most obvious guess would be to choose α as the arithmetic average α = 1 We will study this guess in more detail y i = 0.312 (2)

Lectures INF2320 p. 11/80 Approximating by a constant Assume that we want the solution to minimize the function F(α) = (α y i ) 2 (3) The function F measures a sort of deviation from α to the set of data (t i,y i ) We want to find the α that minimizes F(α), i.e. we want to find α such that F (α) = 0 We have F (α) = 2 (α y i ) (4)

Lectures INF2320 p. 12/80 Approximating by a constant This leads to or 2 α = 2 y i, (5) α = 1 which is the arithmetic average y i, (6)

Lectures INF2320 p. 13/80 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0.1 0.2 0.3 0.4 0.5 0.6 α*=0.312 Figure 6: A graph of F = F(α) given by (3).

Lectures INF2320 p. 14/80 Approximating by a constant Since F (α) = 2 1 = 20 > 0, (7) it follows that the arithmetic average is the minimizer for F We can say that the average value is the optimal constant approximating the global temperature This way of defining an optimal constant, where we minimize the sum of the square of the distances between the approximation and the data, is referred to as the method of least squares There are other ways to define an optimal constant

Lectures INF2320 p. 15/80 Approximating by a constant Define G(α) = (α y i ) 4 (8) G(α) also measures a sort of deviation from α to the data We have that G (α) = 4 (α y i ) 3 (9) And in order to minimize G we need to solve G (α) = 0, (and check that G (α) > 0)

Lectures INF2320 p. 16/80 Approximating by a constant Solving G (α) = 0 leads to a nonlinear equation that can be solved with the Newton iteration from the previous lecture We use Newton s method with initial approximation: α 0 = 0.312 tolerance specified by: ε = 8 This gives α 0.345, in three iterations α is a minimum of G since G (α ) = 12 (α y i ) 2 > 0

Lectures INF2320 p. 17/80 0.15 0.1 0.05 0 0.1 0.2 0.3 0.4 0.5 0.6 α*=0.345 Figure 7: A graph of G = G(α) given by (8).

Lectures INF2320 p. 18/80 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 Figure 8: Two constant approximations of the global annual mean temperature deviation measurements from year 1991 to 2000.

Lectures INF2320 p. 19/80 Approximating by a linear function Now we will study how we can approximate the world mean temperature deviation with a linear function We want to determine two constants α and β such that p(t) = α+βt () fits the data as good as possible in the sense of least squares

Lectures INF2320 p. 20/80 Approximating by a linear function Define F(α, β) = (α+βt i y i ) 2 (11) In order to minimize F with respect to α and β, we can solve F α = F β = 0 (12)

Lectures INF2320 p. 21/80 Approximating by a linear function We have that F α = 2 (α+βt i y i ), (13) and therefore the condition F α = 0 leads to α+ ( t i )β = y i. (14)

Lectures INF2320 p. 22/80 Approximating by a linear function Here and so we have t i = 1+2+3+ + = 55, y i = 0.29+0.14+0.19+ + 0.29 = 3.12, α+55β = 3.12. (15)

Lectures INF2320 p. 23/80 Approximating by a linear function Further, we have that F β = 2 (α+βt i y i )t i, and therefore the condition F β = 0 gives ( ( t i )α+ ti 2 ) β = y i t i.

Lectures INF2320 p. 24/80 Approximating by a linear function We can calculate and ti 2 = 1+2 2 + 3 2 + + 2 = 385, t i y i = 1 0.29+2 0.14+3 0.19+ + 0.29 = 20, so we arrive at the equation 55α+385β = 20. (16)

Lectures INF2320 p. 25/80 Approximating by a linear function We now have a 2 2 system of linear equations which determines α and β: ( )( ) ( ) 55 α 3.12 =. 55 385 β 20 With our knowledge of linear algebra, we see that ( α β ) ( ) 1 ( ) 55 3.12 = 55 385 20 ( )( = 1 385 55 3.12 825 55 20 ) ( 0.123 0.034 ).

Lectures INF2320 p. 26/80 Approximating by a linear function We conclude that the linear model p(t) = 0.123 + 0.034t (17) approximates the data optimally in the sense of least squares.

0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 Figure 9: Constant and linear least squares approximations of the global annual mean temperature deviation measurements from year 1991 to 2000. Lectures INF2320 p. 27/80

Lectures INF2320 p. 28/80 Approx. by a quadratic function We now want to determine constants α, β and γ, such that the quadratic polynomial p(t) = α+βt + γt 2 (18) fits the data optimally in the sense of least squares Minimizing F(α,β,γ) = (α+βt i + γti 2 y i ) 2 (19) requires F α = F β = F γ = 0 (20)

Lectures INF2320 p. 29/80 Approx. by a quadratic function F α = 2 ) ( α+βti + γti 2 y i = 0 leads to ) α+ ( )β+( t i ti 2 γ = y i F β = 2 ) ( α+βti + γti 2 y i ti = 0 leads to ) ) ( )α+( t i ti 2 β+( ti 3 γ = y i t i F γ = 2 ) ( α+βti + γti 2 y i t 2 i = 0 leads to ) ) ) ( ti 2 α+( ti 3 β+( ti 4 γ = y i ti 2

Lectures INF2320 p. 30/80 Approx. by a quadratic function Here t i = 55, t 4 i = 25330, ti 2 y i = 138.7, t 2 i = 385, y i = 3.12, t 3 i = 3025, t i y i = 20, which leads to the linear system 55 385 α 55 385 3025 β 385 3025 25330 γ = 3.12 20 138.7. (21)

Solving the linear system (21) with, e.g., matlab we get α 0.4078, β 0.2997, (22) γ 0.0241. We have now obtained three approximations of the data The constant The linear The quadratic p 0 (t) = 0.312 p 1 (t) = 0.123+0.034t p 2 (t) = 0.4078+0.2997t 0.0241t 2 Lectures INF2320 p. 31/80

0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 Figure : Constant, linear and quadratic approximations of the global annual mean temperature deviation measurements from the year 1991 to 2000. Lectures INF2320 p. 32/80

Lectures INF2320 p. 33/80 Summary Approximating a data set (t i,y i ) i = 1,...,n, with a constant function p 0 (t) = α. Using the method of least squares gives α = 1 n n y i, (23) which is recognized as the arithmetic average.

Lectures INF2320 p. 34/80 Summary Approximating the data set with a linear function can be done by minimizing min α,β p 1 (t) = α+βt F(α,β) = min α,β n (p 1 (t i ) y i ) 2, which leads to the following 2 2 linear system n n n t i α y i n n = t i ti 2 n β t i y i. (24)

Lectures INF2320 p. 35/80 Summary A quadratic approximation on the form p 2 (t) = α+βt + γt 2 can be done by minimizing min α,β,γ F(α,β,γ) = min α,β,γ n (p 2 (t i ) y i ) 2, which leads to the following 3 3 linear system n n n t i ti 2 n n t i t 2 n i ti 3 n ti 2 n ti 3 n ti 4 α β γ = n y i n y i t i n y i ti 2. (25)

Lectures INF2320 p. 36/80 Large Data Sets 0.6 0.4 0.2 0 0.2 0.4 0.6 1850 1900 1950 2000 Figure 11: The global annual mean temperature deviation measurements from the year 1856 to 2000.

Lectures INF2320 p. 37/80 Application to the temperature data We use the derived methods to model the global temperature deviation from 1856 to 2000. Here 145 145 n = 145, t 2 i = 26745, t 4 i = 1.30415, 145 t i y i = 502.43, 145 145 t i = 585, t 3 i = 1.12042 8, 145 y i = 21.82, 145 ti 2 y i = 19649.8, (26) where we have used t i = i, i.e., t 1 = 1 corresponds to the year of 1856, t 2 = 2 corresponds to the year of 1857 etc.

p 1 (t) 0.4638+0.0043t. Lectures INF2320 p. 38/80 Application to the temperature data First we get the constant model p 0 (t) 0.1505. (27) The coefficients α and β of the linear model are obtained by solving the linear system (24), i.e. ( )( ) ( ) 145 585 α 21.82 =. 585 26745 β 502.43 Consequently, α 0.4638 and β 0.0043, so the linear model is given by

Lectures INF2320 p. 39/80 Application to the temperature data Similarly, the coefficients α, β and γ of the quadratic model are obtained by solving the linear system (25), i.e. 145 585 26745 6 α 21.82 585 26745 6 1.12042 8 β = 502.43 26745 6 1.12042 8 1.30415 γ 19649.8 The solution of this system is given by α 0.3136, β 1.8404 3 and γ 4.2005 5, so the quadratic model is given by p 2 (t) 0.3136 1.8404 3 t + 4.2005 5 t 2. (28)

0.6 0.4 0.2 0 0.2 0.4 0.6 1850 1900 1950 2000 Figure 12: Constant, linear and quadratic least squares approximations of the global annual mean temperature deviation measurements; from 1856 to 2000. Lectures INF2320 p. 40/80

Lectures INF2320 p. 41/80 Application to population models We now consider the growth of the world population. Year Population (billions) 1950 2.555 1951 2.593 1952 2.635 1953 2.680 1954 2.728 1955 2.780 Table 2: The total world population from 1950 to 1955.

Lectures INF2320 p. 42/80 Exponential growth First we model the data in Table 2 using the exponential growth model with solution p(t) = p 0 e αt. p (t) = αp(t), p(0) = p 0, (29) We have earlier mentioned that for this model α has to be estimated We shall now estimate α relative to this data, in the sense of least squares

Lectures INF2320 p. 43/80 Exponential growth Put t = 0 at 1950 and measure t in years p 0 = 2.555 We want to determine the parameter α for data from Table 2 and p (t) p(t) = α (30) Since only p is available, we have to approximate p (t) using the standard formula p (t) p(t + t) p(t) t (31)

Lectures INF2320 p. 44/80 Exponential growth By choosing t = 1, we estimate α to be α n = p(n+1) p(n) p(n) (32) Let b n be the relative annual growth in percentage, i.e. b n = 0 p(n+1) p(n) p(n) (33)

Lectures INF2320 p. 45/80 Exponential growth Year n p(n) b n = 0 1950 0 2.555 1.49 1951 1 2.593 1.62 1952 2 2.635 1.71 1953 3 2.680 1.79 1954 4 2.728 1.91 1955 5 2.780 Table 3: Estimated values of p (t) p(t) of the world s population from 1950 to 1955. p(n+1) p(n) p(n) using (33) based on the numbers

Lectures INF2320 p. 46/80 Exponential growth We can to compute a constant approximation b, to this data set in the sense of least squares, i.e. the average b = 1 5 4 b n = 1 (1.49+1.62+1.71+1.79+1.91) = 1.704 n=0 5 Since b n = 0α n we get α = 1 0 This gives us the model b = 0.01704 (34) p(t) = 2.555e 0.01704t (35)

Lectures INF2320 p. 47/80 Exponential growth In the year 2000 (t=50) this model gives p(50) = 2.555e 0.01704 50 5.990 (36) The actual population in the 2000 was 6.080 billions The relative error is 6.080 5.990 6.080 0% = 1.48%, (37) which is remarkably small

6.5 x 9 6 5.5 5 4.5 4 3.5 3 2.5 1950 1955 1960 1965 1970 1975 1980 1985 1990 1995 2000 Figure 13: The figure shows the graph of an exponential population model p(t) = 2.555e 0.01704t together with the actual measurements marked by o. Lectures INF2320 p. 48/80

Lectures INF2320 p. 49/80 Exponential growth Will this model fit in the future as well? We try to use the development from 1990 to 2000 (see Figure 13) to predict the world population in 20

Lectures INF2320 p. 50/80 Year n p(n) b n = 0 p(n+1) p(n) p(n) 1990 0 5.284 1.57 1991 1 5.367 1.55 1992 2 5.450 1.49 1993 3 5.531 1.46 1994 4 5.611 1.43 1995 5 5.691 1.37 1996 6 5.769 1.35 1997 7 5.847 1.33 1998 8 5.925 1.32 1999 9 6.003 1.28 2000 6.080 Table 4: The calculated b n values associated with an exponential population model for the world between 1990 and 2000.

Lectures INF2320 p. 51/80 Exponential growth The average of the b n values is b = 1 Therefore α = b 0 = 0.0142 9 b n = 1.42 n=0 Starting from t=0 in the year 2000, the model reads p(t) = 6.080e 0.0142t (38) This model predicts that there will be p(0) = 6.080e 1.42 25.2 (39) billion people living on the earth in the year of 20

Lectures INF2320 p. 52/80 Logistic growth of the world pop. We study how the world population can be modeled by the logistic growth model p (t) = α p(t)(1 p(t)/β) (40) By defining γ = α/β, we can rewrite (40) p (t) p(t) = α+γ p(t) (41) Hence, we can determine constants α and γ by fitting a linear function to the observations of p (t) p(t) (42)

Lectures INF2320 p. 53/80 Logistic growth As above we define b n = 0 p(n+1) p(n) p(n) (43) We now want to determine two constants A and B, such that the data are modeled as accurately as possible by a linear function in the sense of least squares b n A+Bp(n), (44)

Lectures INF2320 p. 54/80 Logistic growth A and B can be determined by the following 2 2 linear system, 9 n=0 p(n) We have 9 n=0 p(n) 9 (p(n)) 2 n=0 A B = 9 b n n=0 9 p(n)b n n=0 (45) 9 n=0 p(n) = 56.5, 9 b n 14.1, n=0 9 (p(n)) 2 = 319.5, n=0 9 n=0 p(n)b n 79.6

Lectures INF2320 p. 55/80 Logistic growth By solving the (2 2) system ( 56.5 56.5 319.5 )( A B ) = ( 14.1 79.6 ) we get A = 2.7455 and B = 0.2364 Inserting this in the logistic model we get p (t) 0.027 p(t)(1 p(t)/11.44) (46) This indicates that the carrying capacity of the earth is about 11.44 billions

Lectures INF2320 p. 56/80 Logistic growth Let t = 0 correspond to the year 2000, which gives the initial condition p(0) 6.08 The analytical solution to the differential equation is p(t) 69.5 6.08+5.36e 0.027t (47) This model predicts that there will be p(0).79 billion people on the earth in the year of 20

2.6 x 2.4 2.2 2 1.8 1.6 1.4 1.2 1 0.8 0.6 2000 20 2020 2030 2040 2050 2060 2070 2080 2090 20 Figure 14: Predictions of the population growth on the earth based on an exponential model (solid curve) and a logistic model (dashed curve). Lectures INF2320 p. 57/80

Lectures INF2320 p. 58/80 Approximations of Functions Above we have studied continuous representation of discrete data Next we will consider continuous approximation of continuous functions Consider the function y(t) = ln ( ) 1 sin(t)+et In Figure 15 we see that y(x) seems to be close to the linear function p(t) = t on the interval [0,1] In Figure 16 we see that y(x) seems to be even closer to the linear function plotted on t [0,] (48)

Lectures INF2320 p. 59/80 1.2 1 0.8 0.6 0.4 0.2 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Figure 15: The function y(t) = ln ( 1 sin(t)+et) (solid curve) and a linear approximation (dashed line) on the interval t [0, 1].

Lectures INF2320 p. 60/80 9 8 7 6 5 4 3 2 1 0 0 1 2 3 4 5 6 7 8 9 Figure 16: The function y(t) = ln ( 1 sin(t)+et) (solid curve) and a linear approximation (dashed line) on the interval t [0, ].

Lectures INF2320 p. 61/80 Approximations by constants For a given function y(t), t [a,b], we want to compute a constant approximation of it p(t) = α (49) for t [a,b], in the sense of least squares That means that we want to minimize the integral b a (p(t) y(t)) 2 dt = b a (α y(t)) 2 dt

Lectures INF2320 p. 62/80 Approximations by constants Define the function F(α) = b a (α y(t)) 2 dt (50) The derivative with respect to α is F (α) = 2 And solving F (α) = 0 gives b a (α y(t)) dt α = 1 b a b a y(t)dt (51)

Lectures INF2320 p. 63/80 Note that The formula for α is the integral version of the average of y on [a,b]. In the discrete case we would have written α = 1 n n y i, (52) If y i in (52) is y(t i ), where t i = a+i t and t = b a n, then 1 n n y i = 1 b a t n y(t i ) 1 b a b a y(t) dt. We therefore conclude that (51) is a natural continuous version of (52).

Lectures INF2320 p. 64/80 Note that We used d dα b a (α y(t)) 2 dt = b a α (α y(t))2 dt Is that a legal operation? This is discussed in Exercise 5. The α given by (51) is a minimum, since F (α) = 2(b a) > 0

Lectures INF2320 p. 65/80 Example 15; const. approx. Consider y(t) = sin(t) defined on 0 t π/2. A constant approximation of y is given by p(t) = α (51) = = 2 π π/2 0 sin(t)dt = 2 π [cos(t)]π/2 0 2 π (0 1) = 2 π.

Lectures INF2320 p. 66/80 Example 16; const. approx. Consider y(t) = t 2 + 1 cos(t) defined on 0 t 1. A constant approximation of y is given by p(t) = α (51) = = 1 0 ( t 2 + 1 ) cos(t) 1 3 + 1 dt = sin(1) 0.417. [ 1 3 t3 + 1 ] 1 sin(t) 0

Approximations by Linear Functions Now, we search for a linear approximation of a function y(t), t [a,b], i.e. in the sense of least squares Define p(t) = α+βt (53) F(α, β) = b a (α+βt y(t)) 2 dt (54) A minimum of F is obtained by finding α and β such that F α = F β = 0 Lectures INF2320 p. 67/80

Lectures INF2320 p. 68/80 Approximations by Linear Functions We have F α F β b = 2 (α+βt y(t))dt a b = 2 (α+βt y(t))t dt a Therefore α and β can be determined by solving the following linear system (b a)α+ 1 2 (b2 a 2 )β = 1 2 (b2 a 2 )α+ 1 3 (b3 a 3 )β = b a b a y(t) dt t y(t)dt (55)

Lectures INF2320 p. 69/80 Example 15; linear approx. Consider defined on 0 t π/2. We have and π/2 0 π/2 0 y(t) = sin(t) sin(t) dt = 1 t sin(t) dt = 1.

Lectures INF2320 p. 70/80 Example 15; linear approx. The linear system now reads ( )( π/2 π 2 /8 π 2 /8 π 3 /24 α β ) = ( 1 1 ). The solution is ( α β ) = 1 π 2 8π 24 96 π 24 ( 0.115 0.664 ). Therefore the linear approximation is given by p(t) 0.115 + 0.664t.

Lectures INF2320 p. 71/80 Example 16; linear approx. Consider 1 2 1 3 y(t) = t 2 + 1 cos(t) defined on 0 t 1. The linear system (55) then reads ( )( ) ( 1 1 1 2 α 3 = + 1 sin(1) ), β 3 20 + 1 cos(1)+ 1 sin(1) with solution α 0.059 and β 0.953. We conclude that the linear least squares approximation is given by p(t) 0.059 + 0.953t.

Approx. by Quadratic Functions We seek a quadratic function p(t) = α+βt + γt 2 (56) that approximates a given function y = y(t), a t b, in the sense of least squares Let F(α,β,γ) = b a (α+βt + γt 2 y(t)) 2 dt (57) Define α, β and γ to be the solution of the three equations: F α = F β = F γ = 0 Lectures INF2320 p. 72/80

Lectures INF2320 p. 73/80 Approx. by Quadratic Functions By taking the derivatives, we have F α b = 2 (α+βt + γt 2 y(t))dt a F β b = 2 (α+βt + γt 2 y(t))t dt a F γ = 2 b a (α+βt + γt 2 y(t))t 2 dt

Lectures INF2320 p. 74/80 The coefficients α, β and γ can now be determined from the linear system (b a)α+ 1 2 (b2 a 2 )β+ 1 3 (b3 a 3 )γ = 1 2 (b2 a 2 )α+ 1 3 (b3 a 3 )β+ 1 4 (b4 a 4 )γ = 1 3 (b3 a 3 )α+ 1 4 (b4 a 4 )β+ 1 5 (b5 a 5 )γ = b a b a b a y(t) dt t y(t)dt t 2 y(t)dt

Lectures INF2320 p. 75/80 Example 15; quad. approx. For the function y(t) = sin(t), 0 t π/2, the linear system reads π/2 π 2 /8 π 3 /24 π 2 /8 π 3 /24 π 4 /64 π 3 /24 π 4 /64 π 5 /160 α β γ = 1 1 π 2, and the solution is given by α 0.024, β 1.196 and γ 0.338, which gives the quadratic approximation p(t) = 0.024+1.196t 0.338t 2.

Lectures INF2320 p. 76/80 Example 16; quad. approx. Let us consider y(t) = t 2 + 1 cos(t) for 0 t 1. The linear system takes the form 1 1 1/2 1/3 α 3 + 1 sin(1) 3 1/2 1/3 1/4 β = 20 + 1 cos(1)+ 1 sin(1) 1 1/3 1/4 1/5 γ 5 + 1 5 cos(1) 1 sin(1) and the solution is given by α 0.0, β 0.004 and γ 0.957, and the quadratic approximation is p(t) = 0.0 0.004t + 0.957t 2.

Lectures INF2320 p. 77/80 1 0.8 0.6 0.4 0.2 0 0 0.5 1 1.5 Figure 17: The function y(t) = sin(t) (solid curve) and its least squares approximations: constant (dashed line), linear (dotted line) and quadratic (dashed-dotted curve).

Lectures INF2320 p. 78/80 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Figure 18: The function y(t) = t 2 + 1 cos(t) (solid curve) and its least squares approximations: constant (dashed line), linear (dotted line) and quadratic (dashed-dotted curve).

Lectures INF2320 p. 79/80

Lectures INF2320 p. 80/80