Notes on the Singular Value Decomposition
|
|
- Rosalyn Terry
- 7 years ago
- Views:
Transcription
1 Notes on the Singular Value Decomposition Robert A van de Geijn The University of Texas at Austin Austin, TX 7871 September 11, 14 NOTE: I have not thoroughly proof-read these notes!!! We recommend the reader review Weeks 9-11 of Linear Algebra: Foundations to Frontiers - Notes to LAFF With 1 Orthogonality and Unitary Matrices Definition 1 Let u, v C m These vectors are orthogonal perpendicular if u v Definition Let q, q 1,, q n 1 C m These vectors are said to be mutually orthonormal if for all i, j < n { qi 1 if i j q j otherwise Notice that for n vectors of length m to be mutually orthonormal, n must be less than or equal to m This is because n mutually orthonormal vectors are linearly independent and there can be at most m linearly independent vectors of length m Definition 3 Let Q C m n with n m Then Q is said to be an orthonormal matrix if Q Q I the identity Exercise 4 Let Q C m n with n m Partition Q q q 1 q n 1 Show that Q is an orthonormal matrix if and only if q, q 1,, q n 1 are mutually orthonormal Definition 5 Let Q C m m Then Q is said to be a unitary matrix if Q Q I the identity Notice that unitary matrices are always square and only square matrices can be unitary Sometimes the term orthogonal matrix is used instead of unitary matrix, especially if the matrix is real valued Exercise 6 Let Q C m m Show that if Q is unitary then Q 1 Q and QQ I Exercise 7 Let Q, Q 1 C m m both be unitary Show that their product, Q Q 1, is unitary Exercise 8 Let Q, Q 1,, Q k 1 C m m all be unitary Show that their product, Q Q 1 Q k 1, is unitary The following is a very important observation: Let Q be a unitary matrix with Q q q 1 q m 1 1
2 Let x C m Then x QQ x q q 1 q m 1 q q 1 q m 1 x q q 1 q m 1 q q 1 q m 1 q q 1 q m 1 q x q 1 x q m 1x x q xq + q 1 xq q m 1xq m 1 What does this mean? The vector x χ χ 1 gives the coefficients when the vector x is written as a linear combination χ m 1 of the unit basis vectors: x χ e + χ 1 e χ m 1 e m 1 The vector Q x q x q 1 x q m 1x gives the coefficients when the vector x is written as a linear combination of the orthonormal vectors q, q 1,, q m 1 : x q xq + q 1 xq q m 1xq m 1 The vector q i xq i equals the component of x in the direction of vector q i Another way of looking at this is that if q, q 1,, q m 1 is an orthonormal basis for C m, then any x C m can be written as a linear combination of these vectors: Now, x α q + α 1 q α m 1 q m 1 q i x q i α q + α 1 q α i 1 q i 1 + α i q i + α i+1 q i α m 1 q m 1 α qi q α i + α 1 q i q α i 1 q i q i 1 + α i q i q i 1 + α i+1 q i q i α m 1 q i q m 1 Thus q i x α i, the coefficient that multiplies q i
3 Exercise 9 Let U C m m be unitary and x C m, then Ux x Exercise 1 Let U C m m and V C n n be unitary matrices and A C m n Then UA AV A Exercise 11 Let U C m m and V C n n be unitary matrices and A C m n Then UA F AV F A F Toward the SVD Lemma 1 Given A C m n there exists unitary U C m m, unitary V C n n, and diagonal D R m n DT such that A UDV L where D with D T L diagδ,, δ r 1 and δ i > for i < r Proof: First, let us observe that if A the zero matrix then the theorem trivially holds: A UDV where U I m m, V I n n, and D, so that D T L is Thus, wlog assume that A We will prove this for m n, leaving the case where m n as an exercise, employing a proof by induction on n Base case: n 1 In this case A a where a R m is its only column By assumption, a Then A a u a 1 where u a / a Choose U 1 C m m 1 so that U u U 1 is unitary Then A a u a 1 where D T L δ a and V 1 u U 1 a 1 UDV Inductive step: Assume the result is true for all matrices with 1 k < n columns Show that it is true for matrices with n columns Let A C m n with n Wlog, A so that A Let δ and v C n have the property that v 1 and δ Av A In other words, v is the vector that maximizes max x 1 Ax Let u Av /δ Note that u 1 Choose U 1 C m m 1 and V 1 C n n 1 so that Ũ u U 1 and Ṽ v V 1 are unitary Then Ũ AṼ u U 1 A v V 1 u Av u AV 1 δ u u u AV 1 U1 Av U1 AV 1 δu1 u U1 AV 1 δ w B where w V1 A u and B U1 AV 1 Now, we will argue that w, the zero vector of appropriate size: δ w δ A U AV max x 3 U AV x x max x B x x,
4 δ w B δ w δ w δ + w w δ + w w δ + w w Thus δ δ + w h which means that w and Ũ AṼ δ + w w Bw δ w δ B By the induction hypothesis, there exists unitary Ǔ Cm 1 m 1, unitary ˇV C n 1 n 1, and Ď R m 1 n 1 such that B ǓĎ ˇV where Ď ĎT L with ĎT L diagδ 1,, δ r 1 Now, let U Ũ 1, V Ǔ Ṽ 1 δ, and D ˇV Ď There are some really tough to see checks in the definition of U, V, and D!! Then A UDV where U, V, and D have the desired properties By the Principle of Mathematical Induction the result holds for all matrices A C m n with m n Exercise 13 Let D diagδ,, δ n 1 Show that D max n 1 i δ i Exercise 14 Let A AT Use the SVD of A to show that A A T Exercise 15 Assume that U C m m and V C n n be unitary matrices Let A, B C m n with B UAV Show that the singular values of A equal the singular values of B σ Exercise 16 Let A C m n with A and assume that A σ Show that B A B int: Use the SVD of B Exercise 17 Prove Lemma 1 for m n You can use the following as an outline for your proof: Proof: First, let us observe that if A the zero matrix then the theorem trivially holds: A UDV where U I m m, V I n n, and D, so that D T L is Thus, wlog assume that A We will employ a proof by induction on m Base case: m 1 In this case A where â T R1 n is its only row By assumption, â T Then A â T â T â T 1 v 4
5 where v â T / â T Choose V 1 C n n 1 so that V v V 1 is unitary Then A â T 1 â T v V 1 UDV where D T L δ â T and U 1 Inductive step: Similarly modify the inductive step of the proof of the theorem By the Principle of Mathematical Induction the result holds for all matrices A C m n with m n 3 The Theorem Theorem 18 Singular Value Decomposition Given A C m n there exists unitary U C m m, unitary ΣT V C n n, and Σ R m n such that A UΣV L where Σ with Σ T L diagσ,, σ r 1 and σ σ 1 σ r 1 > The σ,, σ r 1 are known as the singular values of A Proof: Notice that the proof of the above theorem is identical to that of Lemma 1 owever, thanks to the above exercises, we can conclude that B σ in the proof, which then can be used to show that the singular values are found in order Proof:Alternative An alternative proof uses Lemma 1 to conclude that A UDV If the entries on the diagonal of D are not ordered from largest to smallest, then this can be fixed by permuting the rows and columns of D, and correspondingly permuting the columns of U and V 4 Geometric Interpretation Again We will now quickly illustrate what the SVD Theorem tells us about matrix-vector multiplication linear transformations by examining the case where A R Let A UΣV T be its SVD decomposition Notice that all matrices are now real valued, and hence V V T Partition A u u 1 σ σ 1 v v 1 T Since U and V are unitary matrices, {u, u 1 } and {v, v 1 } form orthonormal bases for the range and domain of A, respectively: 5
6 R : Domain of A R : Range codomain of A Let us manipulate the decomposition a little: [ σ T σ A u u 1 v v 1 u u 1 σ 1 σ 1 T σ u σ 1 u 1 v v 1 ] v v 1 T Now let us look at how A transforms v and v 1 : T Av σ u σ 1 u 1 v v 1 v σ u σ 1 u 1 1 σ u and similarly Av 1 σ 1 u 1 This motivates the pictures R : Domain of A R : Range codomain of A 6
7 Now let us look at how A transforms any vector with Euclidean unit length Notice that x means that x χ e + χ 1 e 1, where e and e 1 are the unit basis vectors Thus, χ and χ 1 are the coefficients when x is expressed using e and e 1 as basis owever, we can also express x in the basis given by v and v 1 : T x V }{{ V T v } x v v 1 v v 1 x T x v v 1 v1 T x I v T T α x v + v1 x v 1 α v + α v 1 v v 1 α 1 α α 1 χ χ 1 Thus, in the basis formed by v and v 1, its coefficients are α and α 1 Now, Ax σ u σ 1 u 1 σ u σ 1 u 1 α α 1 v v 1 T x σ u σ 1 u 1 α σ u + α 1 σ 1 u 1 v v 1 T v v 1 α α 1 This is illustrated by the following picture, which also captures the fact that the unit ball is mapped to an ellipse 1 with major axis equal to σ A and minor axis equal to σ 1 : R : Domain of A R : Range codomain of A Finally, we show the same insights for general vector x not necessarily of unit length 1 It is not clear that it is actually an ellipse and this is not important to our observations 7
8 R : Domain of A R : Range codomain of A Another observation is that if one picks the right basis for the domain and codomain, then the computation Ax simplifies to a matrix multiplication with a diagonal matrix Let us again illustrate this for nonsingular A R with A u u 1 } {{ } U σ } σ 1 {{ } Σ v v 1 } {{ } V Now, if we chose to express y using u and u 1 as the basis and express x using v and v 1 as the basis, then ŷ ψ UU }{{ T } y u T yu + u T 1 yu 1 ψ 1 I x V }{{ V T } x v T xv + v1 T xv 1 I χ χ 1 T If y Ax then U U T y ŷ UΣV T x Ax UΣ x so that ŷ Σ x and ψ ψ 1 These observation generalize to A C m m σ χ σ 1 χ 1 5 Consequences of the SVD Theorem Throughout this section we will assume that 8
9 A UΣV is the SVD of A C m n, with U and V unitary and Σ diagonal ΣT L Σ where Σ T L diagσ,, σ r 1 with σ σ 1 σ r 1 > U U L U R with U L C m r V V L V R with V L C n r We first generalize the observations we made for A R Let us track what the effect of Ax UΣV x is on vector x We assume that m n Let U u u m 1 and V v v n 1 Let x V V x v v n 1 v v n 1 x v v n 1 v x v n 1x v xv + + v n 1xv n 1 This can be interpreted as follows: vector x can be written in terms of the usual basis of C n as χ e + +χ 1 e n 1 or in the orthonormal basis formed by the columns of V as v xv + +v n 1xv n 1 Notice that Ax Av xv + + v n 1xv n 1 v xav + + v n 1xAv n 1 so that we next look at how A transforms each v i : Av i UΣV v i UΣe i σ i Ue i σ i u i Thus, another way of looking at Ax is Ax v xav + + v n 1xAv n 1 v xσ u + + v n 1xσ n 1 u n 1 σ u v x + + σ n 1 u n 1 v n 1x σ u v + + σ n 1 u n 1 v n 1 x Corollary 19 A U L Σ T L V L This is called the reduced SVD of A Proof: A UΣV U L U R Σ T L V L V R UL Σ T LV L Corollary Let A U L Σ T L VL be the reduced SVD with U L u u r 1, Σ T L diagσ,, σ r 1, and V L v v r 1 Then A σ u v + σ 1 u 1 v1 + + σ r 1 u r 1 vr 1 each term nonzero and an outer product, and hence a rank-1 matrix Proof: We leave the proof as an exercise Corollary 1 CA CU L Proof: 9
10 Let y CA Then there exists x C n such that y Ax by the definition of y CA But then y Ax U L Σ T L V L x z U L z, ie, there exists z C r such that y U L z This means y CU L Let y CU L Then there exists z C r such that y U L z But then y U L z U L Σ T L Σ 1 T L I z U L Σ T L V L V L I Σ 1 T L z A V LΣ 1 T L }{{ z } x Ax so that there exists x C n such that y Ax, ie, y CA Corollary The rank of A is r Proof: The rank of A equals the dimension of CA CU L But the dimension of CU L is clearly r Corollary 3 N A CV R Proof: Let x N A Then x V }{{ V } x V L V R V L V R x I VL V L V x R VR x V L VL x + V R VR x V L V R V L V R x If we can show that VL x then x V Rz where z VR x Assume that V L x Then Σ T L VL x since Σ T L is nonsingular and U L Σ T L VL x since U L has linearly independent columns But that contradicts the fact that Ax U L Σ T L VL x Let x CV R Then x V R z for some z C r and Ax U L Σ T L Corollary 4 For all x C n there exists z CV L such that Ax Az V L V R z Proof: Ax A } V {{ V } I x A V L V R V L V R x A V L V L x + V R V R x AV L V L x + AV R V R x AV L VL x + U L Σ T L VL V R V R x A V L V L x z 1
11 C n C m dim r z CA CU L CA CV L x z + x n y Az dim r y Ax U L Σ T L V L x y x n dim n r N A CV R N A CU R dim m r Figure 1: A pictorial description of how x z + x n is transformed by A C m n into y Ax Az + x n We see that CV L and CV R are orthogonal complements of each other within C n Similarly, CU L and CU R are orthogonal complements of each other within C m Any vector x can be written as the sum of a vector z CV R and x n CV C N A Alternative proof which uses the last corollary: Ax A V L V L x + V R V R x AV L V L x + A V R V R x N A A V L V L x z The proof of the last corollary also shows that Corollary 5 Any vector x C n can be written as x z + x n where z CV L and x n N A CV R Corollary 6 A V L Σ T L UL so that CA CV L and N A CU R The above corollaries are summarized in Figure 1 Theorem 7 Let A C n n be nonsingular Let A UΣV be its SVD Then 1 The SVD is the reduced SVD σ n 1 3 If U u u n 1, Σ diagσ,, σ n 1, and V v v n 1, then A 1 V P T P Σ 1 P T UP T 1 v n 1 v diag,, 1 σ n 1 σ u n 1 u, 11
12 1 1 where P is the permutation matrix such that P x reverses the order of the entries 1 in x Note: for this permutation matrix, P T P In general, this is not the case What is the case for all permutation matrices P is that P T P P P T I 4 A 1 1/σ n 1 Proof: The only item that is less than totally obvious is 3 Clearly A 1 V Σ 1 U The problem is that in Σ 1 the diagonal entries are not ordered from largest to smallest The permutation fixes this Corollary 8 If A C m n has linearly independent columns then A A is invertible nonsingular and A A 1 V L Σ 1 T L V L Proof: Since A has linearly independent columns, A U L Σ T L VL columns and V L is unitary ence is the reduced SVD where U L has n A A U L Σ T L VL U L Σ T L VL V L Σ T LUL U L Σ T L VL V L Σ T L Σ T L VL V L Σ T LVL Since V L is unitary and Σ T L is diagonal with nonzero diagonal entries, tey are both nonsingular Thus VL Σ T LVL V L Σ 1 T L V L I This means A T A is invertible and A T A 1 is as given 6 Projection onto the Column Space Definition 9 Let U L C m k have orthonormal columns The projection of a vector y C m onto CU L is the vector U L x that minimizes y U L x, where x C k We will also call this vector y the component of x in CU L Theorem 3 Let U L C m k have orthonormal columns The projection of y onto CU L is given by U L U L y Proof: The vector U L x that we want must satisfy U L x y min w C k U L w y Now, the -norm is invariant under multiplication by the unitary matrix U U L U R U L x y min w C k U L w y min U U L w y since the two norm is preserved w C k min U L U R UL w y w C k U min L U L w y w C k U R 1
13 U min L U U w C k UR L w L y UR U min L U L w U w C k UR U L y Lw UR y w U min L y w C k UR y w U min L y w C k UR y min w U w C k L y + U R y since min w U L y + w C k U R y u v u + v This is minimized when w UL y Thus, the vector that is closest to y in the space spanned by U L is given by x U L UL y Corollary 31 Let A C m n and A U L Σ T L VL CA is given by U L UL y be its reduced SVD Then the projection of y Cm onto Proof: This follows immediately from the fact that CA CU L Corollary 3 Let A C m n have linearly independent columns Then the projection of y C m onto CA is given by AA A 1 A y Proof: From Corrolary 8, we know that A A is nonsingular and that A A 1 V L Σ T L 1 V L Now, AA A 1 A y U L Σ T L V L V L Σ T L 1 V L U L Σ T L V L y U L Σ T L VL V L I Σ 1 T L Σ 1 T L V L V L I Σ T L U L y U L U L y ence the projection of y onto CA is given by AA A 1 A y Definition 33 Let A have linearly independent columns Then A A 1 A Moore-Penrose generalized inverse of matrix A is called the pseudo-inverse or 7 Low-rank Approximation of a Matrix Theorem 34 Let A C m n have SVD A UΣV and assume A has rank r Partition ΣT L U U L U R, V V L V R, and Σ Σ BR, 13
14 where U L C m k, V L C n k, and Σ T L R k k with k r Then B U L Σ T L VL closest to A in the following sense: is the matrix in Cm n A B min C C m n rankc k A C σ k Proof: First, if B is as defined, then clearly A B σ k : A B U A BV U AV U BV Σ U L U R B V L V ΣT L R Σ BR Σ BR Σ BR σ k ΣT L Next, assume that C has rank t k and A C < A B We will show that this leads to a contradiction The null space of C has dimension at least n k since dimn C + rankc n If x N C then Ax A Cx A C x < σ k x Partition U u u m 1 and V v v n 1 Then Av j σ j u j σ j σ s for j,, k Now, let x be any linear combination of v,, v k : x α v + + α k v k Notice that Then x α v + + α k v k α + α k Ax Aα v + + α k v k α Av + + α k Av k α σ u + + α k σ k u k α σ u + + α k σ k u k α σ + + α k σ k α + + α k σ k so that Ax σ k x In other words, vectors in the subspace of all linear combinations of {v,, v k } satisfy Ax σ k x The dimension of this subspace is k + 1 since {v,, v k } form an orthonormal basis Both these subspaces are subspaces of C n Since their dimensions add up to more than n there must be at least one nonzero vector z that satisfies both Az < σ k z and Az σ k z, which is a contradiction The above theorem tells us how to pick the best approximation with given rank to a given matrix 8 An Application Let Y R m n be a matrix that, for example, stores a picture In this case, the i, j entry in Y is, for example, a number that represents the grayscale value of pixel i, j The following instructions, executed in octave or matlab, generate the picture of Mexican artist Frida Kahlo in Figure top-left The file FridaPNGpng can be found at 14
15 Original picture k 1 k k 5 k 1 k 5 Figure : Multiple pictures as generated by the code 15
16 octave> IMG imread FridaPNGpng ; % this reads the image octave> Y IMG :,:,1 ; octave> imshow Y % this dispays the image Although the picture is black and white, it was read as if it is a color image, which means a m n 3 array of pixel information is stored Setting Y IMG :,:,1 extracts a single matrix of pixel information If you start with a color picture, you will want to approximate IMG :,:,1, IMG :,:,, and IMG :,:,3 separately Now, let Y UΣV T be the SVD of matrix Y Partition, conformally, U U L U R, V V L V R, and Σ ΣT L where U L and V L have k columns and Σ T L is k k so that Σ T L T Y U L U R V L V R Σ BR Σ T L V T L U L U R Σ BR VR T Σ T L VL T U L U R Σ BR VR T U L Σ T L V T L + U R Σ BR V T R Recall that then U L Σ T L VL T is the best rank-k approximation to Y Let us approximate the matrix that stores the picture with U L Σ T L VL T : >> IMG imread FridaPNGpng ; % read the picture >> Y IMG :,:,1 ; >> imshow Y ; % this dispays the image >> k 1; >> [ U, Sigma, V ] svd Y ; >> UL U :, 1:k ; % first k columns >> VL V :, 1:k ; % first k columns >> SigmaTL Sigma 1:k, 1:k ; % TL submatrix of Sigma >> Yapprox uint8 UL * SigmaTL * VL ; >> imshow Yapprox ; Σ BR As one increases k, the approximation gets better, as illustrated in Figure The graph in Figure 3 helps explain The original matrix Y is , with 181, 53 entries When k 1, matrices U, V, and Σ are 387 1, and 1 1, respectively, requiring only 8, 66 entries to be stores 9 SVD and the Condition Number of a Matrix In Notes on Norms we saw that if Ax b and Ax + δx b + δb, then δx x κ A δb b, where κ A A A 1 is the condition number of A, using the -norm Exercise 35 Show that if A C m m is nonsingular, then, 16
17 Figure 3: Distribution of singular values for the picture A σ, the largest singular value; A 1 1/σ m 1, the inverse of the smallest singular value; and κ A σ /σ m 1 If we go back to the example of A R, recall the following pictures that shows how A transforms the unit circle: 17
18 R : Domain of A R : Range codomain of A In this case, the ratio σ /σ n 1 represents the ratio between the major and minor axes of the ellipse on the right 1 An Algorithm for Computing the SVD? It would seem that the proof of the existence of the SVD is constructive in the sense that it provides an algorithm for computing the SVD of a given matrix A C m m Not so fast! Observe that Computing A is nontrivial Computing the vector that maximizes max x 1 Ax is nontrivial Given a vector q computing vectors q,, q m 1 is expensive as we will see when we discuss the QR factorization Towards the end of the course we will discuss algorithms for computing the eigenvalues and eigenvectors of a matrix, and related algorithms for computing the SVD 18
Linear Algebra Review. Vectors
Linear Algebra Review By Tim K. Marks UCSD Borrows heavily from: Jana Kosecka kosecka@cs.gmu.edu http://cs.gmu.edu/~kosecka/cs682.html Virginia de Sa Cogsci 8F Linear Algebra review UCSD Vectors The length
More informationThe Singular Value Decomposition in Symmetric (Löwdin) Orthogonalization and Data Compression
The Singular Value Decomposition in Symmetric (Löwdin) Orthogonalization and Data Compression The SVD is the most generally applicable of the orthogonal-diagonal-orthogonal type matrix decompositions Every
More informationInner Product Spaces and Orthogonality
Inner Product Spaces and Orthogonality week 3-4 Fall 2006 Dot product of R n The inner product or dot product of R n is a function, defined by u, v a b + a 2 b 2 + + a n b n for u a, a 2,, a n T, v b,
More informationOrthogonal Diagonalization of Symmetric Matrices
MATH10212 Linear Algebra Brief lecture notes 57 Gram Schmidt Process enables us to find an orthogonal basis of a subspace. Let u 1,..., u k be a basis of a subspace V of R n. We begin the process of finding
More informationLecture 5: Singular Value Decomposition SVD (1)
EEM3L1: Numerical and Analytical Techniques Lecture 5: Singular Value Decomposition SVD (1) EE3L1, slide 1, Version 4: 25-Sep-02 Motivation for SVD (1) SVD = Singular Value Decomposition Consider the system
More information1 2 3 1 1 2 x = + x 2 + x 4 1 0 1
(d) If the vector b is the sum of the four columns of A, write down the complete solution to Ax = b. 1 2 3 1 1 2 x = + x 2 + x 4 1 0 0 1 0 1 2. (11 points) This problem finds the curve y = C + D 2 t which
More informationα = u v. In other words, Orthogonal Projection
Orthogonal Projection Given any nonzero vector v, it is possible to decompose an arbitrary vector u into a component that points in the direction of v and one that points in a direction orthogonal to v
More informationby the matrix A results in a vector which is a reflection of the given
Eigenvalues & Eigenvectors Example Suppose Then So, geometrically, multiplying a vector in by the matrix A results in a vector which is a reflection of the given vector about the y-axis We observe that
More informationCS3220 Lecture Notes: QR factorization and orthogonal transformations
CS3220 Lecture Notes: QR factorization and orthogonal transformations Steve Marschner Cornell University 11 March 2009 In this lecture I ll talk about orthogonal matrices and their properties, discuss
More informationChapter 6. Orthogonality
6.3 Orthogonal Matrices 1 Chapter 6. Orthogonality 6.3 Orthogonal Matrices Definition 6.4. An n n matrix A is orthogonal if A T A = I. Note. We will see that the columns of an orthogonal matrix must be
More informationApplied Linear Algebra I Review page 1
Applied Linear Algebra Review 1 I. Determinants A. Definition of a determinant 1. Using sum a. Permutations i. Sign of a permutation ii. Cycle 2. Uniqueness of the determinant function in terms of properties
More informationLectures notes on orthogonal matrices (with exercises) 92.222 - Linear Algebra II - Spring 2004 by D. Klain
Lectures notes on orthogonal matrices (with exercises) 92.222 - Linear Algebra II - Spring 2004 by D. Klain 1. Orthogonal matrices and orthonormal sets An n n real-valued matrix A is said to be an orthogonal
More informationMATRIX ALGEBRA AND SYSTEMS OF EQUATIONS
MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS Systems of Equations and Matrices Representation of a linear system The general system of m equations in n unknowns can be written a x + a 2 x 2 + + a n x n b a
More informationNotes on Symmetric Matrices
CPSC 536N: Randomized Algorithms 2011-12 Term 2 Notes on Symmetric Matrices Prof. Nick Harvey University of British Columbia 1 Symmetric Matrices We review some basic results concerning symmetric matrices.
More informationSystems of Linear Equations
Systems of Linear Equations Beifang Chen Systems of linear equations Linear systems A linear equation in variables x, x,, x n is an equation of the form a x + a x + + a n x n = b, where a, a,, a n and
More informationMATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m
MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS 1. SYSTEMS OF EQUATIONS AND MATRICES 1.1. Representation of a linear system. The general system of m equations in n unknowns can be written a 11 x 1 + a 12 x 2 +
More informationTHE DIMENSION OF A VECTOR SPACE
THE DIMENSION OF A VECTOR SPACE KEITH CONRAD This handout is a supplementary discussion leading up to the definition of dimension and some of its basic properties. Let V be a vector space over a field
More informationSection 6.1 - Inner Products and Norms
Section 6.1 - Inner Products and Norms Definition. Let V be a vector space over F {R, C}. An inner product on V is a function that assigns, to every ordered pair of vectors x and y in V, a scalar in F,
More informationFactorization Theorems
Chapter 7 Factorization Theorems This chapter highlights a few of the many factorization theorems for matrices While some factorization results are relatively direct, others are iterative While some factorization
More information160 CHAPTER 4. VECTOR SPACES
160 CHAPTER 4. VECTOR SPACES 4. Rank and Nullity In this section, we look at relationships between the row space, column space, null space of a matrix and its transpose. We will derive fundamental results
More informationSimilarity and Diagonalization. Similar Matrices
MATH022 Linear Algebra Brief lecture notes 48 Similarity and Diagonalization Similar Matrices Let A and B be n n matrices. We say that A is similar to B if there is an invertible n n matrix P such that
More informationOrthogonal Projections
Orthogonal Projections and Reflections (with exercises) by D. Klain Version.. Corrections and comments are welcome! Orthogonal Projections Let X,..., X k be a family of linearly independent (column) vectors
More informationLINEAR ALGEBRA. September 23, 2010
LINEAR ALGEBRA September 3, 00 Contents 0. LU-decomposition.................................... 0. Inverses and Transposes................................. 0.3 Column Spaces and NullSpaces.............................
More informationT ( a i x i ) = a i T (x i ).
Chapter 2 Defn 1. (p. 65) Let V and W be vector spaces (over F ). We call a function T : V W a linear transformation form V to W if, for all x, y V and c F, we have (a) T (x + y) = T (x) + T (y) and (b)
More informationNotes on Orthogonal and Symmetric Matrices MENU, Winter 2013
Notes on Orthogonal and Symmetric Matrices MENU, Winter 201 These notes summarize the main properties and uses of orthogonal and symmetric matrices. We covered quite a bit of material regarding these topics,
More informationMAT 200, Midterm Exam Solution. a. (5 points) Compute the determinant of the matrix A =
MAT 200, Midterm Exam Solution. (0 points total) a. (5 points) Compute the determinant of the matrix 2 2 0 A = 0 3 0 3 0 Answer: det A = 3. The most efficient way is to develop the determinant along the
More informationLINEAR ALGEBRA W W L CHEN
LINEAR ALGEBRA W W L CHEN c W W L Chen, 1997, 2008 This chapter is available free to all individuals, on understanding that it is not to be used for financial gain, and may be downloaded and/or photocopied,
More informationSolutions to Math 51 First Exam January 29, 2015
Solutions to Math 5 First Exam January 29, 25. ( points) (a) Complete the following sentence: A set of vectors {v,..., v k } is defined to be linearly dependent if (2 points) there exist c,... c k R, not
More informationAu = = = 3u. Aw = = = 2w. so the action of A on u and w is very easy to picture: it simply amounts to a stretching by 3 and 2, respectively.
Chapter 7 Eigenvalues and Eigenvectors In this last chapter of our exploration of Linear Algebra we will revisit eigenvalues and eigenvectors of matrices, concepts that were already introduced in Geometry
More informationMethods for Finding Bases
Methods for Finding Bases Bases for the subspaces of a matrix Row-reduction methods can be used to find bases. Let us now look at an example illustrating how to obtain bases for the row space, null space,
More informationNumerical Analysis Lecture Notes
Numerical Analysis Lecture Notes Peter J. Olver 6. Eigenvalues and Singular Values In this section, we collect together the basic facts about eigenvalues and eigenvectors. From a geometrical viewpoint,
More informationOctober 3rd, 2012. Linear Algebra & Properties of the Covariance Matrix
Linear Algebra & Properties of the Covariance Matrix October 3rd, 2012 Estimation of r and C Let rn 1, rn, t..., rn T be the historical return rates on the n th asset. rn 1 rṇ 2 r n =. r T n n = 1, 2,...,
More informationInner product. Definition of inner product
Math 20F Linear Algebra Lecture 25 1 Inner product Review: Definition of inner product. Slide 1 Norm and distance. Orthogonal vectors. Orthogonal complement. Orthogonal basis. Definition of inner product
More information1 Introduction to Matrices
1 Introduction to Matrices In this section, important definitions and results from matrix algebra that are useful in regression analysis are introduced. While all statements below regarding the columns
More information5. Orthogonal matrices
L Vandenberghe EE133A (Spring 2016) 5 Orthogonal matrices matrices with orthonormal columns orthogonal matrices tall matrices with orthonormal columns complex matrices with orthonormal columns 5-1 Orthonormal
More information[1] Diagonal factorization
8.03 LA.6: Diagonalization and Orthogonal Matrices [ Diagonal factorization [2 Solving systems of first order differential equations [3 Symmetric and Orthonormal Matrices [ Diagonal factorization Recall:
More informationNOTES ON LINEAR TRANSFORMATIONS
NOTES ON LINEAR TRANSFORMATIONS Definition 1. Let V and W be vector spaces. A function T : V W is a linear transformation from V to W if the following two properties hold. i T v + v = T v + T v for all
More informationISOMETRIES OF R n KEITH CONRAD
ISOMETRIES OF R n KEITH CONRAD 1. Introduction An isometry of R n is a function h: R n R n that preserves the distance between vectors: h(v) h(w) = v w for all v and w in R n, where (x 1,..., x n ) = x
More informationReview Jeopardy. Blue vs. Orange. Review Jeopardy
Review Jeopardy Blue vs. Orange Review Jeopardy Jeopardy Round Lectures 0-3 Jeopardy Round $200 How could I measure how far apart (i.e. how different) two observations, y 1 and y 2, are from each other?
More information18.06 Problem Set 4 Solution Due Wednesday, 11 March 2009 at 4 pm in 2-106. Total: 175 points.
806 Problem Set 4 Solution Due Wednesday, March 2009 at 4 pm in 2-06 Total: 75 points Problem : A is an m n matrix of rank r Suppose there are right-hand-sides b for which A x = b has no solution (a) What
More informationMATH 423 Linear Algebra II Lecture 38: Generalized eigenvectors. Jordan canonical form (continued).
MATH 423 Linear Algebra II Lecture 38: Generalized eigenvectors Jordan canonical form (continued) Jordan canonical form A Jordan block is a square matrix of the form λ 1 0 0 0 0 λ 1 0 0 0 0 λ 0 0 J = 0
More informationThese axioms must hold for all vectors ū, v, and w in V and all scalars c and d.
DEFINITION: A vector space is a nonempty set V of objects, called vectors, on which are defined two operations, called addition and multiplication by scalars (real numbers), subject to the following axioms
More informationMathematics Course 111: Algebra I Part IV: Vector Spaces
Mathematics Course 111: Algebra I Part IV: Vector Spaces D. R. Wilkins Academic Year 1996-7 9 Vector Spaces A vector space over some field K is an algebraic structure consisting of a set V on which are
More informationNotes on Determinant
ENGG2012B Advanced Engineering Mathematics Notes on Determinant Lecturer: Kenneth Shum Lecture 9-18/02/2013 The determinant of a system of linear equations determines whether the solution is unique, without
More information4: EIGENVALUES, EIGENVECTORS, DIAGONALIZATION
4: EIGENVALUES, EIGENVECTORS, DIAGONALIZATION STEVEN HEILMAN Contents 1. Review 1 2. Diagonal Matrices 1 3. Eigenvectors and Eigenvalues 2 4. Characteristic Polynomial 4 5. Diagonalizability 6 6. Appendix:
More informationFinite Dimensional Hilbert Spaces and Linear Inverse Problems
Finite Dimensional Hilbert Spaces and Linear Inverse Problems ECE 174 Lecture Supplement Spring 2009 Ken Kreutz-Delgado Electrical and Computer Engineering Jacobs School of Engineering University of California,
More informationNumerical Methods I Eigenvalue Problems
Numerical Methods I Eigenvalue Problems Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 Course G63.2010.001 / G22.2420-001, Fall 2010 September 30th, 2010 A. Donev (Courant Institute)
More information3 Orthogonal Vectors and Matrices
3 Orthogonal Vectors and Matrices The linear algebra portion of this course focuses on three matrix factorizations: QR factorization, singular valued decomposition (SVD), and LU factorization The first
More informationMATH 551 - APPLIED MATRIX THEORY
MATH 55 - APPLIED MATRIX THEORY FINAL TEST: SAMPLE with SOLUTIONS (25 points NAME: PROBLEM (3 points A web of 5 pages is described by a directed graph whose matrix is given by A Do the following ( points
More informationLecture 1: Schur s Unitary Triangularization Theorem
Lecture 1: Schur s Unitary Triangularization Theorem This lecture introduces the notion of unitary equivalence and presents Schur s theorem and some of its consequences It roughly corresponds to Sections
More information3. Let A and B be two n n orthogonal matrices. Then prove that AB and BA are both orthogonal matrices. Prove a similar result for unitary matrices.
Exercise 1 1. Let A be an n n orthogonal matrix. Then prove that (a) the rows of A form an orthonormal basis of R n. (b) the columns of A form an orthonormal basis of R n. (c) for any two vectors x,y R
More information8 Square matrices continued: Determinants
8 Square matrices continued: Determinants 8. Introduction Determinants give us important information about square matrices, and, as we ll soon see, are essential for the computation of eigenvalues. You
More informationCS 5614: (Big) Data Management Systems. B. Aditya Prakash Lecture #18: Dimensionality Reduc7on
CS 5614: (Big) Data Management Systems B. Aditya Prakash Lecture #18: Dimensionality Reduc7on Dimensionality Reduc=on Assump=on: Data lies on or near a low d- dimensional subspace Axes of this subspace
More information3. INNER PRODUCT SPACES
. INNER PRODUCT SPACES.. Definition So far we have studied abstract vector spaces. These are a generalisation of the geometric spaces R and R. But these have more structure than just that of a vector space.
More informationA note on companion matrices
Linear Algebra and its Applications 372 (2003) 325 33 www.elsevier.com/locate/laa A note on companion matrices Miroslav Fiedler Academy of Sciences of the Czech Republic Institute of Computer Science Pod
More informationMatrix Representations of Linear Transformations and Changes of Coordinates
Matrix Representations of Linear Transformations and Changes of Coordinates 01 Subspaces and Bases 011 Definitions A subspace V of R n is a subset of R n that contains the zero element and is closed under
More informationMATH10212 Linear Algebra. Systems of Linear Equations. Definition. An n-dimensional vector is a row or a column of n numbers (or letters): a 1.
MATH10212 Linear Algebra Textbook: D. Poole, Linear Algebra: A Modern Introduction. Thompson, 2006. ISBN 0-534-40596-7. Systems of Linear Equations Definition. An n-dimensional vector is a row or a column
More informationMath 550 Notes. Chapter 7. Jesse Crawford. Department of Mathematics Tarleton State University. Fall 2010
Math 550 Notes Chapter 7 Jesse Crawford Department of Mathematics Tarleton State University Fall 2010 (Tarleton State University) Math 550 Chapter 7 Fall 2010 1 / 34 Outline 1 Self-Adjoint and Normal Operators
More information1 Sets and Set Notation.
LINEAR ALGEBRA MATH 27.6 SPRING 23 (COHEN) LECTURE NOTES Sets and Set Notation. Definition (Naive Definition of a Set). A set is any collection of objects, called the elements of that set. We will most
More informationSimilar matrices and Jordan form
Similar matrices and Jordan form We ve nearly covered the entire heart of linear algebra once we ve finished singular value decompositions we ll have seen all the most central topics. A T A is positive
More informationBindel, Spring 2012 Intro to Scientific Computing (CS 3220) Week 3: Wednesday, Feb 8
Spaces and bases Week 3: Wednesday, Feb 8 I have two favorite vector spaces 1 : R n and the space P d of polynomials of degree at most d. For R n, we have a canonical basis: R n = span{e 1, e 2,..., e
More information13 MATH FACTS 101. 2 a = 1. 7. The elements of a vector have a graphical interpretation, which is particularly easy to see in two or three dimensions.
3 MATH FACTS 0 3 MATH FACTS 3. Vectors 3.. Definition We use the overhead arrow to denote a column vector, i.e., a linear segment with a direction. For example, in three-space, we write a vector in terms
More information1 0 5 3 3 A = 0 0 0 1 3 0 0 0 0 0 0 0 0 0 0
Solutions: Assignment 4.. Find the redundant column vectors of the given matrix A by inspection. Then find a basis of the image of A and a basis of the kernel of A. 5 A The second and third columns are
More information( ) which must be a vector
MATH 37 Linear Transformations from Rn to Rm Dr. Neal, WKU Let T : R n R m be a function which maps vectors from R n to R m. Then T is called a linear transformation if the following two properties are
More informationUniversity of Lille I PC first year list of exercises n 7. Review
University of Lille I PC first year list of exercises n 7 Review Exercise Solve the following systems in 4 different ways (by substitution, by the Gauss method, by inverting the matrix of coefficients
More informationSection 1.7 22 Continued
Section 1.5 23 A homogeneous equation is always consistent. TRUE - The trivial solution is always a solution. The equation Ax = 0 gives an explicit descriptions of its solution set. FALSE - The equation
More informationExamination paper for TMA4205 Numerical Linear Algebra
Department of Mathematical Sciences Examination paper for TMA4205 Numerical Linear Algebra Academic contact during examination: Markus Grasmair Phone: 97580435 Examination date: December 16, 2015 Examination
More informationLS.6 Solution Matrices
LS.6 Solution Matrices In the literature, solutions to linear systems often are expressed using square matrices rather than vectors. You need to get used to the terminology. As before, we state the definitions
More informationLinear Algebraic Equations, SVD, and the Pseudo-Inverse
Linear Algebraic Equations, SVD, and the Pseudo-Inverse Philip N. Sabes October, 21 1 A Little Background 1.1 Singular values and matrix inversion For non-smmetric matrices, the eigenvalues and singular
More informationSolving Linear Systems, Continued and The Inverse of a Matrix
, Continued and The of a Matrix Calculus III Summer 2013, Session II Monday, July 15, 2013 Agenda 1. The rank of a matrix 2. The inverse of a square matrix Gaussian Gaussian solves a linear system by reducing
More informationLinear Algebra: Determinants, Inverses, Rank
D Linear Algebra: Determinants, Inverses, Rank D 1 Appendix D: LINEAR ALGEBRA: DETERMINANTS, INVERSES, RANK TABLE OF CONTENTS Page D.1. Introduction D 3 D.2. Determinants D 3 D.2.1. Some Properties of
More informationx1 x 2 x 3 y 1 y 2 y 3 x 1 y 2 x 2 y 1 0.
Cross product 1 Chapter 7 Cross product We are getting ready to study integration in several variables. Until now we have been doing only differential calculus. One outcome of this study will be our ability
More informationInner Product Spaces
Math 571 Inner Product Spaces 1. Preliminaries An inner product space is a vector space V along with a function, called an inner product which associates each pair of vectors u, v with a scalar u, v, and
More informationMath 115A HW4 Solutions University of California, Los Angeles. 5 2i 6 + 4i. (5 2i)7i (6 + 4i)( 3 + i) = 35i + 14 ( 22 6i) = 36 + 41i.
Math 5A HW4 Solutions September 5, 202 University of California, Los Angeles Problem 4..3b Calculate the determinant, 5 2i 6 + 4i 3 + i 7i Solution: The textbook s instructions give us, (5 2i)7i (6 + 4i)(
More information1 VECTOR SPACES AND SUBSPACES
1 VECTOR SPACES AND SUBSPACES What is a vector? Many are familiar with the concept of a vector as: Something which has magnitude and direction. an ordered pair or triple. a description for quantities such
More information1 Solving LPs: The Simplex Algorithm of George Dantzig
Solving LPs: The Simplex Algorithm of George Dantzig. Simplex Pivoting: Dictionary Format We illustrate a general solution procedure, called the simplex algorithm, by implementing it on a very simple example.
More informationLinear Algebra Notes
Linear Algebra Notes Chapter 19 KERNEL AND IMAGE OF A MATRIX Take an n m matrix a 11 a 12 a 1m a 21 a 22 a 2m a n1 a n2 a nm and think of it as a function A : R m R n The kernel of A is defined as Note
More informationThe Determinant: a Means to Calculate Volume
The Determinant: a Means to Calculate Volume Bo Peng August 20, 2007 Abstract This paper gives a definition of the determinant and lists many of its well-known properties Volumes of parallelepipeds are
More informationEigenvalues and Eigenvectors
Chapter 6 Eigenvalues and Eigenvectors 6. Introduction to Eigenvalues Linear equations Ax D b come from steady state problems. Eigenvalues have their greatest importance in dynamic problems. The solution
More informationMath 215 HW #6 Solutions
Math 5 HW #6 Solutions Problem 34 Show that x y is orthogonal to x + y if and only if x = y Proof First, suppose x y is orthogonal to x + y Then since x, y = y, x In other words, = x y, x + y = (x y) T
More informationMehtap Ergüven Abstract of Ph.D. Dissertation for the degree of PhD of Engineering in Informatics
INTERNATIONAL BLACK SEA UNIVERSITY COMPUTER TECHNOLOGIES AND ENGINEERING FACULTY ELABORATION OF AN ALGORITHM OF DETECTING TESTS DIMENSIONALITY Mehtap Ergüven Abstract of Ph.D. Dissertation for the degree
More informationNumerical Methods I Solving Linear Systems: Sparse Matrices, Iterative Methods and Non-Square Systems
Numerical Methods I Solving Linear Systems: Sparse Matrices, Iterative Methods and Non-Square Systems Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 Course G63.2010.001 / G22.2420-001,
More informationMore than you wanted to know about quadratic forms
CALIFORNIA INSTITUTE OF TECHNOLOGY Division of the Humanities and Social Sciences More than you wanted to know about quadratic forms KC Border Contents 1 Quadratic forms 1 1.1 Quadratic forms on the unit
More informationVector and Matrix Norms
Chapter 1 Vector and Matrix Norms 11 Vector Spaces Let F be a field (such as the real numbers, R, or complex numbers, C) with elements called scalars A Vector Space, V, over the field F is a non-empty
More informationGROUP ALGEBRAS. ANDREI YAFAEV
GROUP ALGEBRAS. ANDREI YAFAEV We will associate a certain algebra to a finite group and prove that it is semisimple. Then we will apply Wedderburn s theory to its study. Definition 0.1. Let G be a finite
More informationContinued Fractions and the Euclidean Algorithm
Continued Fractions and the Euclidean Algorithm Lecture notes prepared for MATH 326, Spring 997 Department of Mathematics and Statistics University at Albany William F Hammond Table of Contents Introduction
More informationI. GROUPS: BASIC DEFINITIONS AND EXAMPLES
I GROUPS: BASIC DEFINITIONS AND EXAMPLES Definition 1: An operation on a set G is a function : G G G Definition 2: A group is a set G which is equipped with an operation and a special element e G, called
More informationLinear Algebra Notes for Marsden and Tromba Vector Calculus
Linear Algebra Notes for Marsden and Tromba Vector Calculus n-dimensional Euclidean Space and Matrices Definition of n space As was learned in Math b, a point in Euclidean three space can be thought of
More information6. Cholesky factorization
6. Cholesky factorization EE103 (Fall 2011-12) triangular matrices forward and backward substitution the Cholesky factorization solving Ax = b with A positive definite inverse of a positive definite matrix
More informationMATH 304 Linear Algebra Lecture 18: Rank and nullity of a matrix.
MATH 304 Linear Algebra Lecture 18: Rank and nullity of a matrix. Nullspace Let A = (a ij ) be an m n matrix. Definition. The nullspace of the matrix A, denoted N(A), is the set of all n-dimensional column
More informationLeast-Squares Intersection of Lines
Least-Squares Intersection of Lines Johannes Traa - UIUC 2013 This write-up derives the least-squares solution for the intersection of lines. In the general case, a set of lines will not intersect at a
More informationOrthogonal Bases and the QR Algorithm
Orthogonal Bases and the QR Algorithm Orthogonal Bases by Peter J Olver University of Minnesota Throughout, we work in the Euclidean vector space V = R n, the space of column vectors with n real entries
More informationRecall the basic property of the transpose (for any A): v A t Aw = v w, v, w R n.
ORTHOGONAL MATRICES Informally, an orthogonal n n matrix is the n-dimensional analogue of the rotation matrices R θ in R 2. When does a linear transformation of R 3 (or R n ) deserve to be called a rotation?
More informationName: Section Registered In:
Name: Section Registered In: Math 125 Exam 3 Version 1 April 24, 2006 60 total points possible 1. (5pts) Use Cramer s Rule to solve 3x + 4y = 30 x 2y = 8. Be sure to show enough detail that shows you are
More informationRecall that two vectors in are perpendicular or orthogonal provided that their dot
Orthogonal Complements and Projections Recall that two vectors in are perpendicular or orthogonal provided that their dot product vanishes That is, if and only if Example 1 The vectors in are orthogonal
More informationExamination paper for TMA4115 Matematikk 3
Department of Mathematical Sciences Examination paper for TMA45 Matematikk 3 Academic contact during examination: Antoine Julien a, Alexander Schmeding b, Gereon Quick c Phone: a 73 59 77 82, b 40 53 99
More informationChapter 17. Orthogonal Matrices and Symmetries of Space
Chapter 17. Orthogonal Matrices and Symmetries of Space Take a random matrix, say 1 3 A = 4 5 6, 7 8 9 and compare the lengths of e 1 and Ae 1. The vector e 1 has length 1, while Ae 1 = (1, 4, 7) has length
More informationCONTROLLABILITY. Chapter 2. 2.1 Reachable Set and Controllability. Suppose we have a linear system described by the state equation
Chapter 2 CONTROLLABILITY 2 Reachable Set and Controllability Suppose we have a linear system described by the state equation ẋ Ax + Bu (2) x() x Consider the following problem For a given vector x in
More information1 Homework 1. [p 0 q i+j +... + p i 1 q j+1 ] + [p i q j ] + [p i+1 q j 1 +... + p i+j q 0 ]
1 Homework 1 (1) Prove the ideal (3,x) is a maximal ideal in Z[x]. SOLUTION: Suppose we expand this ideal by including another generator polynomial, P / (3, x). Write P = n + x Q with n an integer not
More informationMultidimensional data and factorial methods
Multidimensional data and factorial methods Bidimensional data x 5 4 3 4 X 3 6 X 3 5 4 3 3 3 4 5 6 x Cartesian plane Multidimensional data n X x x x n X x x x n X m x m x m x nm Factorial plane Interpretation
More information