Lecture 21

[[lecture-data]]

2024-10-16

Readings

  • a
 

7. Chapter 7

We are talking about the singular value decomposition. Today: see how it can be used for generalized inverses

Recall, if we have a matrix and then we have generalized inverse when some of the conditions are met:

  1. is hermitian
  2. is hermitian

Note

if is square and invertible, then that satisfies all 4 conditions is the inverse of ie .

We can think of these in terms of the linear system .

  • if is square invertible, then
  • If is “tall and skinny” we want something to get the correct shape of . If a system has a solution, then a 1-generalized inverse will give the solution . And if , then the system is consistent (ie a solution exists)

Theorem

Suppose that and are given. Suppose is a 1-2generalized inverse of . Then solves in a least squares sense. ie, has optimal solution .

can be expressed as for some . We show that is minimized when .

Any vector in

\lvert \lvert A(Bb+y) - b \rvert \rvert _{2}^2 &= [(AB-I)b + Ay]^*[(AB - I)b + Ay] \\ &= \lvert \lvert (AB-I)b \rvert \rvert _{2}^2 + \lvert \lvert Ay \rvert \rvert_{2}^2 + \lvert \lvert Ay \rvert \rvert_{2}^2 + y^*A^*(AB-I)b + b^*(AB-I)^* Ay \\ (*) &= \lvert \lvert (AB -I)b \rvert \rvert_{2}^2 + \lvert \lvert Ay \rvert \rvert_{2}^2 \end{aligned}$$ $(*)$ we see that $[A^*(AB-I)]^* = (AB-I)A = ABA-A =0$ since $B$ is a 1, 2 [[Concept Wiki/generalized inverse]] (ie, $AB$ is [[Concept Wiki/hermitian]] and $ABA=A$). ie, the expression depends only on the term $\lvert \lvert Ay \rvert \rvert_{2}^2$, which is minimized precisely when $\lvert \lvert Ay \rvert \rvert=0$ such as when $y =0$ for example. (the first term of the expression is constant). - thus the solutions to the least squares problem is the set $\{ Bb+y : y \in \text{Null}(A) \}$

(see a 1-2-generalized inverse gives a least squares optimal solution)

Theorem

There exists a unique 1-2-3-4 generalized inverse for every matrix called the Moore-Penrose inverse (or pseudoinverse). And if is real valued, then this inverse is also real valued.

are both 1-2-3-4 generalized inverses for . NTS .

(Uniqueness first). Suppose

  • Claim 1:
  • since (1 generalized inverse)
  • and are both hermitian. so we have
  • since
  • But is hermitian, so we get .
  • Claim 2: argued analogously to the first claim
  • since
  • Then since both and are hermitian
  • since is hermitian Consider .
  • By claim 2, we have
  • By claim 1, we have .
  • And by property 3 of the generalized inverses, we get .

(Existence) Let us first consider a special case. Suppose is “diagonal”. Ie, . Define “diagonal”. And for all , we have if and otherwise. Then is a 1-2-3-4 generalized inverse for . We can check this easily

  • clearly is hermitian, same with .

Now, what happens if we have a matrix with a 1-2-3-4 generalized inverse ? Let be unitary, also unitary. Then has a 1-2-3-4 generalized inverse . We can show this easily:

  • . Then since is 1-2-3-4 generalized inverse we get
  • is hermitian since is hermitian
  • (And the other two conditions are shown exactly analogously)

Say is an SVD of . Then the moore-penrose inverse of is by the two above facts.

  • if is real, then the SVD is real and so the pseudoinverse is also real.

(see Moore-Penrose inverse)

Theorem

Let be given. Among the solutions to , we have that is a unique solution of the minimum euclidian norm.

. Let be an SVD and the rank of is .

Recall that the solutions of the least squares problem are exactly

  • since is (almost) an SVD. Almost because the sigmas in are not necessarily non-decreasing. (recall that for SVD we assume that the singular values are ordered)

Thus for all we have since we can see that from the above. Thus for any , we have

  • the minimum occurs precisely when !

(see the psuedoinverse gives the least norm solution to the least squares problem)