[[lecture-data]]2024-10-16
Readings
- a
7. Chapter 7
We are talking about the singular value decomposition. Today: see how it can be used for generalized inverses
Recall, if we have a matrix and then we have generalized inverse when some of the conditions are met:
Note
if is square and invertible, then that satisfies all 4 conditions is the inverse of ie .
We can think of these in terms of the linear system .
- if is square invertible, then
- If is “tall and skinny” we want something to get the correct shape of . If a system has a solution, then a 1-generalized inverse will give the solution . And if , then the system is consistent (ie a solution exists)
Theorem
Suppose that and are given. Suppose is a 1-2generalized inverse of . Then solves in a least squares sense. ie, has optimal solution .
can be expressed as for some . We show that is minimized when .
Any vector in
\lvert \lvert A(Bb+y) - b \rvert \rvert _{2}^2 &= [(AB-I)b + Ay]^*[(AB - I)b + Ay] \\ &= \lvert \lvert (AB-I)b \rvert \rvert _{2}^2 + \lvert \lvert Ay \rvert \rvert_{2}^2 + \lvert \lvert Ay \rvert \rvert_{2}^2 + y^*A^*(AB-I)b + b^*(AB-I)^* Ay \\ (*) &= \lvert \lvert (AB -I)b \rvert \rvert_{2}^2 + \lvert \lvert Ay \rvert \rvert_{2}^2 \end{aligned}$$ $(*)$ we see that $[A^*(AB-I)]^* = (AB-I)A = ABA-A =0$ since $B$ is a 1, 2 [[Concept Wiki/generalized inverse]] (ie, $AB$ is [[Concept Wiki/hermitian]] and $ABA=A$). ie, the expression depends only on the term $\lvert \lvert Ay \rvert \rvert_{2}^2$, which is minimized precisely when $\lvert \lvert Ay \rvert \rvert=0$ such as when $y =0$ for example. (the first term of the expression is constant). - thus the solutions to the least squares problem is the set $\{ Bb+y : y \in \text{Null}(A) \}$(see a 1-2-generalized inverse gives a least squares optimal solution)
Theorem
There exists a unique 1-2-3-4 generalized inverse for every matrix called the Moore-Penrose inverse (or pseudoinverse). And if is real valued, then this inverse is also real valued.
are both 1-2-3-4 generalized inverses for . NTS .
(Uniqueness first). Suppose
- Claim 1:
- since (1 generalized inverse)
- and are both hermitian. so we have
- since
- But is hermitian, so we get .
- Claim 2: argued analogously to the first claim
- since
- Then since both and are hermitian
- since is hermitian Consider .
- By claim 2, we have
- By claim 1, we have .
- And by property 3 of the generalized inverses, we get .
(Existence) Let us first consider a special case. Suppose is “diagonal”. Ie, . Define “diagonal”. And for all , we have if and otherwise. Then is a 1-2-3-4 generalized inverse for . We can check this easily
- clearly is hermitian, same with .
Now, what happens if we have a matrix with a 1-2-3-4 generalized inverse ? Let be unitary, also unitary. Then has a 1-2-3-4 generalized inverse . We can show this easily:
- . Then since is 1-2-3-4 generalized inverse we get
- is hermitian since is hermitian
- (And the other two conditions are shown exactly analogously)
Say is an SVD of . Then the moore-penrose inverse of is by the two above facts.
- if is real, then the SVD is real and so the pseudoinverse is also real.
(see Moore-Penrose inverse)
Theorem
Let be given. Among the solutions to , we have that is a unique solution of the minimum euclidian norm.
. Let be an SVD and the rank of is .
Recall that the solutions of the least squares problem are exactly
- since is (almost) an SVD. Almost because the sigmas in are not necessarily non-decreasing. (recall that for SVD we assume that the singular values are ordered)
Thus for all we have since we can see that from the above. Thus for any , we have
- the minimum occurs precisely when !
(see the psuedoinverse gives the least norm solution to the least squares problem)