Lecture 21

[[lecture-data]]

2024-10-16

Readings

7. Chapter 7

We are talking about the singular value decomposition. Today we see how it can be used to construct generalized inverses.

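Recall: given matrices $A \in M_{m, n}$ and $B \in M_{n, m}$ , we call $B$ a generalized inverse of $A$ when it satisfies some subset of the following four conditions (a "1-2 generalized inverse" satisfies conditions 1 and 2, and so on):

1. $A B A = A$
2. $A B$ is hermitian
3. $B A B = B$
4. $B A$ is hermitian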

Note

If $A$ is square and invertible, then the $B$ that satisfies all 4 conditions is the inverse of $A$ , ie $B = A^{- 1}$ .
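As a quick numerical sanity check of the note above, the four conditions can be tested directly. This is a minimal sketch assuming NumPy; the helper name `penrose_conditions` and the tolerance are illustrative choices, and the conditions are numbered in the order they are listed above.

```python
import numpy as np

def penrose_conditions(A, B, tol=1e-10):
    """Report which of the four conditions B satisfies for A (numbered in list order)."""
    AB, BA = A @ B, B @ A
    return {
        1: np.allclose(A @ B @ A, A, atol=tol),      # A B A = A
        2: np.allclose(AB, AB.conj().T, atol=tol),   # A B is hermitian
        3: np.allclose(B @ A @ B, B, atol=tol),      # B A B = B
        4: np.allclose(BA, BA.conj().T, atol=tol),   # B A is hermitian
    }

# A square invertible example: the ordinary inverse should pass all four checks.
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
checks = penrose_conditions(A, np.linalg.inv(A))
```

Here `checks` maps each condition number to `True` or `False`, so partial generalized inverses can be inspected the same way.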

We can think of these in terms of the linear system $A x = b$ .

If $A$ is square and invertible, then $x = A^{- 1} b$ .
If $A$ is "tall and skinny" we want something of the correct shape to produce $x$ . If the system has a solution $x_{0}$ , then a 1-generalized inverse $B$ gives a solution $x = B b$ , because $A B b = A B A x_{0} = A x_{0} = b$ . Conversely, if $A B b = b$ , then $x = B b$ is a solution, so the system is consistent (ie a solution exists).
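The consistency test $A B b = b$ can be tried out numerically. The sketch below assumes NumPy and uses `np.linalg.pinv` only as a convenient way to obtain some 1-generalized inverse; the matrix, the right-hand sides, and the helper name `is_consistent` are made up for illustration.

```python
import numpy as np

def is_consistent(A, B, b, tol=1e-10):
    """Check A B b = b, where B is assumed to be a 1-generalized inverse of A."""
    return np.allclose(A @ (B @ b), b, atol=tol)

# A "tall and skinny" matrix; its range is {(s, t, s + t)}.
A = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
B = np.linalg.pinv(A)              # one particular generalized inverse (assumed choice)

x0 = np.array([1.0, 2.0])
b_good = A @ x0                    # lies in the range of A, so A x = b_good is solvable
b_bad = np.array([1.0, 2.0, 0.0])  # third entry is not the sum of the first two

good = is_consistent(A, B, b_good)
bad = is_consistent(A, B, b_bad)
```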

Theorem

Suppose that $A \in M_{m, n}$ and $b \in C^{m}$ are given. Suppose $B \in M_{n, m}$ is a 1-2 generalized inverse of $A$ . Then $x = B b$ solves $A x = b$ in a least squares sense. ie,

$$
\min_{x \in C^{n}} \| A x - b \|_{2}
$$

has optimal solution $\hat{x} = B b$ .

Proof

Any vector in $C^{n}$ can be expressed as $x = B b + y$ for some $y \in C^{n}$ . We show that $| | A (B b + y) - b | |_{2}^{2}$ is minimized when $y = 0$ .

$$
\begin{aligned}
\| A (B b + y) - b \|_{2}^{2} &= [(A B - I) b + A y]^{*} [(A B - I) b + A y] \\
&= \| (A B - I) b \|_{2}^{2} + \| A y \|_{2}^{2} + y^{*} A^{*} (A B - I) b + b^{*} (A B - I)^{*} A y \\
&\overset{(*)}{=} \| (A B - I) b \|_{2}^{2} + \| A y \|_{2}^{2}
\end{aligned}
$$

$(*)$ The cross terms vanish: since $B$ is a 1-2 generalized inverse (ie, $A B A = A$ and $A B$ is hermitian), we have $[A^{*} (A B - I)]^{*} = (A B - I)^{*} A = (A B - I) A = A B A - A = 0$ , and hence $A^{*} (A B - I) = 0$ as well.

ie, the expression depends on $y$ only through the term $\| A y \|_{2}^{2}$ (the first term is constant), which is minimized precisely when $\| A y \|_{2} = 0$ , ie when $y \in Null (A)$ ; in particular $y = 0$ works.

Thus the set of solutions to the least squares problem is $\{B b + y : y \in Null (A)\}$ .

(see a 1-2-generalized inverse gives a least squares optimal solution)
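The theorem can be illustrated numerically. The following is a sketch assuming NumPy, with `np.linalg.pinv(A)` standing in for a 1-2 generalized inverse (it satisfies all four conditions, so in particular the first two) and `np.linalg.lstsq` used as an independent reference. The matrix and right-hand side are made-up examples.

```python
import numpy as np

A = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
b = np.array([1.0, 2.0, 0.0])     # not in the range of A, so A x = b has no exact solution

B = np.linalg.pinv(A)             # stand-in for a 1-2 generalized inverse of A
x_hat = B @ b                     # candidate least squares solution

def residual(x):
    """Euclidean norm of A x - b."""
    return np.linalg.norm(A @ x - b)

# Independent reference solution from NumPy's least squares solver.
x_ref, *_ = np.linalg.lstsq(A, b, rcond=None)

# Moving away from x_hat should never decrease the residual.
rng = np.random.default_rng(0)
perturbed = [residual(x_hat + rng.normal(size=2)) for _ in range(5)]
```

Since this `A` has full column rank, $Null (A) = \{0\}$ and the least squares solution is unique, which is why every perturbation increases the residual.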

Theorem

There exists a unique 1-2-3-4 generalized inverse for every matrix $A \in M_{m, n}$ called the Moore-Penrose inverse (or pseudoinverse). And if $A$ is real valued, then this inverse is also real valued.

Proof

(Uniqueness first). Suppose $B, C \in M_{n, m}$ are both 1-2-3-4 generalized inverses for $A$ . We need to show $B = C$ .

Claim 1: $A B = A C$
$A B = A C A B$ since $A = A C A$ ( $C$ is a 1 generalized inverse)
$A C$ and $A B$ are both hermitian, so we have
$A B = (A C) (A B) = (A C)^{*} (A B)^{*} = C^{*} A^{*} B^{*} A^{*} = C^{*} (A B A)^{*} = C^{*} A^{*}$ since $A B A = A$
But $C^{*} A^{*} = (A C)^{*} = A C$ since $A C$ is hermitian, so we get $A B = A C$ .
Claim 2: $B A = C A$ , argued analogously to the first claim
$B A = B A C A$ since $A = A C A$
Then $B A = (B A) (C A) = (B A)^{*} (C A)^{*} = A^{*} B^{*} A^{*} C^{*}$ since both $B A$ and $C A$ are hermitian
$B A = (A B A)^{*} C^{*} = A^{*} C^{*} = (C A)^{*} = C A$ since $A B A = A$ and $C A$ is hermitian
Finally, consider $C A B$ .
By claim 2, we have $B A B = C A B$
By claim 1, we have $C A B = C A C$ .
And by property 3 of the generalized inverses (applied to both $B$ and $C$ ), we get $B = B A B = C A B = C A C = C$ .

(Existence)
Let us first consider a special case. Suppose $Σ \in M_{m, n}$ is "diagonal", ie $Σ_{i j} = 0$ whenever $i \neq j$ . Define $Σ^{†} \in M_{n, m}$ , also "diagonal", by $(Σ^{†})_{i i} = \frac{1}{Σ_{i i}}$ if $Σ_{i i} \neq 0$ and $(Σ^{†})_{i i} = 0$ otherwise. Then $Σ^{†}$ is a 1-2-3-4 generalized inverse for $Σ$ . We can check this easily:

Both $Σ Σ^{†}$ and $Σ^{†} Σ$ are square diagonal matrices with only $0$ and $1$ on the diagonal, so they are clearly hermitian.
$Σ Σ^{†} Σ = Σ$
$Σ^{†} Σ Σ^{†} = Σ^{†}$
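The special case is easy to spell out in code. The sketch below assumes NumPy; the function name `diag_pinv` and the tolerance used to decide which diagonal entries count as zero are illustrative assumptions, not part of the lecture.

```python
import numpy as np

def diag_pinv(S, tol=1e-12):
    """Pseudoinverse of a rectangular 'diagonal' matrix S (zero off the main diagonal)."""
    m, n = S.shape
    out_dtype = np.result_type(S.dtype, np.float64)   # avoid integer truncation
    S_dag = np.zeros((n, m), dtype=out_dtype)         # note the transposed shape
    for i in range(min(m, n)):
        if abs(S[i, i]) > tol:                        # invert only the nonzero entries
            S_dag[i, i] = 1.0 / S[i, i]
    return S_dag

S = np.array([[3.0, 0.0, 0.0],
              [0.0, 0.0, 0.0]])
S_dag = diag_pinv(S)
```

The zero diagonal entry of `S` stays zero in `S_dag`, which is exactly the "and $0$ otherwise" case of the definition.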

Now, what happens if we have a matrix $Y \in M_{m, n}$ with a 1-2-3-4 generalized inverse $Z \in M_{n, m}$ ? Let $U \in M_{m}$ be unitary, $V \in M_{n}$ also unitary. Then $U Y V^{*}$ has a 1-2-3-4 generalized inverse $V Z U^{*}$ . We can show this easily:

$[U Y V^{*}] [V Z U^{*}] [U Y V^{*}] = U Y Z Y V^{*}$ , using $V^{*} V = I$ and $U^{*} U = I$ . Then since $Z$ is a 1-2-3-4 generalized inverse of $Y$ we get $U Y Z Y V^{*} = U Y V^{*}$
$[U Y V^{*}] [V Z U^{*}] = U Y Z U^{*}$ is hermitian since $Y Z$ is hermitian
(And the other two conditions are shown exactly analogously)

Say $A = U Σ V^{*}$ is an SVD of $A$ . Then the Moore-Penrose inverse of $A$ is $A^{†} = V Σ^{†} U^{*}$ by the two above facts.

If $A$ is real, then the SVD can be chosen with real $U$ , $Σ$ , $V$ , and so the pseudoinverse is also real.

(see Moore-Penrose inverse)
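The existence proof is constructive, so it translates directly into a short routine. This is a sketch assuming NumPy's `np.linalg.svd`, which returns $V^{*}$ (here called `Vh`) rather than $V$ ; the name `pinv_via_svd` and the cutoff `tol` for treating a singular value as zero are illustrative assumptions.

```python
import numpy as np

def pinv_via_svd(A, tol=1e-12):
    """Moore-Penrose inverse built as V Sigma^dagger U^* from a full SVD of A."""
    m, n = A.shape
    U, s, Vh = np.linalg.svd(A, full_matrices=True)   # A = U @ Sigma @ Vh
    S_dag = np.zeros((n, m))                          # Sigma^dagger is n-by-m
    for i, sigma in enumerate(s):
        if sigma > tol:                               # invert nonzero singular values only
            S_dag[i, i] = 1.0 / sigma
    return Vh.conj().T @ S_dag @ U.conj().T

# Rank-deficient real example (second column is twice the first).
A = np.array([[1.0, 2.0],
              [2.0, 4.0],
              [3.0, 6.0]])
A_dag = pinv_via_svd(A)

# Full-rank example for comparison with NumPy's built-in pseudoinverse.
F = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
F_dag = pinv_via_svd(F)
```

In practice `np.linalg.pinv` does the same job with its own cutoff rule; the hand-written version is only meant to mirror the proof.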

Theorem

Let $A \in M_{m, n}, b \in C^{m}$ be given. Among the solutions to $\min_{x \in C^{n}} \| A x - b \|_{2}$ , $A^{†} b$ is the unique solution of minimum Euclidean norm.

Proof

Recall that the solutions of the least squares problem are exactly $\{A^{†} b + y : y \in Null (A)\}$ . Let $A = U Σ V^{*}$ be an SVD, let $v_{1}, \dots, v_{n}$ be the columns of $V$ , and let $k$ be the rank of $A$ .

$Null (A) = span \{v_{k + 1}, v_{k + 2}, \dots, v_{n}\}$
$Range (A^{†}) = span \{v_{1}, v_{2}, \dots, v_{k}\}$ since $A^{†} = V Σ^{†} U^{*}$ is (almost) an SVD. Almost because the diagonal entries of $Σ^{†}$ are not necessarily in non-increasing order. (recall that for an SVD we assume that the singular values are ordered from largest to smallest)

Thus for all $y \in Null (A)$ we have $y ⊥ A^{†} b$ since we can see that $range (A^{†}) ⊥ Null (A)$ from the above. Thus for any $y \in Null (A)$ , we have

$$
\| A^{†} b + y \|_{2}^{2} = [A^{†} b + y]^{*} [A^{†} b + y] = \| A^{†} b \|_{2}^{2} + \| y \|_{2}^{2} + 0 + 0
$$

so the minimum occurs precisely when $y = 0$ !

(see the pseudoinverse gives the least norm solution to the least squares problem)
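The minimum-norm property can also be observed numerically. The sketch below assumes NumPy and a made-up rank-deficient example; it takes the last $n - k$ rows of `Vh` from `np.linalg.svd` as a basis of $Null (A)$ , matching the description of the null space in the proof (with 0-based row indices in code).

```python
import numpy as np

A = np.array([[1.0, 1.0, 0.0],
              [0.0, 0.0, 0.0]])
b = np.array([2.0, 1.0])

x_dag = np.linalg.pinv(A) @ b              # pseudoinverse solution

# Rows k, ..., n-1 of Vh correspond to v_{k+1}, ..., v_n, which span Null(A).
k = np.linalg.matrix_rank(A)
_, _, Vh = np.linalg.svd(A)
null_basis = Vh[k:].conj().T               # columns span Null(A)

# Other least squares solutions: x_dag plus a random null space vector.
rng = np.random.default_rng(1)
others = [x_dag + null_basis @ rng.normal(size=null_basis.shape[1])
          for _ in range(5)]

res_dag = np.linalg.norm(A @ x_dag - b)
res_others = [np.linalg.norm(A @ x - b) for x in others]
norm_dag = np.linalg.norm(x_dag)
norm_others = [np.linalg.norm(x) for x in others]
```

All candidates share the same residual, while `x_dag` has the smallest norm, which is the content of the theorem.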