Singular Value Decomposition
This post is the third in a 3-part series on building geometric intuition about key concepts in linear algebra.
import matplotlib.pyplot as plt
import numpy as np
%matplotlib inline
%config InlineBackend.figure_format = 'retina'
from plot_helper import *
#
    What if M is not a symmetric matrix? ¶
In the previous post , we looked at the geometric intuition behind decomposing the transformation of a real, symmetric matrix $M$. Now, let's look at what happens when $M$ is real, but not symmetric. For example,
$$ M = \begin{pmatrix} 2 & 0.1 \\ 1 & 2 \\ \end{pmatrix} $$Let us inspect the eigenvalues and eigenvectors of $M$.
M = np.array([[2, 0.1],[1, 2]])
evals, C = np.linalg.eig(M)
D = np.diag(evals)
print(f"Eigenvalues (D)\n {D} \n\n Eigenvectors (C)\n {C}")
#
    Check the eigenvectors ¶
- They are not orthogonal
- They are not along the major & minor axes of the ellipse (after transformation)
plot_linear_transformation(M, C[:,0], C[:,1], unit_vector=False, unit_circle=True)
#
    Vectors along the major / minor axes of ellipse ¶
Let's denote the vectors along the major / minor axes of ellipse on the right as $u_1,u_2$.
alpha = np.linspace(0, 2*numpy.pi, 5001)
circle = np.vstack((np.cos(alpha), np.sin(alpha)))
ellipse = M @ circle    
lengths = np.linalg.norm(ellipse, axis=0)
u1 = ellipse[:,np.argmax(lengths)]
u2 = ellipse[:,np.argmin(lengths)]
U_v = np.column_stack((u1, u2))
print(f"Major \n {U_v[:,0]} \n\n Minor \n {U_v[:,1]}")
#
    Compute the representation of major / minor before transformation ¶
- Let $V = [v_1, v_2]$ denote the coordinate representations of $U_v = [u_1,u_2]$ before the transformation.
- 
     Let's plot the $u$ and $v$ vectors below.
     - Note that the vectors are orthogonal.
- Note also that the $u$ vectors now correspond to the major / minor axes of the ellipse on the right.
 
M_inv = np.linalg.inv(M)
V = M_inv @ U_v
plot_linear_transformation(M, V[:,0], V[:,1], unit_vector=False, unit_circle=True)
#
    Let's now normalize the column vectors in $U_v$
- Let $L$ store the lengths of the column vectors in $U_v$
- Let $U$ be the normalised column vectors in $U_v$
- Make $L$ a diagonal matrix so that $U_v = UL = MV$
L = np.linalg.norm(U_v, axis=0)
U = U_v/L
L = np.diag(L)
#
    assert np.allclose(U@L, M@V)
    Singular Value Decomposition of $M$ ¶
\begin{align} UL &= MV \\ \therefore M &= ULV^{-1} = ULV^T \end{align}- As $V$ is an orthogonal matrix, $V^{-1} = V^T$
- The columns of $U$ are the left singular vectors of $M$
- The diagonal entries of $L$ are the singular values of $M$
- The columns of $V$ are the right singular vectors of $M$
assert np.allclose(M, U@L@V.T)
    Compare the linear transformation by $M$ vs. $ULV^T$ ¶
plot_linear_transformation(M, unit_circle=True)
    plot_linear_transformations(V.T,L,U, unit_circle=True)
    The linear transformation by $M$ is decomposed into these steps:
- $V^T$ first rotates the coordinate axes
- $L$ stretches the circle into an ellipse
- $U$ rotates the coordinate axes again
Let's now compare this decomposition with the SVD implementation from Numpy.
N_U, N_L, N_VT = np.linalg.svd(M)
    As we see below, the blue vectors (from Numpy SVD) are aligned with the green vectors (computed above).
with plt.rc_context():
    plt.rc("figure", dpi=160)
    fig, axes = plt.subplots(ncols=2, figsize=(6,3))
    for ax, (mat_svd, mat_above) in zip(axes.flat, [(N_U, U),(N_VT.T,V)]): 
        ax.spines['left'].set_position('center')
        ax.spines['bottom'].set_position('center')
        ax.spines['right'].set_color('none')
        ax.spines['top'].set_color('none')
        ax.set_xlim([-1,1])
        ax.set_ylim([-1,1])
        for column in mat_svd.T:
            ax.arrow(0,0,*column,head_width=0.05, head_length=0.1, fc='b', ec='b', ls='-.')
        for column in mat_above.T:
            ax.arrow(0,0,*column,head_width=0.05, head_length=0.1, fc='g', ec='g', alpha=0.6)
#
    Let's now compare $V$ from Numpy SVD to the eigenvectors of $M^T M$.
N_V = N_VT.T
D, Q = np.linalg.eig(M.T@M)
print(f"V from Numpy SVD of M\n {N_V} \n\n Eigenvectors\n {Q}")
#
    We see that $V$ contains the eigenvectors of $M^TM$. The following derivation explains why this is the case.
Eigendecomposition of $M^T M$ vs. SVD of $M$ ¶
Given $M = U L V^T$, \begin{align} M^TM =& ~(U L V^T)^T (U L V^T) \\ =& ~V L^2 V^T \end{align}
We also see that the singular values $L$ of $M$ are the square roots of the eigenvalues of $M^TM$.
assert np.allclose(np.diag(D), L**2)
    Similarly, $U$ contains the eigenvectors of $MM^T$.