School of Economics and Management
Beihang University
http://yanfei.site

Non-square matrices

  • Recall that if the matrix A is square (real or complex) then a diagonalisation may exist.
    • This is clearly very useful for easy calculation of many important problems as we saw last week.
    • If a diagonalisation doesn't exist, then there is always a triangularisation via Schur Decomposition.
  • But non-square matrices don’t have eigenvalues, so what can we do?
  • You are about to learn the most useful diagonal decomposition that works for all matrices: Singular Value Decomposition.

Singular values

  • Singular values are the square roots of the eigenvalues of \(A^TA\) which is square and symmetric
  • The singular vectors (\(u\) and \(v\)) come in a pair for each singular value \(\sigma\), such that \[A v = \sigma u.\]

Generalising Eigen-Decomposition

  • Eigendecomposition involves only one eigenvector for each eigenvalue (including multiplicities), stored in an orthogonal matrix \(Q\), with eigenvalues on the diagonal of the matrix \(\Lambda\), so that \(A=Q\lambda Q^T\).
  • We can generalise this now that we have singular vectors \(u\) and \(v\) for each singular value \(\sigma\).

Singular Value Decomposition (SVD)

For \(A \in \mathcal{R}^{m \times n}\), there exists orthogonal matrices \[U = [u_1, \cdots, u_m] \in \mathcal{R}^{m\times m}\] and \[V = [v_1, \cdots, v_n] \in \mathcal{R}^{n\times n}\] such that \[U^TAV = \Sigma = \text{diag}\{\sigma_1, \cdots, \sigma_p\} \in \mathcal{R}^{m\times n},\] with \(p = \min\{m, n\}\) and \(\sigma_1 \geq \dots \geq \sigma_p \geq 0\).

Rearranging, we have \[A = U\Sigma V^T\].

SVD

Try svd() in R.

Some properties of SVD

  • \(\sigma_i\) are singular values of \(A\).
  • The non-zero singular values of \(A\) are the square roots of the non-zero eigenvalues of both \(A^TA\) and \(AA^T\).
  • The rank of a matrix is equal to the number of non-zero singular values.
  • The condition number measures the degree of singularity of \(A^TA\): \[\kappa = \frac{\text{max singular value}}{\text{min singular value}}.\]

Summary

  • SVD: Decomposition of any matrix \(A\).
  • It works by eigendecomposition of \(A^TA\) (or \(AA^T\)) which is square and symmetric.
  • We are now able to associate an orthogonal diagonal form with every matrix, and easily calculate useful properties of the matrix.
  • Over the next few lectures we will look at the fantastic applications of SVD.

Lab session

Peek into SVD and PCA in R, illustrate their relationship and write a short report.