Equivalence of QR Iteration and Simultaneous Iteration

Equivalence of QR Iteration and Simultaneous Iteration (Theorem # 1412)

Theorem

Edit Issues Pull Requests Attributions Admin

Discussion

No discussion available for this theorem.

Proof

[proofplan] We proceed by induction on $k$. The base case $k = 0$ is immediate since $X^{(0)} = I$. For the inductive step, we use the simultaneous iteration relation $AX^{(k)} = X^{(k+1)}R^{(k+1)}$ to identify a QR factorisation of $A^{(k)}$, then show that the QR iteration update $A^{(k+1)} = R^{(k+1)}Q^{(k+1)}$ coincides with $(X^{(k+1)})^\top A X^{(k+1)}$. [/proofplan] [step:Verify the base case $k = 0$] At $k = 0$, simultaneous iteration starts with $X^{(0)} = I$, so \begin{align*} (X^{(0)})^\top A X^{(0)} = I^\top A I = A = A^{(0)}, \end{align*} which matches the QR iteration initialisation $A^{(0)} = A$. [/step] [step:Identify the QR factorisation of $A^{(k)}$ from the simultaneous iteration step] Assume $(X^{(k)})^\top A X^{(k)} = A^{(k)}$ as the induction hypothesis. The simultaneous iteration algorithm computes the QR factorisation \begin{align*} AX^{(k)} = X^{(k+1)} \hat{R}, \end{align*} where $X^{(k+1)}$ has orthonormal columns and $\hat{R}$ is upper triangular. Define \begin{align*} Q := (X^{(k)})^\top X^{(k+1)}, \qquad R := \hat{R}. \end{align*} Since both $X^{(k)}$ and $X^{(k+1)}$ have orthonormal columns (and are square when $p = n$; for the basic QR iteration, $X^{(0)} = I \in \mathbb{R}^{n \times n}$), the matrix $Q$ is orthogonal: \begin{align*} Q^\top Q = (X^{(k+1)})^\top X^{(k)} (X^{(k)})^\top X^{(k+1)} = (X^{(k+1)})^\top X^{(k+1)} = I. \end{align*} We verify that $A^{(k)} = QR$. Left-multiplying $AX^{(k)} = X^{(k+1)}\hat{R}$ by $(X^{(k)})^\top$: \begin{align*} (X^{(k)})^\top A X^{(k)} = (X^{(k)})^\top X^{(k+1)} \hat{R} = QR. \end{align*} By the induction hypothesis, $(X^{(k)})^\top A X^{(k)} = A^{(k)}$, so $A^{(k)} = QR$ is a QR factorisation. [guided] The connection between simultaneous iteration and QR iteration hinges on this observation: the orthogonal factor $Q$ in the QR decomposition of $A^{(k)}$ is not an arbitrary orthogonal matrix — it is the *change-of-basis matrix* $(X^{(k)})^\top X^{(k+1)}$ between consecutive iterate frames. The upper triangular factor $R$ is the same $\hat{R}$ produced by the simultaneous iteration's QR step. Why does this work? Both algorithms perform the same algebraic operation viewed from different coordinate systems. Simultaneous iteration works in the original basis and explicitly tracks $X^{(k)}$. QR iteration works in the frame of $X^{(k)}$ (via the similarity $A^{(k)} = (X^{(k)})^\top A X^{(k)}$) and never sees $X^{(k)}$ directly. [/guided] [/step] [step:Complete the induction by computing $A^{(k+1)} = RQ$] The basic QR iteration defines $A^{(k+1)} = RQ$ (writing the factors in reverse order). We compute: \begin{align*} A^{(k+1)} = RQ = \hat{R} \cdot (X^{(k)})^\top X^{(k+1)}. \end{align*} From $AX^{(k)} = X^{(k+1)}\hat{R}$, left-multiplying by $(X^{(k+1)})^\top$: \begin{align*} (X^{(k+1)})^\top A X^{(k)} = (X^{(k+1)})^\top X^{(k+1)} \hat{R} = \hat{R}. \end{align*} Therefore \begin{align*} A^{(k+1)} &= RQ = (X^{(k+1)})^\top A X^{(k)} \cdot (X^{(k)})^\top X^{(k+1)} \\ &= (X^{(k+1)})^\top A \bigl(X^{(k)} (X^{(k)})^\top\bigr) X^{(k+1)}. \end{align*} Since $X^{(k)}$ is a square orthogonal matrix (both algorithms start with $X^{(0)} = I \in \mathbb{R}^{n \times n}$ and preserve orthonormality), $X^{(k)}(X^{(k)})^\top = I$. Thus \begin{align*} A^{(k+1)} = (X^{(k+1)})^\top A X^{(k+1)}, \end{align*} completing the induction. [guided] The critical step is the insertion of $X^{(k)}(X^{(k)})^\top = I$ in the middle. This identity holds because $X^{(k)} \in \mathbb{R}^{n \times n}$ is orthogonal (the simultaneous iteration with $X^{(0)} = I$ produces square orthogonal matrices at every step). Without this, the product $X^{(k)}(X^{(k)})^\top$ would be a projection rather than the identity, and the factorisation would fail. The result establishes that the QR iteration — which is computationally cheaper because it only manipulates $n \times n$ matrices without explicitly tracking the basis $X^{(k)}$ — produces the same sequence of projected matrices as simultaneous iteration. In particular, the convergence theory for simultaneous iteration (the [Technical Convergence Lemma](/theorems/1410) and the [Convergence of Simultaneous Iteration](/theorems/1411)) transfers directly to the QR iteration: the diagonal entries of $A^{(k)}$ converge to the eigenvalues of $A$. [/guided] [/step]

Explore Further

Convergence for Diagonally Dominant Matrices Numerical Analysis Taylor Expansion of the Second Central Difference Numerical Analysis Error Projection Along Search Direction Numerical Analysis Chebyshev Coefficients of a Derivative Numerical Analysis Strict Diagonal Dominance Implies Nonsingularity Numerical Analysis Jacobi Convergence for SPD-like Matrices Numerical Analysis Gradient Equals Negative Residual Numerical Analysis Inverse DFT Numerical Analysis

What brings you to Androma?

Start with a route through the knowledge graph.