[guided]The purpose of this step is to show that the modified algorithm has not changed the mathematical residual; it has only changed the order in which the projection components are removed. We prove the precise formula
\begin{align*}
w_{k,j}=v_k-\sum_{i=1}^{j}(v_k,e_i)_V e_i
\end{align*}
for each $0\le j\le k-1$.
When $j=0$, the statement says $w_{k,0}=v_k$, which is exactly the definition of the initial modified working vector. Now suppose the formula has been proved for some $j-1$, where $1\le j\le k-1$. The modified algorithm computes the next coefficient from the current working vector:
\begin{align*}
(w_{k,j-1},e_j)_V.
\end{align*}
By the induction formula,
\begin{align*}
w_{k,j-1}=v_k-\sum_{i=1}^{j-1}(v_k,e_i)_V e_i.
\end{align*}
Taking the inner product with $e_j$ and using linearity in the first argument gives
\begin{align*}
(w_{k,j-1},e_j)_V=(v_k,e_j)_V-\sum_{i=1}^{j-1}(v_k,e_i)_V(e_i,e_j)_V.
\end{align*}
Because $(e_1,\ldots,e_{k-1})$ is orthonormal, every factor $(e_i,e_j)_V$ with $i<j$ is zero. Therefore
\begin{align*}
(w_{k,j-1},e_j)_V=(v_k,e_j)_V.
\end{align*}
This is the key point: although modified Gram-Schmidt computes the coefficient from the updated vector $w_{k,j-1}$, exact orthogonality of the previously removed directions makes that coefficient equal to the classical coefficient.
Substituting this equality into the modified update,
\begin{align*}
w_{k,j}=w_{k,j-1}-(w_{k,j-1},e_j)_V e_j,
\end{align*}
we obtain
\begin{align*}
w_{k,j}=w_{k,j-1}-(v_k,e_j)_V e_j.
\end{align*}
Finally, replacing $w_{k,j-1}$ by its induction formula gives
\begin{align*}
w_{k,j}=v_k-\sum_{i=1}^{j-1}(v_k,e_i)_V e_i-(v_k,e_j)_V e_j.
\end{align*}
Combining the two displayed subtraction terms into a single finite sum gives
\begin{align*}
w_{k,j}=v_k-\sum_{i=1}^{j}(v_k,e_i)_V e_i.
\end{align*}
This completes the induction over $j$.[/guided]