[step:Construct an injective intertwining map from the reachable realization]Before defining the comparison map, replace $(\widetilde{A},\widetilde{B},\widetilde{C},\widetilde{D})$, if necessary, by its observable reduction from the previous step. This does not change the transfer function and does not increase the dimension, so it is enough to prove the dimension bound for the reduced realization. Thus we may assume that $(\widetilde{C},\widetilde{A})$ is observable.
Define a linear map first on reachable generators by
\begin{align*}
T(A^kBv) := \widetilde{A}^k\widetilde{B}v
\end{align*}
for every integer $k\geq0$ and every $v\in\mathbb{R}^m$, and extend linearly to $\mathbb{R}^n$, which is possible once well-definedness is proved because $(A,B)$ is reachable.
To prove well-definedness, suppose
\begin{align*}
\sum_{j=1}^{N} A^{k_j}Bv_j = 0,
\end{align*}
where $N\in\mathbb{N}$, each $k_j$ is a non-negative integer, and $v_j\in\mathbb{R}^m$. Define
\begin{align*}
z := \sum_{j=1}^{N}\widetilde{A}^{k_j}\widetilde{B}v_j \in \mathbb{R}^{\ell}.
\end{align*}
For every integer $r\geq 0$, the Markov parameter identities give
\begin{align*}
\widetilde{C}\widetilde{A}^r z=\sum_{j=1}^{N}\widetilde{C}\widetilde{A}^{r+k_j}\widetilde{B}v_j.
\end{align*}
Using the Markov parameter identities once more,
\begin{align*}
\sum_{j=1}^{N}\widetilde{C}\widetilde{A}^{r+k_j}\widetilde{B}v_j=\sum_{j=1}^{N}CA^{r+k_j}Bv_j.
\end{align*}
By linearity of $CA^r$,
\begin{align*}
\sum_{j=1}^{N}CA^{r+k_j}Bv_j=CA^r\left(\sum_{j=1}^{N}A^{k_j}Bv_j\right)=0.
\end{align*}
Thus $z$ lies in the unobservable subspace of $(\widetilde{C},\widetilde{A})$. Since the reduced realization is observable, $z=0$. Hence $T$ is well-defined.
We next prove the intertwining identities. The identity $TB=\widetilde{B}$ follows by taking $k=0$ in the definition of $T$. To prove $TA=\widetilde{A}T$, it is enough by reachability to check it on vectors $A^kBv$ with $k\geq0$. For such a generator, the definition gives
\begin{align*}
TA(A^kBv)=T(A^{k+1}Bv)=\widetilde{A}^{k+1}\widetilde{B}v=\widetilde{A}T(A^kBv).
\end{align*}
This proves $TA=\widetilde{A}T$ on all of $\mathbb{R}^n$. Finally, for every generator $A^kBv$ with $k\geq0$, the Markov parameter identity gives
\begin{align*}
C(A^kBv)=\widetilde{C}\widetilde{A}^k\widetilde{B}v=\widetilde{C}T(A^kBv),
\end{align*}
so $C=\widetilde{C}T$ by reachability.
If $x\in\ker T$, then applying $C=\widetilde{C}T$ and $TA=\widetilde{A}T$ repeatedly gives, for every integer $r\geq0$,
\begin{align*}
CA^r x=\widetilde{C}TA^r x=\widetilde{C}\widetilde{A}^rTx=0.
\end{align*}
Since $(C,A)$ is observable, $x=0$. Hence $T$ is injective, so $n\leq \ell$.[/step]