[step:Simulate an arbitrary polynomial-time computation from $\mathcal{M}_1$ inside $\mathcal{M}_2$]Let $L \in \mathrm{P}_{\mathcal{M}_1}$. By definition of $\mathrm{P}_{\mathcal{M}_1}$, there is a deterministic machine $A$ in $\mathcal{M}_1$ and a polynomial
\begin{align*}
p_A: \mathbb{N} \to \mathbb{N}
\end{align*}
such that $A$ decides $L$ and, on every input $x$ of length $n$, halts within $p_A(n)$ steps.
By the configuration-size hypothesis applied to $A$, there is a polynomial
\begin{align*}
q_A: \mathbb{N} \to \mathbb{N}
\end{align*}
such that every configuration of $A$ reachable within $t$ steps from an input of length $n$ has an encoding of length at most $q_A(n+t)$. By the initial-encoding hypothesis with target model $\mathcal{M}_2$, there is an $\mathcal{M}_2$ machine $I_{2,A}$ and a polynomial
\begin{align*}
e_{2,A}: \mathbb{N} \to \mathbb{N}
\end{align*}
such that, on input $x$, the machine $I_{2,A}$ writes the encoding of the initial configuration of $A$ on $x$ in time at most $e_{2,A}(|x|)$. By the status-recognition hypothesis with target model $\mathcal{M}_2$, there is an $\mathcal{M}_2$ machine $H_{2,A}$ and a polynomial
\begin{align*}
h_{2,A}: \mathbb{N} \to \mathbb{N}
\end{align*}
such that, on an encoded configuration $c$ of $A$, the machine $H_{2,A}$ determines whether $c$ is nonhalting, accepting, or rejecting in time at most $h_{2,A}(|c|)$. By the one-step simulation hypothesis with $(i,j)=(1,2)$, there is a simulator machine $S_{2,A}$ in $\mathcal{M}_2$ and a polynomial
\begin{align*}
r_{2,A}: \mathbb{N} \to \mathbb{N}
\end{align*}
such that one transition of $A$ from an encoded configuration $c$ is simulated by $S_{2,A}$ in time at most $r_{2,A}(|c|)$.
Construct an $\mathcal{M}_2$ machine $B$ as follows. On input $x$, the machine $B$ runs $I_{2,A}$ to obtain the encoding of the initial configuration of $A$ on $x$. It then applies $H_{2,A}$ to the current encoded configuration. If the result is accepting, then $B$ accepts; if the result is rejecting, then $B$ rejects; if the result is nonhalting, then $B$ applies $S_{2,A}$ once and repeats the status test on the new encoded configuration. Since $S_{2,A}$ simulates the actual transition relation of $A$, and since $H_{2,A}$ correctly recognizes accepting and rejecting configurations, $B$ accepts exactly when $A$ accepts and rejects exactly when $A$ rejects. Therefore $B$ decides $L$.[/step]