[step:Condition the ARMA identity at index $t+h$ and collapse each term]Fix $h \ge 1$. The defining ARMA identity, written at time index $t + h$, reads
\begin{align*}
Y_{t+h} = \sum_{i=1}^p \phi_i\, Y_{t+h-i} + Z_{t+h} + \sum_{j=1}^q \theta_j\, Z_{t+h-j}.
\end{align*}
Every random variable here lies in $L^2 \subseteq L^1$, so we may apply $P_t$ and use linearity **(L)**:
\begin{align*}
\hat{Y}_t(h) = P_t Y_{t+h} = \sum_{i=1}^p \phi_i\, P_t Y_{t+h-i} + P_t Z_{t+h} + \sum_{j=1}^q \theta_j\, P_t Z_{t+h-j}.
\end{align*}
We evaluate each term using the facts from the previous step.
*Autoregressive terms $P_t Y_{t+h-i}$ for $1 \le i \le p$.* If $h - i \ge 1$, then $Y_{t+(h-i)}$ is a future value and $P_t Y_{t+(h-i)} = \hat{Y}_t(h-i)$ by definition of the forecast. If $h - i \le 0$, then $t + (h-i) \le t$, so $Y_{t+h-i}$ is $\mathcal{F}_t$-measurable and $P_t Y_{t+h-i} = Y_{t+h-i}$ by **(M)**; under the convention $\hat{Y}_t(r) = Y_{t+r}$ for $r \le 0$ this again equals $\hat{Y}_t(h-i)$. In both cases
\begin{align*}
P_t Y_{t+h-i} = \hat{Y}_t(h - i).
\end{align*}
*Leading innovation $P_t Z_{t+h}$.* Since $h \ge 1$, $Z_{t+h}$ is independent of $\mathcal{F}_t$ with $\mathbb{E}[Z_{t+h}] = 0$, so $P_t Z_{t+h} = 0$ by **(I)**. With the convention $\hat{Z}_t(h) = 0$ for $h \ge 1$ this is $\hat{Z}_t(h)$.
*Moving-average terms $P_t Z_{t+h-j}$ for $1 \le j \le q$.* If $h - j \ge 1$, then $Z_{t+(h-j)}$ is a future innovation, independent of $\mathcal{F}_t$ with mean $0$, so $P_t Z_{t+h-j} = 0$ by **(I)**, which equals $\hat{Z}_t(h-j)$. If $h - j \le 0$, then $t + (h-j) \le t$, so $Z_{t+h-j}$ is $\mathcal{F}_t$-measurable and $P_t Z_{t+h-j} = Z_{t+h-j} = \hat{Z}_t(h-j)$ by **(M)**. In both cases
\begin{align*}
P_t Z_{t+h-j} = \hat{Z}_t(h - j).
\end{align*}
Substituting these three evaluations gives, for every $h \ge 1$,
\begin{align*}
\hat{Y}_t(h) = \sum_{i=1}^p \phi_i\, \hat{Y}_t(h - i) + \hat{Z}_t(h) + \sum_{j=1}^q \theta_j\, \hat{Z}_t(h - j),
\end{align*}
which is the asserted recursion.[/step]