[step:Reduce to the stationary criterion by showing the initial values are asymptotically negligible][claim:Geometric forgetting of initial values]
There exist an almost surely finite random variable $C > 0$ and a constant $\rho \in (0,1)$, with $C$ and $\rho$ independent of $\theta$, such that
\begin{align*}
\Delta_t := \sup_{\theta \in \Theta} \left| \tilde\sigma_t^2(\theta) - \sigma_t^2(\theta) \right| \le C\, \rho^{\,t} \qquad \text{for all } t \ge 1.
\end{align*}
[/claim]
[proof]
Fix $\theta \in \Theta$ and set $\delta_t(\theta) = \tilde\sigma_t^2(\theta) - \sigma_t^2(\theta)$. Let $t_0 = \max(p,q) + 1$. For $t \ge t_0$ and every $1 \le i \le q$ we have $t - i \ge t_0 - q \ge 1$, so both $\tilde\sigma_t^2(\theta)$ and $\sigma_t^2(\theta)$ use the genuinely observed values $\epsilon_{t-i}^2$; the terms $\omega + \sum_i \alpha_i \epsilon_{t-i}^2$ therefore cancel in the difference, leaving the homogeneous recursion
\begin{align*}
\delta_t(\theta) = \sum_{j=1}^{p} \beta_j\, \delta_{t-j}(\theta) \qquad (t \ge t_0).
\end{align*}
Writing $\Delta_t = \sup_\theta |\delta_t(\theta)|$ and using $\beta_j \ge 0$ with $\sum_j \beta_j \le \bar\beta$,
\begin{align*}
\Delta_t \le \bar\beta \max_{1 \le j \le p} \Delta_{t-j} \qquad (t \ge t_0). \tag{2}
\end{align*}
The initial discrepancies $\Delta_{t_0 - p}, \dots, \Delta_{t_0 - 1}$ are finite linear combinations of the fixed initial values and of finitely many observed $\epsilon_s^2$, hence almost surely finite; set $M_0 = \max_{t_0 - p \le s \le t_0 - 1} \Delta_s < \infty$ a.s. Iterating (2) in blocks of length $p$ gives $\Delta_t \le \bar\beta^{\lfloor (t - t_0)/p \rfloor} M_0$. With $\rho := \bar\beta^{1/p} \in (0,1)$ and a suitable a.s.-finite $C = C(M_0, \bar\beta, t_0, p)$ this yields $\Delta_t \le C \rho^{\,t}$ for all $t \ge 1$.
[/proof]
Using the claim we bound $\sup_\theta |\tilde\ell_t(\theta) - \ell_t(\theta)|$. Since $\tilde\sigma_t^2(\theta), \sigma_t^2(\theta) \ge \underline\omega$, the mean value theorem gives $|\log a - \log b| \le |a-b|/\min(a,b)$ and $|1/a - 1/b| \le |a-b|/\min(a,b)^2$, so
\begin{align*}
\left| \tilde\ell_t(\theta) - \ell_t(\theta) \right| &= \frac{1}{2}\left| \log \tilde\sigma_t^2(\theta) - \log \sigma_t^2(\theta) + \epsilon_t^2\!\left( \frac{1}{\tilde\sigma_t^2(\theta)} - \frac{1}{\sigma_t^2(\theta)} \right) \right| \\
&\le \frac{\Delta_t}{2\underline\omega} + \frac{\epsilon_t^2\, \Delta_t}{2\underline\omega^2} \le C'\, \rho^{\,t}\,(1 + \epsilon_t^2),
\end{align*}
with $C' := \tfrac{C}{2}\max(\underline\omega^{-1}, \underline\omega^{-2})$ a.s. finite and independent of $\theta$. Therefore
\begin{align*}
\sup_{\theta \in \Theta} \left| \tilde L_n(\theta) - L_n(\theta) \right| \le \frac{1}{n}\sum_{t=1}^{n} \sup_{\theta} |\tilde\ell_t(\theta) - \ell_t(\theta)| \le \frac{C'}{n}\sum_{t=1}^{\infty} \rho^{\,t}\,(1 + \epsilon_t^2). \tag{3}
\end{align*}
[claim:The series in (3) converges almost surely]
$\sum_{t \ge 1} \rho^{\,t}(1 + \epsilon_t^2) < \infty$ almost surely.
[/claim]
[proof]
The geometric part $\sum_t \rho^t$ converges. For the second part set $a = -\tfrac{1}{2}\log\rho > 0$. By (A5) and stationarity, $\mathbb{P}(\log^+\epsilon_t^2 > a t) = \mathbb{P}(\log^+\epsilon_0^2 > a t)$, and
\begin{align*}
\sum_{t=1}^{\infty} \mathbb{P}\big( \log^+ \epsilon_0^2 > a t \big) \le \frac{1}{a}\,\mathbb{E}\big[ \log^+ \epsilon_0^2 \big] + 1 < \infty.
\end{align*}
By the [Borel–Cantelli Lemma](/theorems/507), almost surely $\log^+\epsilon_t^2 \le a t$ for all large $t$, hence $\rho^t \epsilon_t^2 \le \rho^t e^{a t} = \rho^{t/2} \to 0$ and $\sum_t \rho^t \epsilon_t^2 < \infty$ a.s.
[/proof]
The right-hand side of (3) is $n^{-1}$ times an a.s.-finite quantity, so
\begin{align*}
\sup_{\theta \in \Theta} \left| \tilde L_n(\theta) - L_n(\theta) \right| \xrightarrow{a.s.} 0 \qquad (n \to \infty). \tag{4}
\end{align*}
It therefore suffices to analyze the stationary criterion $L_n$.[/step]