[step:Apply the VC maximal inequality at bandwidth $h$]We use the [VC-type maximal inequality for bounded empirical processes, van der Vaart and Wellner Theorem 2.14.1](external:van-der-vaart-wellner-1996-theorem-2-14-1) for pointwise measurable VC-subgraph classes. By the pointwise measurability assumption on $\mathcal K$, the fixed-bandwidth subclass $\mathcal F_h\subset h^{-d}\mathcal K$ is pointwise measurable: for this $h$ there is a countable subclass $\mathcal F_{h,0}\subset\mathcal F_h$ such that every $g\in\mathcal F_h$ is the pointwise limit of a sequence in $\mathcal F_{h,0}$. Since every member of $\mathcal F_h$ is bounded by the integrable envelope $F_h$ under $P$, dominated convergence gives $(Pg_{k})\to Pg$ along such an approximating sequence $(g_k)_{k=1}^\infty$; at the finitely many sample points, pointwise convergence also gives $(P_ng_k)\to P_ng$. Hence the supremum over $\mathcal F_h$ agrees with the supremum over the countable class $\mathcal F_{h,0}$ and is measurable. Since $\mathcal K$ is VC-subgraph with bounded envelope $M$, rescaling by the positive scalar $h^{-d}$ and restricting the translation parameter to $x\in A$ preserve the VC-subgraph entropy exponents. Thus there exist constants $a\ge e$ and $v\ge 1$, depending only on the VC characteristics of the original translated-dilated kernel class, and a universal constant $C_0>0$ from the maximal inequality, such that $\mathcal F_h$ has envelope $Mh^{-d}$ and for every $h\in(0,1)$,
\begin{align*}
\mathbb E\left[\sup_{g\in\mathcal F_h}|(P_n-P)g|\right]
\le
C_0\left(
\sqrt{\frac{\sigma_h^2}{n}\log\left(\frac{aMh^{-d}}{\sigma_h}\right)}
+
\frac{Mh^{-d}}{n}\log\left(\frac{aMh^{-d}}{\sigma_h}\right)
\right).
\end{align*}
The hypotheses required for this standard empirical-process inequality are satisfied: $\mathcal F_h$ is pointwise measurable, it is a subclass of the rescaled VC-subgraph class $h^{-d}\mathcal K$, its envelope is $Mh^{-d}$, its VC entropy constants are the fixed constants $a$ and $v$ above, and its variance is bounded by $\sigma_h^2$.
Since $\sigma_h=\|f\|_{\infty,\mathbb R^d}^{1/2}L_Kh^{-d/2}$, we have
\begin{align*}
\frac{aMh^{-d}}{\sigma_h}=\frac{aM}{\|f\|_{\infty,\mathbb R^d}^{1/2}L_K}\,h^{-d/2}.
\end{align*}
Therefore there is a constant $C_1=C_1(C_0,d,K,f,a,v)>0$ such that, for all sufficiently small $h$,
\begin{align*}
\mathbb E\left[\sup_{g\in\mathcal F_h}|(P_n-P)g|\right]
\le
C_1\left(
\sqrt{\frac{\log(1/h)}{nh^d}}
+
\frac{\log(1/h)}{nh^d}
\right).
\end{align*}[/step]