[guided]The lemma is the analytic backbone of both directions of the equivalence. It asserts that even though $\mathcal{H}_\phi|_\mathcal{K}$ is *a priori* larger than $\mathcal{V}$, the two have the same uniform closure inside $C(\mathcal{K})$. We prove it in three movements: (a) write $f \in \mathcal{H}_\phi|_\mathcal{K}$ as the restriction of an element $g$ of the abstract RKHS; (b) approximate $g$ in $\mathcal{H}_\phi$-norm by a finite kernel-section combination $g_n$; (c) convert RKHS-norm closeness to uniform closeness on $\mathcal{K}$ via a pointwise evaluation bound that comes from the reproducing property.
Fix $f \in \mathcal{H}_\phi|_\mathcal{K}$ and $\varepsilon > 0$. By definition of the restriction, there exists $g \in \mathcal{H}_\phi$ with $f = g|_\mathcal{K}$.
**(a) Density of kernel-section combinations in $\mathcal{H}_\phi$.** By the [definition of the RKHS as the closure of the linear span of kernel sections in the $\mathcal{H}_\phi$-norm](/theorems/???), for every $\delta > 0$ there exist $n \in \mathbb{N}$, points $\gamma_1, \ldots, \gamma_n \in \mathcal{C}_p$, and coefficients $c_1, \ldots, c_n \in \mathbb{R}$ with
\begin{align*}
g_n := \sum_{i=1}^n c_i k_\phi(\gamma_i, \cdot) \in \mathcal{H}_\phi, \qquad \|g - g_n\|_{\mathcal{H}_\phi} < \delta.
\end{align*}
We will fix $\delta$ at the end of the argument once we know how RKHS-norm error converts to uniform-norm error.
**(b) Pointwise evaluation bound from the reproducing property.** For any $\gamma \in \mathcal{K}$ the [reproducing property](/theorems/???) gives $g(\gamma) - g_n(\gamma) = \langle g - g_n, k_\phi(\gamma, \cdot)\rangle_{\mathcal{H}_\phi}$. Applying Cauchy--Schwarz in the Hilbert space $\mathcal{H}_\phi$,
\begin{align*}
|g(\gamma) - g_n(\gamma)| \leq \|g - g_n\|_{\mathcal{H}_\phi} \cdot \|k_\phi(\gamma, \cdot)\|_{\mathcal{H}_\phi}.
\end{align*}
The reproducing property applied to $k_\phi(\gamma, \cdot)$ itself yields $\|k_\phi(\gamma, \cdot)\|_{\mathcal{H}_\phi}^2 = \langle k_\phi(\gamma, \cdot), k_\phi(\gamma, \cdot)\rangle_{\mathcal{H}_\phi} = k_\phi(\gamma, \gamma)$, so
\begin{align*}
|g(\gamma) - g_n(\gamma)| \leq \|g - g_n\|_{\mathcal{H}_\phi} \cdot \sqrt{k_\phi(\gamma, \gamma)}.
\end{align*}
This says: small in RKHS-norm forces pointwise smallness, with the conversion factor $\sqrt{k_\phi(\gamma, \gamma)}$ depending on the evaluation point $\gamma$.
**(c) Uniform bound via compactness.** To upgrade the pointwise bound to a uniform bound on $\mathcal{K}$ we control $\sqrt{k_\phi(\gamma, \gamma)}$ uniformly over $\gamma \in \mathcal{K}$. The map
\begin{align*}
\gamma \mapsto k_\phi(\gamma, \gamma) = \langle S(\gamma), S(\gamma)\rangle_\phi = \|S(\gamma)\|_\phi^2
\end{align*}
is continuous on $\mathcal{C}_p$ because $S : \mathcal{C}_p \to T_\phi((V))$ is continuous (a hypothesis of the theorem) and the squared norm $\|\cdot\|_\phi^2 : T_\phi((V)) \to \mathbb{R}$ is continuous on the Hilbert space $T_\phi((V))$. Composition of continuous maps is continuous, and $x \mapsto \sqrt{x}$ is continuous on $[0, \infty)$, so $\gamma \mapsto \sqrt{k_\phi(\gamma, \gamma)}$ is continuous. By the [extreme value theorem](/theorems/???) applied to this continuous function on the compact set $\mathcal{K}$,
\begin{align*}
M_\mathcal{K} := \sup_{\gamma \in \mathcal{K}} \sqrt{k_\phi(\gamma, \gamma)} < \infty.
\end{align*}
Taking the supremum of the bound from (b) over $\gamma \in \mathcal{K}$,
\begin{align*}
\sup_{\gamma \in \mathcal{K}} |g(\gamma) - g_n(\gamma)| \leq M_\mathcal{K} \cdot \|g - g_n\|_{\mathcal{H}_\phi}.
\end{align*}
**Closing.** If $M_\mathcal{K} > 0$, choose $\delta := \varepsilon / M_\mathcal{K}$ in (a) so that the right-hand side is at most $\varepsilon$. If $M_\mathcal{K} = 0$, then $k_\phi(\gamma, \gamma) = 0$ for all $\gamma \in \mathcal{K}$; the pointwise bound gives $|f(\gamma)| = |g(\gamma)| \le \sqrt{k_\phi(\gamma,\gamma)}\|g\|_{\mathcal{H}_\phi} = 0$, so $\mathcal{H}_\phi|_\mathcal{K} = \{0\}$ and the constant $0 \in \mathcal{V}$ already approximates $f$ exactly. In the non-degenerate case, set $v := g_n|_\mathcal{K} \in \mathcal{V}$; then $\|f - v\|_\infty = \sup_{\gamma \in \mathcal{K}}|g(\gamma) - g_n(\gamma)| \le M_\mathcal{K} \cdot \delta = \varepsilon$. Since $f \in \mathcal{H}_\phi|_\mathcal{K}$ and $\varepsilon > 0$ were arbitrary, $\mathcal{V}$ is uniformly dense in $\mathcal{H}_\phi|_\mathcal{K}$.
Why does the centre $\gamma_i$ in the kernel section need to range over all of $\mathcal{C}_p$ in step (a) and not just $\mathcal{K}$? The defining density statement for the RKHS uses $\operatorname{span}\{k_\phi(\gamma, \cdot) : \gamma \in \mathcal{C}_p\}$, but we only ever evaluate $g_n$ on $\mathcal{K}$, so the *restriction* $g_n|_\mathcal{K}$ is what we put into $\mathcal{V}$. The definition of $\mathcal{V}$ in the theorem statement requires centres in $\mathcal{K}$, but our restriction sends a kernel section centred at $\gamma_i \in \mathcal{C}_p$ to a function on $\mathcal{K}$, which agrees pointwise on $\mathcal{K}$ with $k_\phi(\gamma_i, \cdot)|_\mathcal{K}$. If centres outside $\mathcal{K}$ are required, an equivalent argument shows that linear combinations of kernel sections centred in $\mathcal{K}$ are also dense, since the RKHS topology and pointwise evaluation are intertwined by the reproducing property — but for our purposes the key fact is the uniform-density conclusion on $\mathcal{K}$.[/guided]