[step:Define the empirical process and its finite-dimensional marginals]
Let $(\Omega,\mathcal G,\mathbb P)$ be the probability space on which the sample is defined. For each $n \in \mathbb N$, let $X_1,\dots,X_n$ be independent measurable maps $X_i:(\Omega,\mathcal G)\to(\mathcal X,\mathcal A)$ with common law $P$. For every $P$-integrable measurable function $h:\mathcal X\to\mathbb R$, write
\begin{align*}
P h=\int_{\mathcal X} h(x)\,dP(x).
\end{align*}
Define the empirical process map $\alpha_n:\mathcal F\to\mathbb R$ by
\begin{align*}
\alpha_n(f)=\frac{1}{\sqrt n}\sum_{i=1}^n \bigl(f(X_i)-P f\bigr),\qquad f\in\mathcal F.
\end{align*}
Since $|f|\leq F_e$ and $P F_e^2<\infty$, every $f\in\mathcal F$ belongs to $L^2(P)$ and hence $P|f|<\infty$ by Cauchy-Schwarz. For fixed $f_1,\dots,f_k\in\mathcal F$, define the random vector $Z_i:\Omega\to\mathbb R^k$ by
\begin{align*}
Z_i(\omega)=\bigl(f_1(X_i(\omega))-P f_1,\dots,f_k(X_i(\omega))-P f_k\bigr),\qquad \omega\in\Omega.
\end{align*}
The vectors $Z_1,\dots,Z_n$ are independent and identically distributed because $X_1,\dots,X_n$ are independent and identically distributed. They have mean zero by the definition of $P f_j$, and they have finite second moments since $|f_j|\leq F_e$ and $P F_e^2<\infty$ for each $j\in\{1,\dots,k\}$. To prove vector convergence, fix an arbitrary vector $a=(a_1,\dots,a_k)\in\mathbb R^k$ and define the real-valued [random variable](/page/Random%20Variable) $Y_i^a:\Omega\to\mathbb R$ by
\begin{align*}
Y_i^a(\omega)=\sum_{j=1}^k a_j\bigl(f_j(X_i(\omega))-P f_j\bigr),\qquad \omega\in\Omega.
\end{align*}
The variables $Y_1^a,Y_2^a,\dots$ are independent and identically distributed, have mean zero, and have finite variance because
\begin{align*}
|Y_i^a|^2\leq \left(\sum_{j=1}^k |a_j|\,|f_j(X_i)-P f_j|\right)^2
\leq k\sum_{j=1}^k |a_j|^2 |f_j(X_i)-P f_j|^2
\end{align*}
and each summand has finite expectation. Applying the [Central Limit Theorem](/theorems/532) to $(Y_i^a)_{i\geq1}$ gives
\begin{align*}
\sum_{j=1}^k a_j\alpha_n(f_j)=\frac{1}{\sqrt n}\sum_{i=1}^n Y_i^a
\xrightarrow{d}
\mathcal N(0,a^\top\Sigma a),
\end{align*}
where the matrix $\Sigma\in\mathbb R^{k\times k}$ is defined by
\begin{align*}
\Sigma_{ij}=P(f_i f_j)-P f_i\,P f_j.
\end{align*}
By the Cramer-Wold characterization of convergence in distribution in finite-dimensional Euclidean spaces, this proves
\begin{align*}
\bigl(\alpha_n(f_1),\dots,\alpha_n(f_k)\bigr)
\xrightarrow{d}
\mathcal N_k(0,\Sigma).
\end{align*}
Thus the finite-dimensional distributions converge to those of a centred Gaussian random map with the covariance stated in the theorem.
[/step]