[step:Compare the dyadic chaining sum with the entropy integral]Combining the preceding two steps gives
\begin{align*}
\mathbb E_\varepsilon\left[\sup_{f\in\mathcal F}|R_n(f)-R_n(f_*)|\right]
\le
6\sum_{k=1}^{\infty}\delta_k\sqrt{\log(2N(\mathcal F,d_n,\delta_k))}.
\end{align*}
We first record a diameter consequence. If $0<r<D/2$, then $N(\mathcal F,d_n,r)\ge2$. Indeed, if $N(\mathcal F,d_n,r)=1$, then there exists $a\in\mathcal F$ such that $d_n(f,a)\le r$ for every $f\in\mathcal F$, and the triangle inequality for $d_n$ gives $d_n(f,g)\le2r<D$ for every $f,g\in\mathcal F$, contradicting the definition of $D$ as the supremum of all such distances.
For every $k\ge1$ and every $\varepsilon\in(\delta_{k+1},\delta_k)$, we have $\varepsilon<D/2$, so
\begin{align*}
N(\mathcal F,d_n,\varepsilon)\ge2.
\end{align*}
Since covering numbers are nonincreasing as the radius increases and $\varepsilon\le\delta_k$, we also have
\begin{align*}
N(\mathcal F,d_n,\delta_k)\le N(\mathcal F,d_n,\varepsilon).
\end{align*}
Therefore, for $\mathcal L^1$-a.e. $\varepsilon\in(\delta_{k+1},\delta_k)$,
\begin{align*}
\log(2N(\mathcal F,d_n,\delta_k))
\le
\log(2N(\mathcal F,d_n,\varepsilon))
\le
2\log N(\mathcal F,d_n,\varepsilon).
\end{align*}
Using $\delta_k=2(\delta_k-\delta_{k+1})$, we obtain
\begin{align*}
\delta_k\sqrt{\log(2N(\mathcal F,d_n,\delta_k))}
\le
2\sqrt2\int_{\delta_{k+1}}^{\delta_k}\sqrt{\log N(\mathcal F,d_n,\varepsilon)}\,d\mathcal L^1(\varepsilon).
\end{align*}
Summing over $k\ge1$ and using that the intervals $(\delta_{k+1},\delta_k)$ are disjoint subsets of $(0,D)$ gives
\begin{align*}
\sum_{k=1}^{\infty}\delta_k\sqrt{\log(2N(\mathcal F,d_n,\delta_k))}
\le
2\sqrt2\int_0^D\sqrt{\log N(\mathcal F,d_n,\varepsilon)}\,d\mathcal L^1(\varepsilon).
\end{align*}
Thus we may take
\begin{align*}
C_1:=2\sqrt2.
\end{align*}
Consequently
\begin{align*}
\mathbb E_\varepsilon\left[\sup_{f\in\mathcal F}|R_n(f)-R_n(f_*)|\right]
\le
6C_1\int_0^D\sqrt{\log N(\mathcal F,d_n,\varepsilon)}\,d\mathcal L^1(\varepsilon).
\end{align*}[/step]