[step:Introduce independent swaps and bound each fixed labeling by Hoeffding]Let $(\Omega_\varepsilon,\mathcal A_\varepsilon,\mathbb P_\varepsilon)$ be a [probability space](/page/Probability%20Space) carrying independent Rademacher random variables $\varepsilon_1,\dots,\varepsilon_n:\Omega_\varepsilon\to\{-1,1\}$, independent of the original and ghost samples. Here $\mathbb P_\varepsilon$ denotes probability with respect to the Rademacher variables alone. For a fixed trace vector $a=(a_1,\dots,a_{2n})\in\{0,1\}^{2n}$, define the real-valued Rademacher sum $R_a:\Omega_\varepsilon\to\mathbb R$ by
\begin{align*}
R_a:=\frac{1}{n}\sum_{i=1}^{n}\varepsilon_i(a_i-a_{n+i}).
\end{align*}
Since each coefficient $a_i-a_{n+i}$ lies in $\{-1,0,1\}$, the summands $\varepsilon_i(a_i-a_{n+i})$ are independent, mean-zero, and take values in $[-1,1]$. Hoeffding's inequality for bounded independent sums gives
\begin{align*}
\mathbb P_\varepsilon\left(|R_a|>\frac{t}{2}\right)
\le 2\exp\left(-\frac{nt^2}{8}\right).
\end{align*}
For $\varepsilon=(\varepsilon_1,\dots,\varepsilon_n)$ define swapped samples $X_i^\varepsilon,Y_i^\varepsilon:(\Omega\times\Omega'\times\Omega_\varepsilon,\mathcal A\otimes\mathcal A'\otimes\mathcal A_\varepsilon)\to(S,\mathcal S)$ by setting $X_i^\varepsilon=X_i$ and $Y_i^\varepsilon=Y_i$ on $\{\varepsilon_i=1\}$, and $X_i^\varepsilon=Y_i$ and $Y_i^\varepsilon=X_i$ on $\{\varepsilon_i=-1\}$. Because $(X_i,Y_i)$ has product law $P\otimes P$ and is exchangeable under the coordinate swap, and because the pairs are independent, the process
\begin{align*}
\left(\frac{1}{n}\sum_{i=1}^{n}\mathbb{1}_C(X_i)-\frac{1}{n}\sum_{i=1}^{n}\mathbb{1}_C(Y_i)\right)_{C\in\mathcal C}
\end{align*}
has the same law under $\mathbb P\otimes\mathbb P'$ as
\begin{align*}
\left(\frac{1}{n}\sum_{i=1}^{n}\mathbb{1}_C(X_i^\varepsilon)-\frac{1}{n}\sum_{i=1}^{n}\mathbb{1}_C(Y_i^\varepsilon)\right)_{C\in\mathcal C}
\end{align*}
has under $\mathbb P\otimes\mathbb P'\otimes\mathbb P_\varepsilon$.
Conditional on the realised ordered pooled sample $(Z_1,\dots,Z_{2n})$, the swapped difference for a set $C\in\mathcal C$ is
\begin{align*}
\frac{1}{n}\sum_{i=1}^{n}\varepsilon_i(\mathbb{1}_C(Z_i)-\mathbb{1}_C(Z_{n+i})).
\end{align*}
The possible coefficient vectors are exactly the trace vectors in $\mathcal T(Z_1,\dots,Z_{2n})$, whose cardinality is at most $\Pi_{\mathcal C}(2n)$. Applying the preceding Hoeffding bound to each trace vector and taking a union bound over this finite trace set gives, for every realised pooled sample,
\begin{align*}
\mathbb P_\varepsilon\left(\sup_{C\in\mathcal C}\left|\frac{1}{n}\sum_{i=1}^{n}\mathbb{1}_C(X_i^\varepsilon)-\frac{1}{n}\sum_{i=1}^{n}\mathbb{1}_C(Y_i^\varepsilon)\right|>\frac{t}{2}\right)
\le 2\Pi_{\mathcal C}(2n)\exp\left(-\frac{nt^2}{8}\right).
\end{align*}
Integrating over the pooled sample and using the equality in law under independent swaps yields
\begin{align*}
(\mathbb P\otimes\mathbb P')\left(\sup_{C\in\mathcal C}|P_n(C)-P_n'(C)|>\frac{t}{2}\right)
\le 2\Pi_{\mathcal C}(2n)\exp\left(-\frac{nt^2}{8}\right).
\end{align*}[/step]