[step:Translate strong mixing into zero-mean correlation decay]
For measurable sets $A,B\in\mathcal{B}$, define
\begin{align*}
a_A:=\mathbb{1}_A-\mu(A)\mathbb{1}_X,
\qquad
b_B:=\mathbb{1}_B-\mu(B)\mathbb{1}_X.
\end{align*}
Then $a_A,b_B\in L^2_0(X,\mu)$ and, because $\mu(X)=1$,
\begin{align*}
\langle U_T^n a_A,b_B\rangle
=\mu(T^{-n}A\cap B)-\mu(A)\mu(B).
\end{align*}
Thus strong mixing is equivalent to
\begin{align*}
\langle U_T^n a_A,b_B\rangle\longrightarrow 0
\end{align*}
for all centered indicators.
Finite linear combinations of indicators are dense in $L^2(X,\mu)$, and subtracting the mean preserves density in $L^2_0(X,\mu)$. If $f,g\in L^2_0(X,\mu)$ and $s,t$ are centered simple functions, then the [Cauchy-Schwarz Inequality](/theorems/432) and unitarity of $U_T$ give, for all $n$,
\begin{align*}
|\langle U_T^n f,g\rangle-\langle U_T^n s,t\rangle|
&\leq
|\langle U_T^n(f-s),g\rangle|
+|\langle U_T^n s,g-t\rangle|\\
&\leq
\|f-s\|_2\|g\|_2+\|s\|_2\|g-t\|_2.
\end{align*}
Hence strong mixing is equivalent to
\begin{align*}
\langle U_T^n f,g\rangle\longrightarrow0
\end{align*}
for all $f,g\in L^2_0(X,\mu)$.
[/step]