[guided]We first translate the dynamical expression into an operator expression. Fix $f\in\mathcal B_0$, $g\in L^\infty(X,\mathcal B,\mu)$, and an integer $n\ge 0$. The map
\begin{align*}
T^n:X\to X
\end{align*}
is the $n$-fold iterate of $T$, with $T^0=\operatorname{id}_X$. Because $T$ is measurable, $g\circ T^n$ is measurable. Because $T$ is probability-preserving, composition with $T^n$ does not increase the essential supremum: if $M=\|g\|_{L^\infty(X,\mu)}$, then $|g|\le M$ $\mu$-a.e.; the exceptional set has measure zero, and its preimage under $T^n$ also has measure zero. Hence
\begin{align*}
\|g\circ T^n\|_{L^\infty(X,\mu)}\le \|g\|_{L^\infty(X,\mu)}.
\end{align*}
The defining property of the transfer operator says that for every $u\in\mathcal B_0$ and every $h\in L^\infty(X,\mathcal B,\mu)$,
\begin{align*}
\int_X (\mathcal L u)h\,d\mu(x)=\int_X u(h\circ T)\,d\mu(x).
\end{align*}
We apply this identity repeatedly. For $n=0$, there is nothing to prove because $\mathcal L^0=\operatorname{Id}_{\mathcal B_0}$ and $T^0=\operatorname{id}_X$. Suppose $n\ge 1$. Applying the defining identity with $u=\mathcal L^{n-1}f$ and $h=g$ gives
\begin{align*}
\int_X (\mathcal L^n f)g\,d\mu(x)=\int_X (\mathcal L^{n-1}f)(g\circ T)\,d\mu(x).
\end{align*}
Applying the same identity again with $u=\mathcal L^{n-2}f$ and $h=g\circ T$ gives
\begin{align*}
\int_X (\mathcal L^{n-1}f)(g\circ T)\,d\mu(x)=\int_X (\mathcal L^{n-2}f)(g\circ T^2)\,d\mu(x).
\end{align*}
Iterating this argument $n$ times yields
\begin{align*}
\int_X (\mathcal L^n f)g\,d\mu(x)=\int_X f(g\circ T^n)\,d\mu(x).
\end{align*}
This is the key conversion: decay of correlations will follow once we can estimate $\mathcal L^n f-\Pi f$ in a norm that pairs with $g$.[/guided]