[step:Derive the $T_1(C)$ transportation inequality from the assumed $T_2(C)$ inequality]
Let $\nu\in\mathcal P(X)$. The set $\Pi(\nu,\rho)$ and the extended distances $W_1(\nu,\rho)$ and $W_2(\nu,\rho)$ are defined in the theorem statement. If every $\pi\in\Pi(\nu,\rho)$ satisfies
\begin{align*}
\int_{X\times X} d(x,y)^2\,d\pi(x,y)=\infty,
\end{align*}
then $W_2(\nu,\rho)=\infty$, and the inequality $W_1(\nu,\rho)\le W_2(\nu,\rho)$ holds in the extended sense. Otherwise choose $\pi\in\Pi(\nu,\rho)$ such that
\begin{align*}
\int_{X\times X} d(x,y)^2\,d\pi(x,y)<\infty.
\end{align*}
Let $u:X\times X\to[0,\infty)$ be the Borel function $u(x,y):=d(x,y)$. Then $u\in L^2(X\times X,\mathcal B(X\times X),\pi)$, and the Cauchy-Schwarz inequality in this $L^2$ space, applied to $u$ and the constant function $1$, gives
\begin{align*}
\int_{X\times X} d(x,y)\,d\pi(x,y)
\le
\left(\int_{X\times X} d(x,y)^2\,d\pi(x,y)\right)^{1/2}.
\end{align*}
Taking the infimum over all $\pi\in\Pi(\nu,\rho)$ gives
\begin{align*}
W_1(\nu,\rho)\le W_2(\nu,\rho).
\end{align*}
Therefore the hypothesis implies, for every $\nu\in\mathcal P(X)$,
\begin{align*}
W_1(\nu,\rho)^2\le W_2(\nu,\rho)^2\le 2C\,H(\nu\mid\rho).
\end{align*}
Thus $\rho$ satisfies $T_1(C)$ with the same normalization.
[/step]