[step:Bound the Wasserstein distance by total variation using the diameter]
Let $\pi\in\Pi(\nu,\rho)$ be the coupling chosen above. Since $X$ is separable metric, $\mathcal B(X\times X)=\mathcal B(X)\otimes\mathcal B(X)$; hence the continuous function $d:X\times X\to[0,\infty)$ is $\mathcal B(X)\otimes\mathcal B(X)$-measurable. By the definition of the $1$-Wasserstein distance through couplings,
\begin{align*}
W_1(\nu,\rho)\le \int_{X\times X} d(x,y)\,d\pi(x,y).
\end{align*}
For every $(x,y)\in X\times X$, the diameter assumption gives
\begin{align*}
d(x,y)\le D\mathbb{1}_{(X\times X)\setminus\Delta}(x,y),
\end{align*}
because $d(x,x)=0$ on $\Delta$ and $d(x,y)\le D$ off $\Delta$. Integrating this pointwise inequality with respect to $\pi$ gives
\begin{align*}
\int_{X\times X} d(x,y)\,d\pi(x,y)\le D\int_{X\times X}\mathbb{1}_{(X\times X)\setminus\Delta}(x,y)\,d\pi(x,y).
\end{align*}
By the defining property of $\pi$,
\begin{align*}
W_1(\nu,\rho)\le D\pi\bigl((X\times X)\setminus\Delta\bigr)=D\|\nu-\rho\|_{\mathrm{TV}}.
\end{align*}
[/step]