[step:Compress every admissible pair into one Lipschitz potential]
Let $\varphi,\psi:X\to\mathbb R$ be an admissible pair in the Kantorovich dual problem, so that
\begin{align*}
\varphi(y)-\psi(x)\le d(x,y)
\end{align*}
for all $x,y\in X$. Define the envelope function
\begin{align*}
f:X\to\mathbb R,\qquad f(y):=\inf_{x\in X}\{\psi(x)+d(x,y)\}.
\end{align*}
For every $y\in X$, the admissibility inequality gives $\varphi(y)\le \psi(x)+d(x,y)$ for every $x\in X$, hence
\begin{align*}
\varphi(y)\le f(y).
\end{align*}
Taking $x=y$ in the definition of $f$ gives
\begin{align*}
f(y)\le \psi(y).
\end{align*}
We claim that $f$ is $1$-Lipschitz. Let $y,z\in X$. For every $x\in X$, the triangle inequality gives
\begin{align*}
\psi(x)+d(x,y)\le \psi(x)+d(x,z)+d(z,y).
\end{align*}
Taking the infimum over $x\in X$ gives
\begin{align*}
f(y)\le f(z)+d(y,z).
\end{align*}
Interchanging $y$ and $z$ gives
\begin{align*}
f(z)\le f(y)+d(y,z).
\end{align*}
Therefore $|f(y)-f(z)|\le d(y,z)$, so $f$ is $1$-Lipschitz.
Since $\varphi\le f\le \psi$, we obtain
\begin{align*}
\int_X \varphi\,d\nu-\int_X \psi\,d\mu \le \int_X f\,d\nu-\int_X f\,d\mu.
\end{align*}
Thus every admissible two-potential value is bounded above by a value achieved by a single $1$-Lipschitz potential. Consequently,
\begin{align*}
W_1(\mu,\nu)\le \sup\left\{\int_X f\,d\nu-\int_X f\,d\mu : f:X\to\mathbb R \text{ is }1\text{-Lipschitz}\right\}.
\end{align*}
[/step]