[guided]The purpose of the nearest-neighbour construction is to turn an estimator of the parameter into a test of which packing point generated the data. The separation assumption is exactly what makes this reduction quantitative.
Fix an outcome $\omega\in\Omega$ and suppose that the estimator is within distance $s$ of the true packing point:
\begin{align*}
d(\hat{\theta}(X(\omega)),\theta_{V(\omega)})<s.
\end{align*}
We compare the distance from $\hat{\theta}(X(\omega))$ to the true packing point with its distance to any other packing point. Let $k\in\{1,\dots,M\}$ with $k\neq V(\omega)$. Since the packing points are $2s$-separated,
\begin{align*}
d(\theta_{V(\omega)},\theta_k)\geq 2s.
\end{align*}
The triangle inequality in the form $d(a,c)\geq d(b,c)-d(a,b)$, applied with $a=\hat{\theta}(X(\omega))$, $b=\theta_{V(\omega)}$, and $c=\theta_k$, gives
\begin{align*}
d(\hat{\theta}(X(\omega)),\theta_k)\geq d(\theta_{V(\omega)},\theta_k)-d(\hat{\theta}(X(\omega)),\theta_{V(\omega)}).
\end{align*}
Combining $d(\theta_{V(\omega)},\theta_k)\geq 2s$ with $d(\hat{\theta}(X(\omega)),\theta_{V(\omega)})<s$ yields
\begin{align*}
d(\hat{\theta}(X(\omega)),\theta_k)>s.
\end{align*}
At the same time, the assumed small-error event says
\begin{align*}
d(\hat{\theta}(X(\omega)),\theta_{V(\omega)})<s.
\end{align*}
Thus every incorrect packing point $\theta_k$ is farther from $\hat{\theta}(X(\omega))$ than the true packing point $\theta_{V(\omega)}$. Consequently the nearest-neighbour rule must select the true index:
\begin{align*}
\hat V(X(\omega))=V(\omega).
\end{align*}
Equivalently, if the induced test makes an error, then the estimator cannot have been within distance $s$ of the true parameter. Hence
\begin{align*}
\{\hat V(X)\neq V\}\subseteq \{d(\hat{\theta}(X),\theta_V)\geq s\}.
\end{align*}
Multiplying the indicator of the testing error by $s$ gives the pointwise lower bound
\begin{align*}
d(\hat{\theta}(X),\theta_V)\geq s\,\mathbb 1_{\{\hat V(X)\neq V\}}.
\end{align*}
This is the central [testing-to-estimation reduction](/theorems/5895): a lower bound on testing error becomes a lower bound on metric estimation risk.[/guided]