[step:Lower bound: an $\varepsilon$-covering must distinguish $2\varepsilon$-separated points]We prove $\mathcal{P}(A, 2\varepsilon) \le \mathcal{N}(A, \varepsilon)$.
Let $\{x_1, \ldots, x_N\} \subset M$ be a finite $\varepsilon$-net for $A$, so $A \subset \bigcup_{i=1}^N B(x_i, \varepsilon)$ and $N \ge \mathcal{N}(A, \varepsilon)$. Let $S = \{s_1, \ldots, s_P\} \subset A$ be a $2\varepsilon$-separated subset, so $d(s_j, s_k) \ge 2\varepsilon$ for all $j \neq k$.
For each $s_j \in S$, since $s_j \in A \subset \bigcup_{i=1}^N B(x_i, \varepsilon)$, there exists $i(j) \in \{1, \ldots, N\}$ with $d(s_j, x_{i(j)}) < \varepsilon$. We claim the map $j \mapsto i(j)$ is injective. Suppose $i(j) = i(k)$ for $j \neq k$. Then
\begin{align*}
d(s_j, s_k) \le d(s_j, x_{i(j)}) + d(x_{i(j)}, s_k) = d(s_j, x_{i(j)}) + d(x_{i(k)}, s_k) < \varepsilon + \varepsilon = 2\varepsilon,
\end{align*}
contradicting $d(s_j, s_k) \ge 2\varepsilon$. Therefore the map is injective, and $P = |S| \le N$.
Since this holds for every $\varepsilon$-net of size $N$ and every $2\varepsilon$-separated set of size $P$, taking $N = \mathcal{N}(A, \varepsilon)$ (a minimum-size $\varepsilon$-net) and $P = \mathcal{P}(A, 2\varepsilon)$ (a maximum-size $2\varepsilon$-separated set) gives $\mathcal{P}(A, 2\varepsilon) \le \mathcal{N}(A, \varepsilon)$.[/step]