[guided]The purpose of this step is to transfer symbolic separation from the full shift back to $\Lambda$. Let $\rho:\Sigma_k\times\Sigma_k\to[0,\infty)$ be a metric compatible with the product topology. For each symbol $a\in\{1,\dots,k\}$, define the one-symbol cylinder
\begin{align*}
C_a:=\{\omega\in\Sigma_k:\omega_0=a\}.
\end{align*}
Because $\Sigma_k$ is compact and each $C_a$ is closed, each $C_a$ is compact. The cylinders $C_a$ and $C_b$ are disjoint whenever $a\ne b$, so compactness in the [metric space](/page/Metric%20Space) $(\Sigma_k,\rho)$ gives $\operatorname{dist}_\rho(C_a,C_b)>0$. Since there are only finitely many pairs, the number
\begin{align*}
\varepsilon_0:=\frac{1}{2}\min_{a\ne b}\operatorname{dist}_\rho(C_a,C_b)
\end{align*}
is positive. This means that two sequences whose zeroth symbols differ are at least $2\varepsilon_0$ apart in the metric $\rho$.
The map $\pi:\Lambda\to\Sigma_k$ is continuous and $\Lambda$ is compact, so $\pi$ is uniformly continuous. Therefore there exists $\delta>0$ such that for all $x,y\in\Lambda$,
\begin{align*}
d_\Lambda(x,y)<\delta \implies \rho(\pi(x),\pi(y))<\varepsilon_0.
\end{align*}
We will use the contrapositive of this implication: if $\rho(\pi(x),\pi(y))\ge\varepsilon_0$, then $d_\Lambda(x,y)\ge\delta$. The strict separated-set definition requires a strict lower bound, so we set
\begin{align*}
\delta_1:=\frac{\delta}{2}.
\end{align*}
Then any pair with $d_\Lambda(x,y)\ge\delta$ is certainly more than $\delta_1$ apart.
Fix $n\in\mathbb{N}$. For each word $w=(w_0,\dots,w_{n-1})\in\{1,\dots,k\}^n$, choose a sequence $\omega_w\in\Sigma_k$ whose coordinates $0,\dots,n-1$ are $w_0,\dots,w_{n-1}$. This choice is possible both for the one-sided shift indexed by $\mathbb{N}_0$ and for the two-sided shift indexed by $\mathbb{Z}$. Since $\pi$ is surjective, choose $x_w\in\Lambda$ such that $\pi(x_w)=\omega_w$.
Now take two distinct words $w\ne v$. Choose $j\in\{0,\dots,n-1\}$ such that $w_j\ne v_j$. After applying $\sigma^j$, the sequences $\sigma^j(\omega_w)$ and $\sigma^j(\omega_v)$ have different zeroth symbols. Hence they lie in two different one-symbol cylinders, so
\begin{align*}
\rho(\sigma^j(\omega_w),\sigma^j(\omega_v))\ge 2\varepsilon_0>\varepsilon_0.
\end{align*}
The semiconjugacy identity $\pi\circ g=\sigma\circ\pi$ gives, by induction on $j$,
\begin{align*}
\pi(g^j(x_w))=\sigma^j(\pi(x_w))=\sigma^j(\omega_w)
\end{align*}
and likewise $\pi(g^j(x_v))=\sigma^j(\omega_v)$. Therefore
\begin{align*}
\rho(\pi(g^j(x_w)),\pi(g^j(x_v)))=\rho(\sigma^j(\omega_w),\sigma^j(\omega_v))>\varepsilon_0.
\end{align*}
By the contrapositive of uniform continuity, this implies
\begin{align*}
d_\Lambda(g^j(x_w),g^j(x_v))\ge\delta>\delta_1.
\end{align*}
Thus the set
\begin{align*}
E_n:=\{x_w:w\in\{1,\dots,k\}^n\}
\end{align*}
is $(n,\delta_1)$-separated for $g$. Also, $x_w=x_v$ would force $\omega_w=\pi(x_w)=\pi(x_v)=\omega_v$, which is impossible for distinct words because the chosen sequences differ in at least one of the first $n$ coordinates. Hence $|E_n|=k^n$, and so
\begin{align*}
s_\Lambda(n,\delta_1)\ge k^n
\end{align*}
for every $n\in\mathbb{N}$.[/guided]