[step:Take the supremum over finite partitions and recover the countable generator by finite coarsenings]The number $\varepsilon>0$ was arbitrary, so the previous step gives
\begin{align*}
h_\mu(T,\mathcal Q)\le h_\mu(T,\mathcal P)
\end{align*}
for every finite measurable partition $\mathcal Q$ of $X$. Taking the supremum over all finite $\mathcal Q$ yields
\begin{align*}
h_\mu(T)\le h_\mu(T,\mathcal P).
\end{align*}
It remains to justify the reverse inequality for the countable partition $\mathcal P$. Enumerate the atoms of $\mathcal P$ as $(P_i)_{i\ge1}$, allowing empty atoms if necessary, and define the finite coarsening $\mathcal P_m$ by
\begin{align*}
\mathcal P_m:=\{P_1,\dots,P_m,X\setminus\bigcup_{i=1}^{m}P_i\}.
\end{align*}
For each fixed $m$, the partition $\mathcal P_m$ is finite, so
\begin{align*}
h_\mu(T,\mathcal P_m)\le h_\mu(T).
\end{align*}
Since $\mathcal P_m$ is coarser than $\mathcal P$, the chain rule for conditional entropy and monotonicity of conditional entropy give, for every $n\in\mathbb N$,
\begin{align*}
H_\mu(\mathcal P_{[0,n-1]})\le H_\mu((\mathcal P_m)_{[0,n-1]})+H_\mu(\mathcal P_{[0,n-1]}\mid(\mathcal P_m)_{[0,n-1]}).
\end{align*}
The conditional term is bounded by the sum of the one-time conditional entropies:
\begin{align*}
H_\mu(\mathcal P_{[0,n-1]}\mid(\mathcal P_m)_{[0,n-1]})\le\sum_{k=0}^{n-1}H_\mu(T^{-k}\mathcal P\mid T^{-k}\mathcal P_m)=nH_\mu(\mathcal P\mid\mathcal P_m),
\end{align*}
where the equality uses $T$-invariance of $\mu$. Dividing by $n$ and passing to the limit gives
\begin{align*}
h_\mu(T,\mathcal P)\le h_\mu(T,\mathcal P_m)+H_\mu(\mathcal P\mid\mathcal P_m)\le h_\mu(T)+H_\mu(\mathcal P\mid\mathcal P_m).
\end{align*}
Because $\mathcal P_m$ is a coarsening of $\mathcal P$, the chain rule gives
\begin{align*}
H_\mu(\mathcal P\mid\mathcal P_m)=H_\mu(\mathcal P)-H_\mu(\mathcal P_m).
\end{align*}
The finite entropy assumption $H_\mu(\mathcal P)<\infty$ implies $H_\mu(\mathcal P_m)\uparrow H_\mu(\mathcal P)$ as $m\to\infty$, since the partial entropy sums converge to the countable entropy. Hence $H_\mu(\mathcal P\mid\mathcal P_m)\to0$, and letting $m\to\infty$ yields
\begin{align*}
h_\mu(T,\mathcal P)\le h_\mu(T).
\end{align*}
Combining the two inequalities proves
\begin{align*}
h_\mu(T)=h_\mu(T,\mathcal P)=\lim_{n\to\infty}\frac{1}{n}H_\mu(\mathcal P_{[0,n-1]}).
\end{align*}[/step]