[guided]Fix a finite measurable partition $\mathcal P$ of $X$. We first compare the information recorded by observing $\mathcal P$ along the orbit of $T^k$ with the information recorded by observing $\mathcal P$ along the orbit of $T$.
For $n\in\mathbb N$, the $n$-block join under $T^k$ is
\begin{align*}
\mathcal P_n(T^k)=\bigvee_{j=0}^{n-1}(T^k)^{-j}\mathcal P=\bigvee_{j=0}^{n-1}T^{-kj}\mathcal P.
\end{align*}
Thus this partition records the name of a point only at the times $0,k,2k,\dots,k(n-1)$. By contrast, the $kn$-block join under $T$ is
\begin{align*}
\mathcal P_{kn}(T)=\bigvee_{r=0}^{kn-1}T^{-r}\mathcal P,
\end{align*}
which records the name at every time $0,1,\dots,kn-1$. Knowing the full $kn$-block name determines the sparse $n$-block name, so $\mathcal P_{kn}(T)$ refines $\mathcal P_n(T^k)$.
Entropy of finite partitions is monotone under refinement: if $\mathcal A$ refines $\mathcal B$, then $H_\mu(\mathcal B)\le H_\mu(\mathcal A)$. Applying this with $\mathcal A=\mathcal P_{kn}(T)$ and $\mathcal B=\mathcal P_n(T^k)$ gives
\begin{align*}
H_\mu(\mathcal P_n(T^k))\le H_\mu(\mathcal P_{kn}(T)).
\end{align*}
After dividing by $n$, we rewrite the right-hand side in terms of the $T$-entropy rate:
\begin{align*}
\frac{1}{n}H_\mu(\mathcal P_n(T^k))\le k\,\frac{1}{kn}H_\mu(\mathcal P_{kn}(T)).
\end{align*}
Letting $n\to\infty$, the left-hand side converges to $h_\mu(T^k,\mathcal P)$, while the right-hand side converges to $k\,h_\mu(T,\mathcal P)$. Therefore
\begin{align*}
h_\mu(T^k,\mathcal P)\le k\,h_\mu(T,\mathcal P)\le k\,h_\mu(T).
\end{align*}
Since this holds for every finite measurable partition $\mathcal P$, taking the supremum over $\mathcal P$ gives
\begin{align*}
h_\mu(T^k)\le k\,h_\mu(T).
\end{align*}[/guided]