[guided]The point of the hypercube prior is that it creates many independent scalar estimation problems inside the ball. Recall the construction: $m \in \{1,\dots,d\}$ and $a \in [0,\sigma]$ were chosen so that $ma^2\leq R^2$ and $ma^2\geq \frac{1}{2}\min\{R^2,d\sigma^2\}$, and the random parameter is
\begin{align*}
\Theta=(a\varepsilon_1,\dots,a\varepsilon_m,0,\dots,0),
\end{align*}
where $\varepsilon_1,\dots,\varepsilon_m$ are independent Rademacher random variables. Let $\hat{\theta}:\mathbb{R}^d\to\mathbb{R}^d$ be arbitrary, and write its coordinate functions as $\hat{\theta}_1,\dots,\hat{\theta}_d:\mathbb{R}^d\to\mathbb{R}$, so that $\hat{\theta}(x)=(\hat{\theta}_1(x),\dots,\hat{\theta}_d(x))$ for $x\in\mathbb{R}^d$. Under this prior, the observation has the form
\begin{align*}
X=\Theta+\sigma Z,
\end{align*}
where $Z=(Z_1,\dots,Z_d)\sim \mathcal{N}(0,I_d)$ is independent of $\Theta$. Expanding the squared Euclidean norm coordinate by coordinate gives
\begin{align*}
\mathbb{E}[|\hat{\theta}(X)-\Theta|^2]
=
\sum_{i=1}^d \mathbb{E}[(\hat{\theta}_i(X)-\Theta_i)^2].
\end{align*}
The inactive coordinates only add non-negative terms, so
\begin{align*}
\mathbb{E}[|\hat{\theta}(X)-\Theta|^2]
\geq
\sum_{i=1}^m \mathbb{E}[(\hat{\theta}_i(X)-a\varepsilon_i)^2].
\end{align*}
Now fix an active coordinate $i$. The estimator $\hat{\theta}_i(X)$ is allowed to depend on all coordinates of $X$, not just $X_i$, so we must justify why the scalar lower bound still applies. Define $X_{-i}$ to be the vector obtained from $X$ by deleting its $i$th coordinate, and let $\mathbb{P}_{X_{-i}}$ denote the law of $X_{-i}$ on $\mathbb{R}^{d-1}$. Because the prior signs $\varepsilon_1,\dots,\varepsilon_m$ are independent and the Gaussian noises $Z_1,\dots,Z_d$ are independent, the random vector $X_{-i}$ is independent of $\varepsilon_i$. Hence, after conditioning on $X_{-i}=z$, the only remaining information about $\varepsilon_i$ is contained in
\begin{align*}
X_i=a\varepsilon_i+\sigma Z_i.
\end{align*}
Because $X_{-i}$ takes values in a Euclidean space, regular conditional distributions exist. For $\mathbb{P}_{X_{-i}}$-almost every fixed value $z$, define the map $T_z:\mathbb{R}\to\mathbb{R}$ by $T_z(x_i)=\hat{\theta}_i(x_i,z)$ for $x_i\in\mathbb{R}$. This is a scalar estimator of $a\varepsilon_i$ from the one-dimensional Gaussian observation $X_i$. The scalar two-point bound from the previous step applies because its hypotheses are satisfied: $\varepsilon_i$ is Rademacher, $\sigma Z_i\sim\mathcal{N}(0,\sigma^2)$ is independent of $\varepsilon_i$, and the amplitude satisfies $0\leq a\leq\sigma$. Hence
\begin{align*}
\mathbb{E}[(T_z(X_i)-a\varepsilon_i)^2\mid X_{-i}=z]
\geq
a^2\Phi(-1)
\end{align*}
for $\mathbb{P}_{X_{-i}}$-almost every $z$. Equivalently,
\begin{align*}
\mathbb{E}[(\hat{\theta}_i(X)-a\varepsilon_i)^2\mid X_{-i}=z]
\geq
a^2\Phi(-1).
\end{align*}
Integrating this conditional inequality over the distribution of $X_{-i}$ gives
\begin{align*}
\mathbb{E}[(\hat{\theta}_i(X)-a\varepsilon_i)^2]
\geq
a^2\Phi(-1).
\end{align*}
Because this holds for every active coordinate $i=1,\dots,m$, summation gives
\begin{align*}
\mathbb{E}[|\hat{\theta}(X)-\Theta|^2]
\geq
m a^2\Phi(-1).
\end{align*}
The construction ensured $ma^2\geq \frac{1}{2}\min\{R^2,d\sigma^2\}$, so
\begin{align*}
\mathbb{E}[|\hat{\theta}(X)-\Theta|^2]
\geq
\frac{\Phi(-1)}{2}\min\{R^2,d\sigma^2\}.
\end{align*}[/guided]