[guided]The previous step reduced the recovery error to a stochastic supremum over the cone. We now estimate that supremum.
Define
\begin{align*}
K_S := \mathcal{C}(S)\cap \{v\in\mathbb{R}^d: |v|=1\}.
\end{align*}
This is the set of unit directions in which the error can point. If $\sigma=0$, then $\varepsilon=0$ almost surely, so the stochastic term is zero and the final recovery estimate follows immediately from the deterministic reduction. Thus the only nontrivial case is $\sigma>0$. In that case define the map $g:(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}^n$ by
\begin{align*}
g(\omega):=\sigma^{-1}\varepsilon(\omega).
\end{align*}
Because $\varepsilon\sim\mathcal{N}(0,\sigma^2 I_n)$, this $g$ is a standard Gaussian random vector and $\varepsilon=\sigma g$ on the same probability space. Define
\begin{align*}
Z_S := \sup_{v\in K_S} g\cdot Av.
\end{align*}
Before using expectations or concentration, we check that this supremum is a legitimate [random variable](/page/Random%20Variable) with finite expectation. The cone $\mathcal{C}(S)$ is closed because it is defined by the continuous inequality $\|v_{S^c}\|_1\leq\|v_S\|_1$, and therefore $K_S$ is a closed subset of the Euclidean unit sphere in $\mathbb{R}^d$. Hence $K_S$ is compact. For each fixed $w\in\mathbb{R}^n$, the map $v\mapsto w\cdot Av$ is continuous on $K_S$, so the supremum is finite and attained. With $\Phi_S(w):=\sup_{v\in K_S}w\cdot Av$, this gives $Z_S=\Phi_S(g)$, and the Lipschitz estimate proved below implies that $\Phi_S$ is continuous, hence Borel measurable. Finally, since $|Av|\leq\Lambda$ for $v\in K_S$, we have $Z_S\leq\Lambda |g|$, and $\mathbb{E}[|g|]<\infty$ for a standard Gaussian vector in $\mathbb{R}^n$. Thus $\mathbb{E}[Z_S]$ is finite.
The Gaussian complexity hypothesis from the theorem, recorded in the first step, applies because $S$ has cardinality at most $k$ and $K_S$ is the corresponding unit cone section. It gives
\begin{align*}
\mathbb{E}[Z_S]\leq \Gamma\sqrt{k\log\left(\frac{ed}{k}\right)}.
\end{align*}
To pass from an expectation estimate to a high-probability estimate, we verify the Lipschitz constant of the supremum functional. Define
\begin{align*}
\Phi_S:\mathbb{R}^n\to\mathbb{R}
\end{align*}
by
\begin{align*}
\Phi_S(w):=\sup_{v\in K_S} w\cdot Av.
\end{align*}
For two vectors $w_1,w_2\in\mathbb{R}^n$,
\begin{align*}
|\Phi_S(w_1)-\Phi_S(w_2)|
\leq
\sup_{v\in K_S}|(w_1-w_2)\cdot Av|.
\end{align*}
The [Cauchy-Schwarz inequality](/theorems/432) in the Euclidean space $\mathbb{R}^n$ gives
\begin{align*}
|(w_1-w_2)\cdot Av|\leq |w_1-w_2|\,|Av|.
\end{align*}
Since $v\in K_S\subset \mathcal{C}(S)$ and $|v|=1$, the upper restricted isometry bound gives
\begin{align*}
|Av|\leq \Lambda.
\end{align*}
Thus
\begin{align*}
|\Phi_S(w_1)-\Phi_S(w_2)|\leq \Lambda |w_1-w_2|.
\end{align*}
So $\Phi_S$ is $\Lambda$-Lipschitz.
We now apply the [Gaussian concentration inequality for Lipschitz functions](/page/Gaussian%20Concentration%20Inequality). Its hypotheses are satisfied because $g$ is a standard Gaussian random vector in $\mathbb{R}^n$, the functional $\Phi_S:\mathbb{R}^n\to\mathbb{R}$ is Borel measurable with finite expectation as verified above, and $\Phi_S$ is $\Lambda$-Lipschitz. Applied to $\Phi_S$, it gives, for every $r\geq 0$,
\begin{align*}
\mathbb{P}\left(\Phi_S(g)\leq \mathbb{E}[\Phi_S(g)]+\Lambda r\right)\geq 1-e^{-r^2/2}.
\end{align*}
Since $\Phi_S(g)=Z_S$, this becomes
\begin{align*}
\mathbb{P}\left(Z_S\leq \mathbb{E}[Z_S]+\Lambda r\right)\geq 1-e^{-r^2/2}.
\end{align*}
Using the expectation bound for $Z_S$, we conclude that with probability at least $1-e^{-r^2/2}$,
\begin{align*}
Z_S
\leq
\Gamma\sqrt{k\log\left(\frac{ed}{k}\right)}
+
\Lambda r.
\end{align*}
Multiplying by $\sigma$ gives the corresponding bound for the original noise vector:
\begin{align*}
\sup_{v\in K_S}\varepsilon\cdot Av
=
\sigma Z_S
\leq
\sigma
\left(
\Gamma\sqrt{k\log\left(\frac{ed}{k}\right)}
+
\Lambda r
\right).
\end{align*}[/guided]