Attributions & Verification

Track contributions and verify content correctness

Proof

custom_env admin

[guided]The purpose of this step is to turn the random quantity involving $\varepsilon$ into a deterministic norm bound and then separate the active coordinates $S$ from the inactive coordinates $S^c$.

On the event $\mathcal{E}_\lambda$, the vector $\frac{1}{n}X^\top\varepsilon \in \mathbb{R}^p$ has $\ell^\infty$ norm at most $\lambda/2$. Applying the [dual norm inequality](/page/Dual%20Norm) between $\ell^\infty$ and $\ell^1$ to the vectors $\frac{1}{n}X^\top\varepsilon$ and $\Delta$ gives \begin{align*} \left|\frac{2}{n}\varepsilon^\top X\Delta\right| = 2\left|\left(\frac{1}{n}X^\top\varepsilon\right)^\top \Delta\right| \le 2\left\|\frac{1}{n}X^\top\varepsilon\right\|_\infty \|\Delta\|_1 \le \lambda\|\Delta\|_1. \end{align*} This is the only place where the event $\mathcal{E}_\lambda$ is used.

Next we exploit sparsity. Since $S=\operatorname{supp}(\beta^*)$, the inactive coordinates satisfy $\beta^*_{S^c}=0$. Hence \begin{align*} \hat{\beta}_{S^c} = \beta^*_{S^c}+\Delta_{S^c} = \Delta_{S^c}. \end{align*} On the active coordinates, we also have \begin{align*} \hat{\beta}_S = \beta^*_S+\Delta_S. \end{align*} Because the $\ell^1$ norm splits additively over the disjoint index sets $S$ and $S^c$, it follows that \begin{align*} \|\hat{\beta}\|_1 = \|\beta^*_S+\Delta_S\|_1+\|\Delta_{S^c}\|_1. \end{align*}

By the [reverse triangle inequality](/page/Reverse%20Triangle%20Inequality) on $\mathbb{R}^S$, \begin{align*} \|\beta^*_S\|_1-\|\beta^*_S+\Delta_S\|_1 \le \|\Delta_S\|_1. \end{align*} Since $\|\beta^*\|_1=\|\beta^*_S\|_1$, it follows that \begin{align*} \|\beta^*\|_1-\|\hat{\beta}\|_1 = \|\beta^*_S\|_1-\|\beta^*_S+\Delta_S\|_1-\|\Delta_{S^c}\|_1. \end{align*} Combining this identity with the [reverse triangle inequality](/theorems/2300) gives \begin{align*} \|\beta^*\|_1-\|\hat{\beta}\|_1 \le \|\Delta_S\|_1-\|\Delta_{S^c}\|_1. \end{align*}

Substituting the stochastic bound and this support estimate into the basic inequality yields \begin{align*} \frac{1}{n}\|X\Delta\|_2^2 \le \lambda\|\Delta\|_1 + 2\lambda\bigl(\|\Delta_S\|_1-\|\Delta_{S^c}\|_1\bigr). \end{align*} Using the support decomposition $\|\Delta\|_1=\|\Delta_S\|_1+\|\Delta_{S^c}\|_1$, the right-hand side is \begin{align*} \lambda\bigl(\|\Delta_S\|_1+\|\Delta_{S^c}\|_1\bigr) + 2\lambda\|\Delta_S\|_1 - 2\lambda\|\Delta_{S^c}\|_1. \end{align*} Collecting the active and inactive terms gives \begin{align*} 3\lambda\|\Delta_S\|_1-\lambda\|\Delta_{S^c}\|_1. \end{align*} Dropping the non-positive term $-\lambda\|\Delta_{S^c}\|_1$ gives the simpler, though weaker, prediction estimate \begin{align*} \frac{1}{n}\|X\Delta\|_2^2 \le 3\lambda\|\Delta_S\|_1. \end{align*}[/guided]
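The deterministic steps of this argument can be sanity-checked numerically. The following is a minimal sketch, not part of the proof: it assumes NumPy, uses arbitrary illustrative dimensions `n`, `p`, `s`, and takes `beta_hat` to be an arbitrary perturbation of `beta_star` rather than an actual Lasso solution (the dual norm bound and the support estimate hold for any such vector). The value `lam` is chosen as the smallest $\lambda$ for which the event $\mathcal{E}_\lambda$ holds for the simulated noise.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, s = 50, 20, 4  # illustrative sizes (assumptions, not from the text)

X = rng.standard_normal((n, p))
eps = rng.standard_normal(n)

# Sparse truth: the support S is the first s coordinates.
beta_star = np.zeros(p)
beta_star[:s] = rng.standard_normal(s)
S = np.arange(p) < s  # boolean mask for the active coordinates

# Arbitrary candidate vector (hypothetical stand-in for the estimator).
beta_hat = beta_star + rng.standard_normal(p)
delta = beta_hat - beta_star

l1 = lambda v: np.sum(np.abs(v))

# Step 1: dual norm inequality between l-infinity and l-1.
noise_sup = np.max(np.abs(X.T @ eps / n))
lhs_dual = abs(2.0 / n * (eps @ X @ delta))
rhs_dual = 2.0 * noise_sup * l1(delta)

# Step 2: support estimate from the reverse triangle inequality.
lhs_supp = l1(beta_star) - l1(beta_hat)
rhs_supp = l1(delta[S]) - l1(delta[~S])

# Step 3: collecting terms, with lam the smallest value for which
# the sup-norm of X^T eps / n is at most lam / 2.
lam = 2.0 * noise_sup
combined = lam * l1(delta) + 2.0 * lam * rhs_supp
collected = 3.0 * lam * l1(delta[S]) - lam * l1(delta[~S])

print(lhs_dual <= rhs_dual, lhs_supp <= rhs_supp, np.isclose(combined, collected))
```

Each printed flag corresponds to one display in the proof: the stochastic bound, the support estimate, and the algebraic regrouping into $3\lambda\|\Delta_S\|_1-\lambda\|\Delta_{S^c}\|_1$.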


Verification Progress

9 Total Blocks

0 Verified

0% verified

Contributors

admin 9 blocks (0 verified)

