Attributions & Verification

Track contributions and verify content correctness

Proof

custom_env admin

[guided]We start from the definition because the subdifferential is a supporting-hyperplane condition. The function under consideration is the map $f: \mathbb{R}^p \to \mathbb{R}$ given by $x \mapsto \|x\|_1$. Equivalently, for each $x \in \mathbb{R}^p$, \begin{align*} f(x) = \|x\|_1 = \sum_{k=1}^p |x_k|. \end{align*} By the definition of the [convex subdifferential](/page/Convex%20Subdifferential), a vector $z \in \mathbb{R}^p$ lies in $\partial f(\beta)$ precisely when \begin{align*} f(\gamma) \geq f(\beta) + z \cdot (\gamma - \beta) \end{align*} for every $\gamma \in \mathbb{R}^p$. To learn what this condition says about one coordinate, fix $j \in \{1,\dots,p\}$ and let $e_j \in \mathbb{R}^p$ be the $j$-th standard basis vector. We test the subgradient inequality only on points of the form $\gamma = \beta + t e_j$, where $t \in \mathbb{R}$. This changes the $j$-th coordinate and leaves every other coordinate fixed. Therefore \begin{align*} f(\beta + t e_j) - f(\beta) = \sum_{k=1}^p |\beta_k + t(e_j)_k| - \sum_{k=1}^p |\beta_k|. \end{align*} Since $(e_j)_j = 1$ and $(e_j)_k = 0$ for $k \ne j$, all terms with $k \ne j$ cancel, so \begin{align*} f(\beta + t e_j) - f(\beta) = |\beta_j + t| - |\beta_j|. \end{align*} Also, \begin{align*} z \cdot ((\beta + t e_j) - \beta) = z \cdot (t e_j) = z_j t. \end{align*} Thus the subgradient inequality implies the one-dimensional inequality \begin{align*} |\beta_j + t| - |\beta_j| \geq z_j t \end{align*} for every $t \in \mathbb{R}$. Now suppose $\beta_j \ne 0$. Define $s_j := \operatorname{sgn}(\beta_j)$. For every $t$ with $|t| < |\beta_j|$, the perturbation is too small to cross zero, so $\beta_j$ and $\beta_j + t$ have the same sign. Hence \begin{align*} |\beta_j + t| - |\beta_j| = s_j t. \end{align*} Substituting this into the one-dimensional inequality gives \begin{align*} s_j t \geq z_j t \end{align*} for every $t$ with $|t| < |\beta_j|$. If $t > 0$, division by $t$ gives $s_j \geq z_j$. If $t < 0$, division reverses the inequality and gives $s_j \leq z_j$. These two inequalities force \begin{align*} z_j = s_j = \operatorname{sgn}(\beta_j). \end{align*} Now suppose $\beta_j = 0$. The same one-dimensional inequality reduces to \begin{align*} |t| \geq z_j t \end{align*} for every $t \in \mathbb{R}$. Taking $t > 0$ gives $t \geq z_j t$, hence $z_j \leq 1$. Taking $t < 0$ gives $-t \geq z_j t$, and dividing by the negative number $t$ gives $z_j \geq -1$. Therefore \begin{align*} z_j \in [-1,1]. \end{align*} Since the coordinate $j$ was arbitrary, the subgradient condition forces exactly the stated coordinate restrictions.[/guided]

custom_env admin

Verification Progress

5 Total Blocks

0 Verified

0% verified

Contributors

admin 5 blocks (0 verified)

Who Can Verify

Areas: Probability & Statistics

Viktor Miykov Admin

Max Vassiliev Global Reviewer

Horia Neagu Global Reviewer

강현욱 Global Reviewer

Demo Testing Global Reviewer

Archie Pennycook Global Reviewer

Quick Actions

Edit Theorem

What brings you to Androma?

Start with a route through the knowledge graph.

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Sign in to Androma

Check your inbox

One last step

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Raw Attribution Data