Attributions & Verification

Track contributions and verify content correctness

Proof

custom_env admin

[step:Build the closest-target test induced by the estimator]Since $\hat\psi_n:\mathcal X^n\to\Psi$ is an estimator, it is $\mathcal A^{\otimes n}$-to-Borel measurable. The maps $y\mapsto d(y,\theta_0)$ and $y\mapsto d(y,\theta_1)$ from $\Psi$ to $\mathbb R$ are continuous because $d$ is a metric, so the set \begin{align*} A_0:=\{x\in\mathcal X^n: d(\hat\psi_n(x),\theta_0)\le d(\hat\psi_n(x),\theta_1)\} \end{align*} belongs to $\mathcal A^{\otimes n}$. Define \begin{align*} A_1:=\mathcal X^n\setminus A_0. \end{align*} Then $A_1\in\mathcal A^{\otimes n}$ as well. Define the induced test $\varphi:\mathcal X^n\to\{0,1\}$ by setting $\varphi(x)=0$ for $x\in A_0$ and $\varphi(x)=1$ for $x\in A_1$. The test $\varphi$ selects the target value closer to $\hat\psi_n(x)$, with ties assigned to $0$. On $A_1$, we have $d(\hat\psi_n(x),\theta_1)<d(\hat\psi_n(x),\theta_0)$. By the separation assumption, \begin{align*} 2s\le d(\theta_0,\theta_1). \end{align*} By the triangle inequality and the defining inequality of $A_1$, \begin{align*} d(\theta_0,\theta_1)\le d(\theta_0,\hat\psi_n(x))+d(\hat\psi_n(x),\theta_1)<2d(\theta_0,\hat\psi_n(x)). \end{align*} Thus $d(\hat\psi_n(x),\theta_0)>s$ for every $x\in A_1$. Therefore \begin{align*} R_0 =\int_{\mathcal X^n} d(\hat\psi_n(x),\theta_0)\,dQ_0(x) \ge s\,Q_0(A_1). \end{align*} On $A_0$, the definition of $A_0$ gives $d(\hat\psi_n(x),\theta_0)\le d(\hat\psi_n(x),\theta_1)$. Again, the separation assumption gives \begin{align*} 2s\le d(\theta_0,\theta_1). \end{align*} The triangle inequality and the defining inequality of $A_0$ give \begin{align*} d(\theta_0,\theta_1)\le d(\theta_0,\hat\psi_n(x))+d(\hat\psi_n(x),\theta_1)\le 2d(\hat\psi_n(x),\theta_1). \end{align*} Thus $d(\hat\psi_n(x),\theta_1)\ge s$ for every $x\in A_0$. Hence \begin{align*} R_1 =\int_{\mathcal X^n} d(\hat\psi_n(x),\theta_1)\,dQ_1(x) \ge s\,Q_1(A_0). \end{align*} Combining the two endpoint bounds gives \begin{align*} R_0+R_1\ge s\bigl(Q_0(A_1)+Q_1(A_0)\bigr). \end{align*}[/step]

custom_env admin

[guided]The estimator $\hat\psi_n$ takes values in the target space $\Psi$, not in the label set $\{0,1\}$. Recall that the two target points are defined by $\theta_i:=\psi(P_i)$ for $i\in\{0,1\}$. To compare estimation with testing, we turn the estimator into a test by asking which target point it is closer to. Since $\hat\psi_n:\mathcal X^n\to\Psi$ is $\mathcal A^{\otimes n}$-to-Borel measurable and the maps $y\mapsto d(y,\theta_0)$ and $y\mapsto d(y,\theta_1)$ are continuous on the [metric space](/page/Metric%20Space) $\Psi$, the set \begin{align*} A_0:=\{x\in\mathcal X^n: d(\hat\psi_n(x),\theta_0)\le d(\hat\psi_n(x),\theta_1)\} \end{align*} belongs to $\mathcal A^{\otimes n}$. Define \begin{align*} A_1:=\mathcal X^n\setminus A_0. \end{align*} Then $A_1\in\mathcal A^{\otimes n}$. Define $\varphi:\mathcal X^n\to\{0,1\}$ by $\varphi(x)=0$ on $A_0$ and $\varphi(x)=1$ on $A_1$. Thus $\varphi$ chooses model $0$ when $\hat\psi_n(x)$ is at least as close to $\theta_0$ as to $\theta_1$, and chooses model $1$ otherwise. Why does a wrong testing decision force a large estimation error? Suppose first that the true model is $0$, but the induced test chooses $1$, so $x\in A_1$. Then \begin{align*} d(\hat\psi_n(x),\theta_1)<d(\hat\psi_n(x),\theta_0). \end{align*} The separation assumption gives \begin{align*} 2s\le d(\theta_0,\theta_1). \end{align*} Using the triangle inequality between $\theta_0$, $\hat\psi_n(x)$, and $\theta_1$, and then using the displayed inequality above, we get \begin{align*} d(\theta_0,\theta_1)\le d(\theta_0,\hat\psi_n(x))+d(\hat\psi_n(x),\theta_1)<2d(\theta_0,\hat\psi_n(x)). \end{align*} Therefore $d(\hat\psi_n(x),\theta_0)>s$ on $A_1$. Integrating this pointwise lower bound with respect to the true law $Q_0$ gives \begin{align*} R_0 =\int_{\mathcal X^n} d(\hat\psi_n(x),\theta_0)\,dQ_0(x) \ge s\,Q_0(A_1). \end{align*} Now suppose the true model is $1$, but the induced test chooses $0$, so $x\in A_0$. The definition of $A_0$ gives \begin{align*} d(\hat\psi_n(x),\theta_0)\le d(\hat\psi_n(x),\theta_1). \end{align*} The separation assumption gives \begin{align*} 2s\le d(\theta_0,\theta_1). \end{align*} The same triangle inequality calculation gives \begin{align*} d(\theta_0,\theta_1)\le d(\theta_0,\hat\psi_n(x))+d(\hat\psi_n(x),\theta_1)\le 2d(\hat\psi_n(x),\theta_1). \end{align*} Therefore $d(\hat\psi_n(x),\theta_1)\ge s$ on $A_0$, and integration with respect to $Q_1$ yields \begin{align*} R_1 =\int_{\mathcal X^n} d(\hat\psi_n(x),\theta_1)\,dQ_1(x) \ge s\,Q_1(A_0). \end{align*} Adding the two inequalities gives the reduction from estimation risk to testing error: \begin{align*} R_0+R_1\ge s\bigl(Q_0(A_1)+Q_1(A_0)\bigr). \end{align*}[/guided]

custom_env admin

Verification Progress

6 Total Blocks

0 Verified

0% verified

Contributors

admin 6 blocks (0 verified)

Who Can Verify

Areas: Probability & Statistics

Viktor Miykov Admin

Max Vassiliev Global Reviewer

Horia Neagu Global Reviewer

강현욱 Global Reviewer

Demo Testing Global Reviewer

Archie Pennycook Global Reviewer

Quick Actions

Edit Theorem

What brings you to Androma?

Start with a route through the knowledge graph.

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Sign in to Androma

Check your inbox

One last step

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Raw Attribution Data