Jackknife Bias Reduction — Statement & Proof

Jackknife Bias Reduction (Theorem # 1986)

Theorem

Edit Issues Pull Requests Attributions Admin

Discussion

No discussion available for this theorem.

Proof

[proofplan] The regularity assumption provides a bias expansion $\mathbb{E}[T_n] = \theta + a(\theta)/n + O(1/n^2)$ for the full-sample estimator, and the same expansion applied to each leave-one-out estimator $T_{n-1,i}$ gives $\mathbb{E}[T_{n-1,i}] = \theta + a(\theta)/(n-1) + O(1/n^2)$. We form the jackknife estimator as a specific linear combination of $T_n$ and $\bar{T}_{n-1} = \frac{1}{n}\sum_{i=1}^n T_{n-1,i}$ designed to cancel the $O(1/n)$ term. Computing the expectation of this combination confirms that only the $O(1/n^2)$ remainder survives. [/proofplan] [step:Write the bias expansion for the full-sample estimator $T_n$] The regularity assumption on the bias states that $\mathbb{E}_\theta[T_n]$ admits the uniform expansion \begin{align*} \mathbb{E}_\theta[T_n] = \theta + \frac{a(\theta)}{n} + \frac{b(\theta)}{n^2} + O\!\left(\frac{1}{n^3}\right), \end{align*} where $a(\theta)$ and $b(\theta)$ are functions of $\theta$ that do not depend on $n$, and the $O(1/n^3)$ remainder is uniform in $\theta$. The leading bias term is $a(\theta)/n$. [/step] [step:Write the bias expansion for each leave-one-out estimator $T_{n-1,i}$] For each $i \in \{1, \ldots, n\}$, the leave-one-out estimator $T_{n-1,i}$ is computed from the sample $(X_1, \ldots, X_{i-1}, X_{i+1}, \ldots, X_n)$ of size $n-1$. Since the $X_j$ are i.i.d., $T_{n-1,i}$ has the same distribution as $T_{n-1}$ (an estimator computed from $n-1$ i.i.d. observations), so the same bias expansion holds with $n$ replaced by $n-1$: \begin{align*} \mathbb{E}_\theta[T_{n-1,i}] = \theta + \frac{a(\theta)}{n-1} + \frac{b(\theta)}{(n-1)^2} + O\!\left(\frac{1}{n^3}\right). \end{align*} Define the leave-one-out average $\bar{T}_{n-1} = \frac{1}{n}\sum_{i=1}^n T_{n-1,i}$. By linearity of expectation: \begin{align*} \mathbb{E}_\theta[\bar{T}_{n-1}] = \theta + \frac{a(\theta)}{n-1} + \frac{b(\theta)}{(n-1)^2} + O\!\left(\frac{1}{n^3}\right). \end{align*} [guided] Each $T_{n-1,i}$ is computed from $n-1$ observations drawn i.i.d. from the same distribution. The regularity assumption is a statement about the bias of $T_m$ as a function of sample size $m$, so it applies equally well to $m = n-1$. The crucial point is that $a(\theta)$ is the same function regardless of sample size; only the denominator changes from $n$ to $n-1$. Since the leave-one-out average $\bar{T}_{n-1}$ is a simple average of $n$ identically distributed terms, its expectation equals the common expectation $\mathbb{E}[T_{n-1,i}]$. [/guided] [/step] [step:Form the jackknife estimator and cancel the $O(1/n)$ bias] The jackknife bias-corrected estimator is defined as \begin{align*} \tilde{T}_{\mathrm{JACK}} = n T_n - (n-1) \bar{T}_{n-1}. \end{align*} Taking expectations: \begin{align*} \mathbb{E}_\theta[\tilde{T}_{\mathrm{JACK}}] &= n \mathbb{E}_\theta[T_n] - (n-1) \mathbb{E}_\theta[\bar{T}_{n-1}] \\ &= n\!\left(\theta + \frac{a(\theta)}{n} + \frac{b(\theta)}{n^2} + O\!\left(\frac{1}{n^3}\right)\right) - (n-1)\!\left(\theta + \frac{a(\theta)}{n-1} + \frac{b(\theta)}{(n-1)^2} + O\!\left(\frac{1}{n^3}\right)\right). \end{align*} Expanding term by term: The $\theta$ terms: $n\theta - (n-1)\theta = \theta$. The $a(\theta)$ terms: $n \cdot \frac{a(\theta)}{n} - (n-1) \cdot \frac{a(\theta)}{n-1} = a(\theta) - a(\theta) = 0$. The $b(\theta)$ terms: $n \cdot \frac{b(\theta)}{n^2} - (n-1) \cdot \frac{b(\theta)}{(n-1)^2} = \frac{b(\theta)}{n} - \frac{b(\theta)}{n-1}$. Now $\frac{1}{n} - \frac{1}{n-1} = \frac{n-1-n}{n(n-1)} = \frac{-1}{n(n-1)}$, so the $b(\theta)$ contribution is $\frac{-b(\theta)}{n(n-1)} = O(1/n^2)$. The remaining terms contribute $n \cdot O(1/n^3) + (n-1) \cdot O(1/n^3) = O(1/n^2)$. Combining: \begin{align*} \mathbb{E}_\theta[\tilde{T}_{\mathrm{JACK}}] = \theta + O\!\left(\frac{1}{n^2}\right). \end{align*} [guided] The jackknife is a clever linear combination designed to kill the $a(\theta)/n$ term. Why does the particular choice $nT_n - (n-1)\bar{T}_{n-1}$ work? Consider the general linear combination $\alpha T_n + \beta \bar{T}_{n-1}$. For unbiasedness up to $O(1/n^2)$, we need two conditions: 1. The $\theta$ coefficient is 1: $\alpha + \beta = 1$. 2. The $a(\theta)$ coefficient is 0: $\alpha \cdot \frac{1}{n} + \beta \cdot \frac{1}{n-1} = 0$. From condition 2: $\alpha(n-1) + \beta n = 0$, so $\beta = -\alpha(n-1)/n$. Substituting into condition 1: $\alpha - \alpha(n-1)/n = 1$, giving $\alpha/n = 1$, so $\alpha = n$ and $\beta = -(n-1)$. This yields $\tilde{T}_{\mathrm{JACK}} = nT_n - (n-1)\bar{T}_{n-1}$. The leading bias $a(\theta)/n$ cancels exactly, and the next-order term $b(\theta)$ is reduced from $O(1/n)$ to $O(1/n^2)$ because \begin{align*} n \cdot \frac{b(\theta)}{n^2} - (n-1) \cdot \frac{b(\theta)}{(n-1)^2} = b(\theta)\!\left(\frac{1}{n} - \frac{1}{n-1}\right) = \frac{-b(\theta)}{n(n-1)} = O\!\left(\frac{1}{n^2}\right). \end{align*} The conclusion is \begin{align*} \left|\mathbb{E}_\theta[\tilde{T}_{\mathrm{JACK}}] - \theta\right| = O\!\left(\frac{1}{n^2}\right), \end{align*} confirming that the jackknife reduces the bias from $O(1/n)$ to $O(1/n^2)$. [/guided] [/step]

Explore Further

Multivariate Delta Method Statistics Stein's Lemma Statistics Null Distribution of the Sample Correlation Statistics James-Stein Risk Statistics Distribution of the Least Squares Estimator Statistics t-Statistic Distribution Statistics Univariate Delta Method Statistics Fisher Information as Negative Expected Hessian Statistics

What brings you to Androma?

Start with a route through the knowledge graph.

Jackknife Bias Reduction (Theorem # 1986)

Discussion

Proof

Explore Further

Sign in to Androma

Check your inbox

One last step

Jackknife Bias Reduction (Theorem # 1986)

Discussion

Proof

Explore Further