Androma — The Home of Mathematics on the Internet

custom_env admin

[step:Rewrite the centred estimator as a triangular-array sum] Let $(\Omega,\mathcal F,\mathbb P)$ denote the probability space on which the real-valued random variables $X_i:(\Omega,\mathcal F)\to(\mathbb R,\mathcal B(\mathbb R))$ are defined. Let $\mathcal L^1$ denote one-dimensional [Lebesgue measure](/page/Lebesgue%20Measure) on $\mathbb R$, let $\mathbb E[Z]:=\int_\Omega Z\,d\mathbb P$ denote expectation of an integrable real-valued [random variable](/page/Random%20Variable) $Z$, and let $\operatorname{Var}(Z):=\mathbb E[(Z-\mathbb E[Z])^2]$ denote its variance when $Z\in L^2(\Omega,\mathcal F,\mathbb P)$. Define the kernel estimator by \begin{align*} \hat f_{n,h_n}(x):=\frac{1}{nh_n}\sum_{i=1}^{n}K\left(\frac{x-X_i}{h_n}\right). \end{align*} Define the squared kernel integral by \begin{align*} R(K):=\int_{\mathbb R}K(t)^2\,d\mathcal L^1(t). \end{align*} Let $\operatorname{supp}K$ denote the closed support of the measurable kernel $K:\mathbb R\to\mathbb R$. Because $K$ is measurable, each composition $\omega\mapsto K((x-X_i(\omega))/h_n)$ is measurable. For each $n\in\mathbb N$ and $1\le i\le n$, define the random variable \begin{align*} W_{n,i}:\Omega\to\mathbb R,\qquad W_{n,i}(\omega)=h_n^{-1}K\left(\frac{x-X_i(\omega)}{h_n}\right), \end{align*} and define \begin{align*} Y_{n,i}:\Omega\to\mathbb R,\qquad Y_{n,i}(\omega)=\sqrt{\frac{h_n}{n}}\left(W_{n,i}(\omega)-\mathbb E[W_{n,i}]\right). \end{align*} For each fixed $n$, the random variables $Y_{n,1},\dots,Y_{n,n}$ are independent because $X_1,\dots,X_n$ are independent, and they are centred by construction. Moreover, \begin{align*} \sum_{i=1}^{n}Y_{n,i}=\sqrt{\frac{h_n}{n}}\sum_{i=1}^{n}\left(h_n^{-1}K\left(\frac{x-X_i}{h_n}\right)-\mathbb E\left[h_n^{-1}K\left(\frac{x-X_i}{h_n}\right)\right]\right). \end{align*} Equivalently, \begin{align*} \sum_{i=1}^{n}Y_{n,i}=\sqrt{nh_n}\left(\frac{1}{nh_n}\sum_{i=1}^{n}K\left(\frac{x-X_i}{h_n}\right)-\mathbb E[\hat f_{n,h_n}(x)]\right). \end{align*} By the definition of $\hat f_{n,h_n}(x)$, \begin{align*} \sum_{i=1}^{n}Y_{n,i}=\sqrt{nh_n}\left(\hat f_{n,h_n}(x)-\mathbb E[\hat f_{n,h_n}(x)]\right). \end{align*} Thus it suffices to prove a [central limit theorem](/theorems/521) for the triangular array $(Y_{n,i})_{1\le i\le n}$. [/step]

custom_env admin

[step:Compute the limiting variance of the triangular array]Let \begin{align*} \sigma_n^2:=\sum_{i=1}^{n}\operatorname{Var}(Y_{n,i}). \end{align*} Since the $Y_{n,i}$ have the same distribution for fixed $n$, \begin{align*} \sigma_n^2=n\operatorname{Var}(Y_{n,1})=h_n\operatorname{Var}(W_{n,1}). \end{align*} We compute the two terms in this variance. Since $X_1$ has density $f$ with respect to $\mathcal L^1$, \begin{align*} \mathbb E[W_{n,1}] &=\int_{\mathbb R} h_n^{-1}K\left(\frac{x-u}{h_n}\right)f(u)\,d\mathcal L^1(u). \end{align*} Under the substitution $t=(x-u)/h_n$, equivalently $u=x-h_nt$, the one-dimensional Lebesgue measure transforms as $d\mathcal L^1(u)=h_n\,d\mathcal L^1(t)$, and the domain $\mathbb R$ is mapped onto $\mathbb R$. Therefore \begin{align*} \mathbb E[W_{n,1}] &=\int_{\mathbb R}K(t)f(x-h_nt)\,d\mathcal L^1(t). \end{align*} Similarly, \begin{align*} h_n\mathbb E[W_{n,1}^2] =h_n\int_{\mathbb R}h_n^{-2}K\left(\frac{x-u}{h_n}\right)^2f(u)\,d\mathcal L^1(u). \end{align*} Applying the same substitution gives \begin{align*} h_n\mathbb E[W_{n,1}^2] =\int_{\mathbb R}K(t)^2f(x-h_nt)\,d\mathcal L^1(t). \end{align*} Choose $A>0$ such that $\operatorname{supp}K\subset[-A,A]$. Since $f$ is continuous at $x$, there exist $\delta>0$ and $B<\infty$ such that $f(y)\le B$ whenever $|y-x|<\delta$. For all sufficiently large $n$, $h_nA<\delta$, so on $\operatorname{supp}K$ we have $f(x-h_nt)\le B$. Since $K$ is bounded and supported in $[-A,A]$, the functions $t\mapsto B K(t)^2$ and $t\mapsto B|K(t)|$ are integrable with respect to $\mathcal L^1$. The dominated convergence theorem, applied first with dominating function $B K(t)^2$ and then with dominating function $B|K(t)|$, gives \begin{align*} \int_{\mathbb R}K(t)^2f(x-h_nt)\,d\mathcal L^1(t) \to f(x)\int_{\mathbb R}K(t)^2\,d\mathcal L^1(t) =f(x)R(K), \end{align*} and \begin{align*} \int_{\mathbb R}K(t)f(x-h_nt)\,d\mathcal L^1(t) \to f(x)\int_{\mathbb R}K(t)\,d\mathcal L^1(t). \end{align*} Hence $\mathbb E[W_{n,1}]$ is bounded as $n\to\infty$, and because $h_n\to0$, \begin{align*} h_n(\mathbb E[W_{n,1}])^2\to0. \end{align*} Combining the preceding identities, \begin{align*} \sigma_n^2=h_n\operatorname{Var}(W_{n,1}). \end{align*} The variance identity gives \begin{align*} \sigma_n^2=h_n\mathbb E[W_{n,1}^2]-h_n(\mathbb E[W_{n,1}])^2. \end{align*} Therefore \begin{align*} \sigma_n^2\to f(x)R(K). \end{align*}[/step]

custom_env admin

[guided]The variance computation is the point where the scaling $\sqrt{nh_n}$ is determined. We need the total variance of the triangular array to converge to a finite non-zero quantity, and the natural candidate is the local value of the density times the squared $L^2$ size of the kernel. For each $n\in\mathbb N$, define \begin{align*} \sigma_n^2:=\sum_{i=1}^{n}\operatorname{Var}(Y_{n,i}). \end{align*} Because $X_1,\dots,X_n$ are identically distributed, the random variables $Y_{n,1},\dots,Y_{n,n}$ also have the same distribution. Therefore \begin{align*} \sigma_n^2=n\operatorname{Var}(Y_{n,1})=n\operatorname{Var}\left(\sqrt{\frac{h_n}{n}}W_{n,1}\right)=h_n\operatorname{Var}(W_{n,1}), \end{align*} where subtracting the mean inside the definition of $Y_{n,1}$ does not change the variance except for centring. We now compute $\mathbb E[W_{n,1}]$ and $\mathbb E[W_{n,1}^2]$ from the density of $X_1$. Since $X_1$ has density $f$ with respect to $\mathcal L^1$, \begin{align*} \mathbb E[W_{n,1}] &=\int_{\mathbb R} h_n^{-1}K\left(\frac{x-u}{h_n}\right)f(u)\,d\mathcal L^1(u). \end{align*} Use the substitution $t=(x-u)/h_n$, so that $u=x-h_nt$. The map $t\mapsto x-h_nt$ sends $\mathbb R$ onto $\mathbb R$, and the one-dimensional Lebesgue measure transforms by \begin{align*} d\mathcal L^1(u)=h_n\,d\mathcal L^1(t). \end{align*} Thus \begin{align*} \mathbb E[W_{n,1}] &=\int_{\mathbb R}K(t)f(x-h_nt)\,d\mathcal L^1(t). \end{align*} The same substitution applied to the second moment starts from \begin{align*} h_n\mathbb E[W_{n,1}^2] =h_n\int_{\mathbb R}h_n^{-2}K\left(\frac{x-u}{h_n}\right)^2f(u)\,d\mathcal L^1(u). \end{align*} After substituting $t=(x-u)/h_n$, this becomes \begin{align*} h_n\mathbb E[W_{n,1}^2] =\int_{\mathbb R}K(t)^2f(x-h_nt)\,d\mathcal L^1(t). \end{align*} Now we justify the limiting passage. Since $K$ has compact support, choose $A>0$ such that $\operatorname{supp}K\subset[-A,A]$. Since $f$ is continuous at $x$, it is bounded in a neighbourhood of $x$: there exist $\delta>0$ and $B<\infty$ such that $f(y)\le B$ whenever $|y-x|<\delta$. For all sufficiently large $n$, $h_nA<\delta$. Hence, whenever $t\in\operatorname{supp}K$, we have \begin{align*} |x-h_nt-x|\le h_nA<\delta, \end{align*} and therefore $f(x-h_nt)\le B$. The functions $t\mapsto K(t)^2f(x-h_nt)$ are then dominated by the integrable function $t\mapsto B K(t)^2$ supported on $[-A,A]$. Also $f(x-h_nt)\to f(x)$ for each fixed $t\in\mathbb R$, because $h_n\to0$ and $f$ is continuous at $x$. By the dominated convergence theorem, \begin{align*} \int_{\mathbb R}K(t)^2f(x-h_nt)\,d\mathcal L^1(t) \to f(x)\int_{\mathbb R}K(t)^2\,d\mathcal L^1(t) =f(x)R(K). \end{align*} For the first moment, the possible sign changes of $K$ require domination by the absolute value. Since $K$ is bounded and compactly supported, the function $t\mapsto B|K(t)|$ is integrable with respect to $\mathcal L^1$, and \begin{align*} |K(t)f(x-h_nt)|\le B|K(t)| \end{align*} for all sufficiently large $n$ and all $t\in\operatorname{supp}K$. Applying the dominated convergence theorem again gives \begin{align*} \int_{\mathbb R}K(t)f(x-h_nt)\,d\mathcal L^1(t) \to f(x)\int_{\mathbb R}K(t)\,d\mathcal L^1(t). \end{align*} In particular, $\mathbb E[W_{n,1}]$ is bounded as $n\to\infty$. Therefore \begin{align*} h_n(\mathbb E[W_{n,1}])^2\to0, \end{align*} because $h_n\to0$. Finally, \begin{align*} \sigma_n^2=h_n\operatorname{Var}(W_{n,1}). \end{align*} Using the variance identity gives \begin{align*} \sigma_n^2=h_n\mathbb E[W_{n,1}^2]-h_n(\mathbb E[W_{n,1}])^2. \end{align*} Combining the two limits proved above, \begin{align*} \sigma_n^2\to f(x)R(K). \end{align*} This proves that the total variance of the centred triangular array converges to the variance appearing in the claimed normal limit.[/guided]

custom_env admin

What brings you to Androma?

Start with a route through the knowledge graph.

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Sign in to Androma

Check your inbox

One last step

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Raw Attribution Data