Attributions & Verification

Track contributions and verify content correctness

Proof

custom_env admin

[step:Prove the weak law for integrable finite-dimensional random vectors] Let $d \in \mathbb{N}$ and let $Z_1,Z_2,\dots : \Omega \to \mathbb{R}^d$ be i.i.d. random vectors with $\mathbb{E}[|Z_1|] < \infty$. We claim that \begin{align*} \frac{1}{n}\sum_{i=1}^n Z_i \xrightarrow{\mathbb{P}} \mathbb{E}[Z_1]. \end{align*} For $M > 0$, define the truncated random vector $Z_i^{(M)} : \Omega \to \mathbb{R}^d$ by \begin{align*} Z_i^{(M)} := Z_i\,\mathbb{1}_{\{|Z_i|\le M\}}. \end{align*} Then $|Z_i^{(M)}| \le M$ and \begin{align*} \mathbb{E}\left[\left|Z_1 - Z_1^{(M)}\right|\right] = \mathbb{E}\left[|Z_1|\mathbb{1}_{\{|Z_1|>M\}}\right] \to 0 \end{align*} as $M \to \infty$, by integrability of $|Z_1|$. Fix $\varepsilon > 0$. Choose $M > 0$ such that \begin{align*} \mathbb{E}\left[\left|Z_1 - Z_1^{(M)}\right|\right] < \frac{\varepsilon^2}{12}. \end{align*} By Markov's inequality applied to the non-negative random variable \begin{align*} \left|\frac{1}{n}\sum_{i=1}^n (Z_i-Z_i^{(M)})\right|, \end{align*} we get \begin{align*} \mathbb{P}\left( \left|\frac{1}{n}\sum_{i=1}^n (Z_i-Z_i^{(M)})\right|>\frac{\varepsilon}{3} \right) \le \frac{3}{\varepsilon}\mathbb{E}\left[\left|Z_1-Z_1^{(M)}\right|\right]. \end{align*} Also \begin{align*} \left|\mathbb{E}[Z_1-Z_1^{(M)}]\right| \le \mathbb{E}\left[\left|Z_1-Z_1^{(M)}\right|\right]. \end{align*} For the bounded centered average, independence gives \begin{align*} \mathbb{E}\left[ \left| \frac{1}{n}\sum_{i=1}^n \left(Z_i^{(M)}-\mathbb{E}[Z_1^{(M)}]\right) \right|^2 \right] = \frac{1}{n}\mathbb{E}\left[\left|Z_1^{(M)}-\mathbb{E}[Z_1^{(M)}]\right|^2\right] \le \frac{4M^2}{n}. \end{align*} Therefore [Chebyshev's inequality](/theorems/1126) gives \begin{align*} \mathbb{P}\left( \left| \frac{1}{n}\sum_{i=1}^n \left(Z_i^{(M)}-\mathbb{E}[Z_1^{(M)}]\right) \right|>\frac{\varepsilon}{3} \right) \le \frac{36M^2}{n\varepsilon^2}. \end{align*} Combining the three estimates by the triangle inequality, we obtain \begin{align*} \limsup_{n\to\infty} \mathbb{P}\left( \left| \frac{1}{n}\sum_{i=1}^n Z_i-\mathbb{E}[Z_1] \right|>\varepsilon \right) \le \frac{3}{\varepsilon}\mathbb{E}\left[\left|Z_1-Z_1^{(M)}\right|\right]. \end{align*} The chosen $M$ makes the right-hand side arbitrarily small by sending $M \to \infty$. Hence the claimed convergence in probability holds. [/step]

custom_env admin

[guided]We now use the regression equation to expose the error term. On the event that $S_n$ is invertible, the OLS estimator is $\hat{\beta}_n=S_n^{-1}r_n$. Since $Y_i=X_i^\top\beta_0+u_i$, we compute \begin{align*} r_n = \frac{1}{n}\sum_{i=1}^n X_iY_i = \frac{1}{n}\sum_{i=1}^n X_i(X_i^\top\beta_0+u_i) = \left(\frac{1}{n}\sum_{i=1}^n X_iX_i^\top\right)\beta_0 + \frac{1}{n}\sum_{i=1}^n X_i u_i. \end{align*} Using the definitions \begin{align*} S_n := \frac{1}{n}\sum_{i=1}^n X_iX_i^\top,\qquad m_n := \frac{1}{n}\sum_{i=1}^n X_i u_i, \end{align*} this becomes \begin{align*} r_n=S_n\beta_0+m_n. \end{align*} Multiplying by $S_n^{-1}$ on the event where $S_n$ is invertible gives \begin{align*} \hat{\beta}_n-\beta_0 = S_n^{-1}r_n-\beta_0 = S_n^{-1}(S_n\beta_0+m_n)-\beta_0 = S_n^{-1}m_n. \end{align*} Now fix $\varepsilon>0$ and let \begin{align*} E_n := \left\{|S_n-Q|_{\mathrm{op}}<\delta\right\}, \end{align*} where $\delta=\lambda_{\min}(Q)/2$. The previous step showed that $\mathbb{P}(E_n)\to 1$, and that on $E_n$ the matrix $S_n$ is invertible with \begin{align*} \|S_n^{-1}\|_{\mathrm{op}}\le \frac{2}{\lambda_{\min}(Q)}. \end{align*} Therefore, on $E_n$, \begin{align*} |\hat{\beta}_n-\beta_0| = |S_n^{-1}m_n| \le \|S_n^{-1}\|_{\mathrm{op}}|m_n| \le \frac{2}{\lambda_{\min}(Q)}|m_n|. \end{align*} Consequently, \begin{align*} \mathbb{P}\left(|\hat{\beta}_n-\beta_0|>\varepsilon\right) \le \mathbb{P}(E_n^c) + \mathbb{P}\left( \frac{2}{\lambda_{\min}(Q)}|m_n|>\varepsilon \right), \end{align*} that is, \begin{align*} \mathbb{P}\left(|\hat{\beta}_n-\beta_0|>\varepsilon\right) \le \mathbb{P}(E_n^c) + \mathbb{P}\left( |m_n|>\frac{\varepsilon\lambda_{\min}(Q)}{2} \right). \end{align*} The first probability tends to $0$ because $S_n$ is close to $Q$ with probability tending to $1$. The second tends to $0$ because the sample orthogonality moment $m_n$ converges to $0$ in probability. Since this holds for every $\varepsilon>0$, we conclude \begin{align*} \hat{\beta}_n \xrightarrow{\mathbb{P}} \beta_0. \end{align*}[/guided]

custom_env admin

Verification Progress

8 Total Blocks

0 Verified

0% verified

Contributors

admin 8 blocks (0 verified)

Who Can Verify

Areas: Probability & Statistics

Viktor Miykov Admin

Max Vassiliev Global Reviewer

Horia Neagu Global Reviewer

강현욱 Global Reviewer

Demo Testing Global Reviewer

Archie Pennycook Global Reviewer

Quick Actions

Edit Theorem

What brings you to Androma?

Start with a route through the knowledge graph.

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Sign in to Androma

Check your inbox

One last step

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Raw Attribution Data