Conditional Expectations are Uniformly Integrable

Conditional Expectations are Uniformly Integrable (Theorem # 1161)

Theorem

Edit Issues Pull Requests Attributions Admin

Discussion

Proof

[proofplan] We exploit the absolute [continuity](/page/Continuity) of the [Lebesgue integral](/page/Lebesgue%20Integral) to convert the $L^1$ integrability of $X$ into an $\varepsilon$-$\delta$ condition on [sets](/page/Set) of small measure. We then use Markov's inequality applied to $|\mathbb{E}[X \mid \mathcal{G}]|$ — whose $L^1$ norm is controlled by that of $X$ via the conditional triangle inequality — to show that the set $\{|\mathbb{E}[X \mid \mathcal{G}]| \geq \lambda\}$ has uniformly small measure for $\lambda$ large, regardless of the choice of sub-$\sigma$-algebra $\mathcal{G}$. The defining property of conditional expectation then transfers the tail [integral](/page/Integral) of $|\mathbb{E}[X \mid \mathcal{G}]|$ back to an integral of $|X|$ over a set of small measure, which is small by absolute continuity. [/proofplan] [step:Extract the $\varepsilon$-$\delta$ condition from absolute continuity of the integral of $|X|$] Fix $\varepsilon > 0$. Since $X \in L^1(\Omega, \mathcal{F}, \mathbb{P})$, the set [function](/page/Function) \begin{align*} \nu \colon \mathcal{F} &\to [0, \infty) \\ A &\mapsto \int_A |X| \, d\mathbb{P} \end{align*} is a finite measure on $(\Omega, \mathcal{F})$ that is absolutely continuous with respect to $\mathbb{P}$. By absolute continuity of the Lebesgue integral, there exists $\delta > 0$ such that for every $A \in \mathcal{F}$, \begin{align*} \mathbb{P}(A) < \delta \implies \mathbb{E}[|X| \mathbb{1}_A] < \varepsilon. \end{align*} [guided] Fix $\varepsilon > 0$. The proof requires an $\varepsilon$-$\delta$ link between the $\mathbb{P}$-measure of a set and the integral of $|X|$ over that set. Where does this come from? Since $X \in L^1(\Omega, \mathcal{F}, \mathbb{P})$, the set function \begin{align*} \nu \colon \mathcal{F} &\to [0, \infty) \\ A &\mapsto \int_A |X| \, d\mathbb{P} = \mathbb{E}[|X| \mathbb{1}_A] \end{align*} defines a finite measure on $(\Omega, \mathcal{F})$ with $\nu(\Omega) = \mathbb{E}[|X|] < \infty$. This measure is absolutely continuous with respect to $\mathbb{P}$: if $\mathbb{P}(A) = 0$, then $|X| \mathbb{1}_A = 0$ $\mathbb{P}$-a.s., so $\nu(A) = 0$. The standard characterisation of absolute continuity for finite measures gives the $\varepsilon$-$\delta$ form: there exists $\delta > 0$ such that for every $A \in \mathcal{F}$, \begin{align*} \mathbb{P}(A) < \delta \implies \mathbb{E}[|X| \mathbb{1}_A] < \varepsilon. \end{align*} This $\delta$ depends on $\varepsilon$ and on the function $X$, but not on the choice of $A$ or of any sub-$\sigma$-algebra. It is this uniformity that will drive the entire argument. [/guided] [/step] [step:Choose $\lambda$ so that every conditional expectation has tail measure less than $\delta$] Let $\mathcal{G} \subset \mathcal{F}$ be an arbitrary sub-$\sigma$-algebra, and write $Y := \mathbb{E}[X \mid \mathcal{G}]$, which exists by the [Existence and Uniqueness of Conditional Expectation](/theorems/1147) since $X \in L^1(\Omega, \mathcal{F}, \mathbb{P})$. By the [Conditional Triangle Inequality](/theorems/1148) (part (vi)), $|Y| \leq \mathbb{E}[|X| \mid \mathcal{G}]$ $\mathbb{P}$-a.s. Integrating both sides and applying the [Averaging Property](/theorems/1148) (part (i)) to $|X|$ yields \begin{align*} \mathbb{E}[|Y|] \leq \mathbb{E}\bigl[\mathbb{E}[|X| \mid \mathcal{G}]\bigr] = \mathbb{E}[|X|]. \end{align*} We apply [Markov's Inequality](/theorems/514) to the non-negative random variable $|Y|$ at level $\lambda > 0$. Since $|Y| \colon \Omega \to [0, \infty)$ is $\mathcal{G}$-measurable (hence $\mathcal{F}$-measurable) and non-negative, the hypotheses are satisfied: \begin{align*} \mathbb{P}(|Y| \geq \lambda) \leq \frac{\mathbb{E}[|Y|]}{\lambda} \leq \frac{\mathbb{E}[|X|]}{\lambda}. \end{align*} Choose $\lambda := \mathbb{E}[|X|] / \delta + 1$, so that \begin{align*} \frac{\mathbb{E}[|X|]}{\lambda} = \frac{\mathbb{E}[|X|]}{\mathbb{E}[|X|]/\delta + 1} < \frac{\mathbb{E}[|X|]}{\mathbb{E}[|X|]/\delta} = \delta. \end{align*} Hence $\mathbb{P}(|Y| \geq \lambda) < \delta$. This bound holds for every sub-$\sigma$-algebra $\mathcal{G} \subset \mathcal{F}$ simultaneously, since $\lambda$ depends only on $\mathbb{E}[|X|]$ and $\delta$ — neither of which involves $\mathcal{G}$. [guided] The goal is to find a single threshold $\lambda > 0$, independent of $\mathcal{G}$, such that the set where $|\mathbb{E}[X \mid \mathcal{G}]|$ exceeds $\lambda$ has $\mathbb{P}$-measure strictly less than $\delta$. Write $Y := \mathbb{E}[X \mid \mathcal{G}]$, which exists by the [Existence and Uniqueness of Conditional Expectation](/theorems/1147) since $X \in L^1(\Omega, \mathcal{F}, \mathbb{P})$. We first need an $L^1$ bound on $Y$ that is uniform over all sub-$\sigma$-algebras. The [Conditional Triangle Inequality](/theorems/1148) (part (vi)) gives $|Y| = |\mathbb{E}[X \mid \mathcal{G}]| \leq \mathbb{E}[|X| \mid \mathcal{G}]$ $\mathbb{P}$-a.s. Since conditional expectation preserves non-negativity by the [Positivity Property](/theorems/1148) (part (iv)), $\mathbb{E}[|X| \mid \mathcal{G}] \geq 0$ a.s., so $|Y|$ is bounded above by a non-negative conditional expectation. Integrating and applying the [Averaging Property](/theorems/1148) (part (i)) to $|X|$: \begin{align*} \mathbb{E}[|Y|] \leq \mathbb{E}\bigl[\mathbb{E}[|X| \mid \mathcal{G}]\bigr] = \mathbb{E}[|X|]. \end{align*} This is the $L^1$ contraction property of conditional expectation. The right-hand side $\mathbb{E}[|X|]$ does not depend on $\mathcal{G}$ — this uniformity is the essential point. Now apply [Markov's Inequality](/theorems/514) to the non-negative [measurable function](/page/Measurable%20Functions) $|Y|$ at level $\lambda > 0$. Since $|Y| \colon \Omega \to [0, \infty)$ is $\mathcal{G}$-measurable (hence $\mathcal{F}$-measurable) and non-negative, the hypotheses of Markov's inequality are satisfied: \begin{align*} \mathbb{P}(|Y| \geq \lambda) \leq \frac{\mathbb{E}[|Y|]}{\lambda} \leq \frac{\mathbb{E}[|X|]}{\lambda}. \end{align*} To make this strictly less than $\delta$, we need $\lambda > \mathbb{E}[|X|] / \delta$. Choose $\lambda := \mathbb{E}[|X|] / \delta + 1$. Then \begin{align*} \frac{\mathbb{E}[|X|]}{\lambda} = \frac{\mathbb{E}[|X|]}{\mathbb{E}[|X|]/\delta + 1} < \frac{\mathbb{E}[|X|]}{\mathbb{E}[|X|]/\delta} = \delta, \end{align*} so $\mathbb{P}(|Y| \geq \lambda) < \delta$. The key observation is that $\lambda$ depends only on $\mathbb{E}[|X|]$ and $\delta$, both of which are determined by $X$ and $\varepsilon$ alone. The same $\lambda$ works for every sub-$\sigma$-algebra $\mathcal{G} \subset \mathcal{F}$. [/guided] [/step] [step:Transfer the tail integral of $|Y|$ to an integral of $|X|$ via the integral-matching property] Fix an arbitrary sub-$\sigma$-algebra $\mathcal{G} \subset \mathcal{F}$ and set $Y := \mathbb{E}[X \mid \mathcal{G}]$ as before. The set $A := \{|Y| \geq \lambda\}$ belongs to $\mathcal{G}$ since $Y$ is $\mathcal{G}$-measurable. By the [Conditional Triangle Inequality](/theorems/1148) (part (vi)), $|Y| \leq \mathbb{E}[|X| \mid \mathcal{G}]$ $\mathbb{P}$-a.s. Since $\mathbb{1}_A \geq 0$, multiplying both sides by $\mathbb{1}_A$ preserves the inequality. Taking expectations (which preserves the ordering by monotonicity of the integral): \begin{align*} \mathbb{E}[|Y| \mathbb{1}_A] \leq \mathbb{E}\bigl[\mathbb{E}[|X| \mid \mathcal{G}] \cdot \mathbb{1}_A\bigr]. \end{align*} Since $A \in \mathcal{G}$, the integral-matching condition in the [Existence and Uniqueness of Conditional Expectation](/theorems/1147) (part (ii)), applied to the integrable random variable $|X|$, gives \begin{align*} \mathbb{E}\bigl[\mathbb{E}[|X| \mid \mathcal{G}] \cdot \mathbb{1}_A\bigr] = \mathbb{E}[|X| \mathbb{1}_A]. \end{align*} Combining: \begin{align*} \mathbb{E}[|Y| \mathbb{1}_A] \leq \mathbb{E}[|X| \mathbb{1}_A]. \end{align*} [guided] We need to bound $\mathbb{E}[|Y| \mathbb{1}_{|Y| \geq \lambda}]$. The difficulty is that both $|Y|$ and $\mathbb{1}_{|Y| \geq \lambda}$ are defined in terms of $Y = \mathbb{E}[X \mid \mathcal{G}]$, so we cannot directly apply bounds on $X$. The strategy is to replace $|Y|$ by $\mathbb{E}[|X| \mid \mathcal{G}]$ (which dominates $|Y|$ pointwise a.s.) and then use the integral-matching property to pass from the conditional expectation back to $|X|$. Set $A := \{|Y| \geq \lambda\}$. This set belongs to $\mathcal{G}$ because $Y$ is $\mathcal{G}$-measurable, so $|Y|$ is $\mathcal{G}$-measurable, and the preimage of $[\lambda, \infty)$ under a measurable function is measurable. By the [Conditional Triangle Inequality](/theorems/1148) (part (vi)), $|Y| \leq \mathbb{E}[|X| \mid \mathcal{G}]$ $\mathbb{P}$-a.s. Since $\mathbb{1}_A \geq 0$, multiplying preserves the inequality: \begin{align*} |Y| \mathbb{1}_A \leq \mathbb{E}[|X| \mid \mathcal{G}] \cdot \mathbb{1}_A \quad \mathbb{P}\text{-a.s.} \end{align*} Taking expectations (which preserves the inequality by monotonicity of the integral): \begin{align*} \mathbb{E}[|Y| \mathbb{1}_A] \leq \mathbb{E}\bigl[\mathbb{E}[|X| \mid \mathcal{G}] \cdot \mathbb{1}_A\bigr]. \end{align*} Now comes the key step. Since $A \in \mathcal{G}$ and $\mathbb{1}_A$ is a bounded $\mathcal{G}$-measurable function, the integral-matching condition from the [Existence and Uniqueness of Conditional Expectation](/theorems/1147) (part (ii)) gives \begin{align*} \mathbb{E}\bigl[\mathbb{E}[|X| \mid \mathcal{G}] \cdot \mathbb{1}_A\bigr] = \mathbb{E}[|X| \mathbb{1}_A]. \end{align*} This is the defining identity of conditional expectation: integrating a conditional expectation against a $\mathcal{G}$-measurable indicator recovers the unconditional integral over the same set. Combining: \begin{align*} \mathbb{E}[|Y| \mathbb{1}_A] \leq \mathbb{E}[|X| \mathbb{1}_A]. \end{align*} The right-hand side is now an integral of the original function $|X|$ over a set $A$ whose $\mathbb{P}$-measure we have already controlled. [/guided] [/step] [step:Conclude uniform integrability by combining absolute continuity with the uniform tail bound] From the previous steps, for every sub-$\sigma$-algebra $\mathcal{G} \subset \mathcal{F}$ with $Y = \mathbb{E}[X \mid \mathcal{G}]$ and $A = \{|Y| \geq \lambda\}$: 1. $\mathbb{P}(A) < \delta$, with $\lambda$ independent of $\mathcal{G}$. 2. $\mathbb{E}[|Y| \mathbb{1}_A] \leq \mathbb{E}[|X| \mathbb{1}_A]$. Since $A \in \mathcal{F}$ satisfies $\mathbb{P}(A) < \delta$, the absolute continuity condition from the first step gives $\mathbb{E}[|X| \mathbb{1}_A] < \varepsilon$. Therefore \begin{align*} \mathbb{E}\bigl[|\mathbb{E}[X \mid \mathcal{G}]| \cdot \mathbb{1}_{|\mathbb{E}[X \mid \mathcal{G}]| \geq \lambda}\bigr] \leq \mathbb{E}[|X| \mathbb{1}_{|Y| \geq \lambda}] < \varepsilon. \end{align*} This holds for every sub-$\sigma$-algebra $\mathcal{G} \subset \mathcal{F}$ with the same $\lambda$. Hence \begin{align*} \sup_{\mathcal{G} \subset \mathcal{F}} \mathbb{E}\bigl[|\mathbb{E}[X \mid \mathcal{G}]| \cdot \mathbb{1}_{|\mathbb{E}[X \mid \mathcal{G}]| \geq \lambda}\bigr] \leq \varepsilon, \end{align*} which is the definition of uniform integrability for the family $\{\mathbb{E}[X \mid \mathcal{G}] : \mathcal{G} \subset \mathcal{F}\}$. [guided] We assemble the three ingredients: - **Absolute continuity** (first step): there exists $\delta > 0$ such that $\mathbb{P}(A) < \delta$ implies $\mathbb{E}[|X| \mathbb{1}_A] < \varepsilon$. - **Uniform tail bound** (second step): there exists $\lambda > 0$ (depending only on $\mathbb{E}[|X|]$ and $\delta$) such that $\mathbb{P}(|Y| \geq \lambda) < \delta$ for every $Y = \mathbb{E}[X \mid \mathcal{G}]$. - **Integral transfer** (third step): $\mathbb{E}[|Y| \mathbb{1}_{|Y| \geq \lambda}] \leq \mathbb{E}[|X| \mathbb{1}_{|Y| \geq \lambda}]$. Chaining these: the set $A = \{|Y| \geq \lambda\}$ has $\mathbb{P}(A) < \delta$ by the second step, so $\mathbb{E}[|X| \mathbb{1}_A] < \varepsilon$ by the first step. The integral transfer from the third step then gives \begin{align*} \mathbb{E}[|Y| \mathbb{1}_{|Y| \geq \lambda}] \leq \mathbb{E}[|X| \mathbb{1}_{|Y| \geq \lambda}] < \varepsilon. \end{align*} Since neither $\lambda$ nor $\varepsilon$ depends on the choice of $\mathcal{G}$, the bound $\mathbb{E}[|Y| \mathbb{1}_{|Y| \geq \lambda}] < \varepsilon$ holds uniformly over all sub-$\sigma$-algebras $\mathcal{G} \subset \mathcal{F}$. Taking the supremum: \begin{align*} \sup_{\mathcal{G} \subset \mathcal{F}} \mathbb{E}\bigl[|\mathbb{E}[X \mid \mathcal{G}]| \cdot \mathbb{1}_{|\mathbb{E}[X \mid \mathcal{G}]| \geq \lambda}\bigr] \leq \varepsilon. \end{align*} As $\varepsilon > 0$ was arbitrary (with $\lambda$ depending on $\varepsilon$), this confirms that the family $\{\mathbb{E}[X \mid \mathcal{G}] : \mathcal{G} \text{ a sub-}\sigma\text{-algebra of } \mathcal{F}\}$ is uniformly integrable. [/guided] [/step]

Prerequisites (0/5 completed)

Prerequisites Graph

Interactive dependency map showing how this theorem builds on foundational concepts

Loading dependency graph...

Theorems

Definitions & Concepts

Explore Further

conditional expectation Definition uniform integrability Definition Markov Inequality Theorem #514 Existence and Uniqueness of Conditional Expectation Theorem #1147 Basic Properties of Conditional Expectation Theorem #1148 Affine Regression as Projection of the Conditional Mean Probability & Statistics Logistic Regression Log-Likelihood, Score, and Observed Hessian Probability & Statistics Density of a Transformed Variable Probability Theory Positive Disjoint Events Are Not Independent Probability Theory Complement Rule Probability Theory Student's t Distribution of the Regression Coefficient Test Statistic Probability & Statistics Chi-Squared Distribution of the Residual Sum of Squares Probability & Statistics Confidence Interval for a Single Regression Coefficient in the Normal Linear Model Probability & Statistics Probability & Statistics Area

What brings you to Androma?

Start with a route through the knowledge graph.