Back-Door Adjustment Theorem — Statement & Proof

Back-Door Adjustment Theorem (Theorem # 9672)

Theorem

Edit Issues Pull Requests Attributions Admin

Let $G$ be a finite causal directed acyclic graph whose endogenous variables include a treatment [random variable](/page/Random%20Variable) $A:(\Omega,\mathcal F)\to(\mathcal A,\mathcal E_{\mathcal A})$, an outcome random variable $Y:(\Omega,\mathcal F)\to(\mathcal Y,\mathcal E_{\mathcal Y})$, and a covariate random element $Z:(\Omega,\mathcal F)\to(\mathcal Z,\mathcal E_{\mathcal Z})$, where the measurable spaces are standard Borel and $Z$ is disjoint from $A$ and $Y$. Let the observational law be generated by a recursive structural causal model compatible with $G$, with well-defined interventions, consistency, and modularity semantics. Suppose the observational law is Markov with respect to $G$, and suppose the single-world intervention graph construction for $do(A=a)$ represents the corresponding joint law of the observed variables and counterfactual outcome $Y_a$ in the sense used by the [single-world intervention graph adjustment criterion](/theorems/9673). Assume that $Z$ satisfies the back-door criterion relative to $(A,Y)$: no element of $Z$ is a descendant of $A$, and $Z$ blocks every path from $A$ to $Y$ that has an arrow into $A$. Fix a treatment level $a\in\mathcal A$. Let $\mu_Z:=\mathbb P\circ Z^{-1}$ be the observational marginal law of $Z$. Assume there exists a regular [conditional probability](/page/Conditional%20Probability) kernel \begin{align*} K_{A\mid Z}:\mathcal Z\times\mathcal E_{\mathcal A}\to[0,1] \end{align*} for the law of $A$ given $Z$ such that \begin{align*} K_{A\mid Z}(z,\{a\})>0 \end{align*} for $\mu_Z$-almost every $z\in\mathcal Z$. Assume also that there exists a regular conditional probability kernel \begin{align*} K_{Y\mid A,Z}:\mathcal A\times\mathcal Z\times\mathcal E_{\mathcal Y}\to[0,1] \end{align*} for the law of $Y$ given $(A,Z)$; write $K_{Y\mid A,Z}(a,z,B)$ as $\mathbb P(Y\in B\mid A=a,Z=z)$ for $B\in\mathcal E_{\mathcal Y}$ and for $\mu_Z$-almost every $z$. Then, for every measurable set $B\in\mathcal E_{\mathcal Y}$, \begin{align*} \mathbb P(Y\in B\mid do(A=a))=\int_{\mathcal Z}\mathbb P(Y\in B\mid A=a,Z=z)\,d\mu_Z(z). \end{align*}

Discussion

Proof

[proofplan] The proof converts the graphical back-door condition into the probabilistic hypotheses of the conditional-exchangeability adjustment formula. The causal semantics give a potential outcome $Y_a$ corresponding to the intervention $do(A=a)$ and give consistency on the event $\{A=a\}$. The [single-world intervention graph adjustment criterion](/theorems/9673) converts the back-door blocking condition into conditional exchangeability of $Y_a$ and $A$ given $Z$. We then apply the point-treatment adjustment formula and identify the interventional law with the law of $Y_a$. [/proofplan] [step:Introduce the intervention outcome and the observational law of the covariates] Let $Y_a:(\Omega,\mathcal F)\to(\mathcal Y,\mathcal E_{\mathcal Y})$ denote the potential outcome generated by the compatible causal model under the intervention $do(A=a)$. Let \begin{align*} \mu_Z:\mathcal E_{\mathcal Z}\to[0,1] \end{align*} be the observational marginal law of $Z$, defined by \begin{align*} \mu_Z(C)=\mathbb P(Z\in C) \end{align*} for each $C\in\mathcal E_{\mathcal Z}$. By the intervention semantics of the compatible causal model, the interventional outcome law is the law of this potential outcome: \begin{align*} \mathbb P(Y\in B\mid do(A=a))=\mathbb P(Y_a\in B) \end{align*} for every $B\in\mathcal E_{\mathcal Y}$. By consistency, the observed outcome agrees with the intervention outcome on the observed treatment stratum: \begin{align*} Y=Y_a\quad\text{a.s. on }\{A=a\}. \end{align*} [/step] [step:Use the back-door criterion to obtain conditional exchangeability] The back-door criterion says that $Z$ blocks every path from $A$ to $Y$ that has an arrow into $A$ and contains no descendant of $A$. These are exactly the graphical hypotheses needed to apply the [citetheorem:9673] to the single-world intervention graph for $do(A=a)$. The theorem applies because the formal statement assumes that this graph represents the joint law of the observed variables and the counterfactual outcome $Y_a$ for the compatible causal model. Therefore, \begin{align*} Y_a\perp\!\!\!\perp A\mid Z. \end{align*} [guided] We need to justify the probabilistic independence assumption that will feed into the adjustment formula. The object whose distribution we want is $Y_a$, the outcome that would be observed if the structural mechanism for $A$ were replaced by the constant value $a$. Since $Y_a$ is a counterfactual outcome, ordinary d-separation in the observational graph $G$ is not by itself enough: $Y_a$ is not an ordinary observed node of $G$. The statement supplies the needed bridge by assuming that the single-world intervention graph construction for $do(A=a)$ represents the joint law of the observed variables and $Y_a$ for the compatible causal model. We now verify the hypotheses of the [citetheorem:9673]. The first hypothesis is that $Z$ contains no descendant of the treatment intervention in the relevant single-world intervention graph. This follows from the back-door criterion assumption that no element of $Z$ is a descendant of $A$ in $G$. The second hypothesis is the graphical blocking condition: $Z$ blocks every path from $A$ to $Y$ that has an arrow into $A$. This is exactly the remaining part of the back-door criterion. Applying the [citetheorem:9673] with treatment node $A$, outcome node $Y$, covariate set $Z$, and intervention value $a$ gives the conditional independence statement \begin{align*} Y_a\perp\!\!\!\perp A\mid Z. \end{align*} This is the conditional exchangeability hypothesis needed for adjustment. [/guided] [/step] [step:Apply conditional exchangeability adjustment at the fixed treatment level] The variables $A$, $Z$, $Y$, and $Y_a$ are measurable random elements on standard Borel spaces, so the regular conditional laws appearing below may be chosen as probability kernels. The conditional treatment kernel satisfies $K_{A\mid Z}(z,\{a\})>0$ for $\mu_Z$-almost every $z$, and the regular conditional outcome kernel gives the observed conditional law \begin{align*} z\mapsto \mathbb P(Y\in B\mid A=a,Z=z) \end{align*} for $\mu_Z$-almost every $z$ and every $B\in\mathcal E_{\mathcal Y}$. The preceding steps give the hypotheses of the [point-treatment adjustment formula under conditional exchangeability](/theorems/9671) [citetheorem:9671]: consistency gives $Y=Y_a$ a.s. on $\{A=a\}$, conditional exchangeability gives $Y_a\perp\!\!\!\perp A\mid Z$, and positivity gives the required conditional law on the support of $Z$. Therefore, for every $B\in\mathcal E_{\mathcal Y}$, \begin{align*} \mathbb P(Y_a\in B)=\int_{\mathcal Z}\mathbb P(Y\in B\mid A=a,Z=z)\,d\mu_Z(z). \end{align*} [/step] [step:Identify the potential-outcome law with the interventional law] Combining the intervention-law identity from the first step with the adjustment identity from the previous step yields \begin{align*} \mathbb P(Y\in B\mid do(A=a))=\int_{\mathcal Z}\mathbb P(Y\in B\mid A=a,Z=z)\,d\mu_Z(z). \end{align*} This holds for every measurable set $B\in\mathcal E_{\mathcal Y}$, because the stated regular conditional outcome kernel defines the observed conditional probabilities at $a$ and the conditional treatment positivity condition holds for $\mu_Z$-almost every $z$. Hence the [back-door adjustment formula](/theorems/9695) follows. [/step]

Prerequisites (0/2 completed)

Prerequisites Graph

Interactive dependency map showing how this theorem builds on foundational concepts

Loading dependency graph...

Definitions & Concepts

Explore Further

What brings you to Androma?

Start with a route through the knowledge graph.

Back-Door Adjustment Theorem (Theorem # 9672)

Discussion

Proof

Prerequisites (0/2 completed)

Prerequisites Graph

Explore Further

Sign in to Androma

Check your inbox

One last step

Back-Door Adjustment Theorem (Theorem # 9672)

Discussion

Proof

Prerequisites (0/2 completed)

Prerequisites Graph

Explore Further