Sklar's Theorem — Statement & Proof

Sklar's Theorem (Theorem # 4074)

Theorem

Edit Issues Pull Requests Attributions Admin

Discussion

Proof

[proofplan] We first construct a copula for a given joint distribution by passing from each coordinate $X_j$ to a uniform random variable through the randomized distributional transform. The generalized inverse then recovers $X_j$ from this uniform variable almost surely, which gives the [representation formula](/theorems/39). Uniqueness follows because a continuous distribution function has range dense enough, together with right-continuity of copulas, to determine the copula on all of $[0,1]^p$. For the converse, we verify directly that composing a copula with univariate distribution functions preserves monotonicity, right-continuity, boundary limits, and nonnegative rectangle probabilities, and then compute the margins using the uniform marginal property of the copula. [/proofplan] [step:Construct uniform coordinates by the randomized distributional transform] Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space carrying a random vector \begin{align*} X: \Omega &\to \mathbb{R}^p,\\ \omega &\mapsto (X_1(\omega),\dots,X_p(\omega)), \end{align*} whose joint distribution function is $F$. For each $j \in \{1,\dots,p\}$, let \begin{align*} F_j(t-) := \lim_{s \uparrow t} F_j(s) \end{align*} denote the left limit of the marginal distribution function $F_j$ at $t \in \mathbb{R}$. Enlarge the probability space, if necessary, so that it also carries random variables \begin{align*} V_j: \Omega \to [0,1], \qquad j \in \{1,\dots,p\}, \end{align*} which are independent, each uniformly distributed on $[0,1]$, and independent of $X$. Define \begin{align*} U_j: \Omega &\to [0,1],\\ \omega &\mapsto F_j(X_j(\omega)-)+V_j(\omega)\bigl(F_j(X_j(\omega))-F_j(X_j(\omega)-)\bigr). \end{align*} Then each $U_j$ is uniformly distributed on $[0,1]$. [guided] The purpose of the variables $U_j$ is to replace each marginal coordinate by a uniform coordinate without assuming that the marginal distribution function is continuous. If $F_j$ were continuous, the simpler transform $F_j(X_j)$ would already be uniform. When $F_j$ has jumps, the expression \begin{align*} F_j(X_j-)+V_j\bigl(F_j(X_j)-F_j(X_j-)\bigr) \end{align*} spreads the mass at each atom uniformly across the jump interval. We verify the uniformity. Fix $j \in \{1,\dots,p\}$ and $u \in [0,1]$. Define the generalized inverse \begin{align*} Q_j: (0,1) &\to \mathbb{R},\\ r &\mapsto \inf\{t \in \mathbb{R}: F_j(t) \ge r\}. \end{align*} For $u=0$ and $u=1$, the identity $\mathbb{P}(U_j \le u)=u$ follows from the definition and from the fact that $V_j$ has no atoms at $0$ or $1$. Now assume $0<u<1$ and set $a:=Q_j(u)$. By definition of $Q_j$, we have \begin{align*} F_j(a-) \le u \le F_j(a). \end{align*} The event $\{U_j \le u\}$ is split according to whether $X_j<a$, $X_j=a$, or $X_j>a$. If $X_j<a$, then $F_j(X_j)\le F_j(a-)\le u$, so $U_j\le u$. If $X_j>a$, then $F_j(X_j-)\ge F_j(a)\ge u$, and the event $U_j\le u$ has probability zero except at the endpoint case, which is harmless because $V_j$ has no atoms. On the atom $\{X_j=a\}$, conditional on $X_j=a$, the random variable $U_j$ is uniformly distributed on the interval $[F_j(a-),F_j(a)]$. Therefore \begin{align*} \mathbb{P}(U_j \le u) &= \mathbb{P}(X_j<a) + \mathbb{P}(X_j=a)\frac{u-F_j(a-)}{F_j(a)-F_j(a-)}\\ &= F_j(a-) + \bigl(F_j(a)-F_j(a-)\bigr) \frac{u-F_j(a-)}{F_j(a)-F_j(a-)}\\ &= u, \end{align*} with the second term interpreted as $0$ when $F_j(a)=F_j(a-)$. Hence $U_j$ is uniformly distributed on $[0,1]$. [/guided] [/step] [step:Recover the original coordinates from generalized inverses] For each $j \in \{1,\dots,p\}$, define the generalized inverse \begin{align*} Q_j: (0,1) &\to \mathbb{R},\\ u &\mapsto \inf\{t \in \mathbb{R}: F_j(t) \ge u\}. \end{align*} Then \begin{align*} Q_j(U_j)=X_j \end{align*} almost surely. Indeed, if $X_j(\omega)=x$, then \begin{align*} F_j(x-) \le U_j(\omega) \le F_j(x), \end{align*} and, except on the null event where $V_j(\omega)$ is an endpoint value inside a jump interval, this places $U_j(\omega)$ strictly inside the jump interval corresponding to $x$ whenever $x$ is an atom. Hence the defining property of $Q_j$ gives $Q_j(U_j(\omega))=x$ almost surely. Moreover, for every $x_j \in \mathbb{R}$ and every $u \in (0,1)$, \begin{align*} Q_j(u)\le x_j \quad \Longleftrightarrow \quad u \le F_j(x_j). \end{align*} [guided] The generalized inverse is the device that turns the uniform variable back into a variable with distribution function $F_j$. Its basic order property is \begin{align*} Q_j(u)\le x_j \quad \Longleftrightarrow \quad u \le F_j(x_j). \end{align*} To prove it, first suppose $Q_j(u)\le x_j$. By the definition of $Q_j(u)$ as an infimum, choose a decreasing sequence $(t_n)_{n=1}^{\infty}$ with $t_n \downarrow Q_j(u)$ and $F_j(t_n)\ge u$ for every $n$. Since $F_j$ is right-continuous, \begin{align*} F_j(Q_j(u))=\lim_{n\to\infty}F_j(t_n)\ge u. \end{align*} Because $F_j$ is nondecreasing and $Q_j(u)\le x_j$, it follows that $u\le F_j(x_j)$. Conversely, if $u\le F_j(x_j)$, then $x_j$ belongs to the set \begin{align*} \{t \in \mathbb{R}: F_j(t)\ge u\}, \end{align*} so the infimum of that set satisfies $Q_j(u)\le x_j$. Now apply this to $u=U_j(\omega)$. By construction, \begin{align*} F_j(X_j(\omega)-)\le U_j(\omega)\le F_j(X_j(\omega)). \end{align*} On non-atomic values of $X_j$, the interval collapses and the inverse returns $X_j(\omega)$. On atomic values, the auxiliary uniform variable $V_j$ chooses a point inside the jump interval, and endpoint choices occur with probability zero. Thus \begin{align*} Q_j(U_j)=X_j \end{align*} almost surely. [/guided] [/step] [step:Define the copula and prove the representation formula] Define \begin{align*} C: [0,1]^p &\to [0,1],\\ (u_1,\dots,u_p) &\mapsto \mathbb{P}(U_1\le u_1,\dots,U_p\le u_p). \end{align*} Since $(U_1,\dots,U_p)$ is a random vector with values in $[0,1]^p$ and each $U_j$ is uniformly distributed on $[0,1]$, the function $C$ is a $p$-copula. For $x=(x_1,\dots,x_p)\in\mathbb{R}^p$, using $X_j=Q_j(U_j)$ almost surely and the equivalence $Q_j(u)\le x_j \Longleftrightarrow u\le F_j(x_j)$, we obtain \begin{align*} F(x_1,\dots,x_p) &= \mathbb{P}(X_1\le x_1,\dots,X_p\le x_p)\\ &= \mathbb{P}(Q_1(U_1)\le x_1,\dots,Q_p(U_p)\le x_p)\\ &= \mathbb{P}(U_1\le F_1(x_1),\dots,U_p\le F_p(x_p))\\ &= C(F_1(x_1),\dots,F_p(x_p)). \end{align*} This proves existence. [/step] [step:Prove uniqueness when the marginal distribution functions are continuous] Assume now that $F_1,\dots,F_p$ are continuous. Let $C$ and $\widetilde{C}$ be $p$-copulas satisfying \begin{align*} F(x_1,\dots,x_p) = C(F_1(x_1),\dots,F_p(x_p)) = \widetilde{C}(F_1(x_1),\dots,F_p(x_p)) \end{align*} for every $x=(x_1,\dots,x_p)\in\mathbb{R}^p$. For each $j$, continuity of $F_j$, together with \begin{align*} \lim_{t\to -\infty}F_j(t)=0, \qquad \lim_{t\to \infty}F_j(t)=1, \end{align*} implies that the range $F_j(\mathbb{R})$ is dense in $[0,1]$. Therefore the product set \begin{align*} F_1(\mathbb{R})\times \cdots \times F_p(\mathbb{R}) \end{align*} is dense in $[0,1]^p$. The preceding identity gives $C=\widetilde{C}$ on this dense product set. Since copulas are distribution functions on $[0,1]^p$, they are right-continuous in each coordinate; hence equality on a [dense subset](/page/Dense%20Subset) implies equality at every point of $[0,1]^p$. Thus $C=\widetilde{C}$. [guided] To prove uniqueness, we must show that the representation formula determines $C(u_1,\dots,u_p)$ for every $u\in[0,1]^p$. The formula directly determines $C$ only at points of the form \begin{align*} (F_1(x_1),\dots,F_p(x_p)). \end{align*} So the key question is: are these points dense in the whole cube? For each $j$, the distribution function $F_j$ is continuous, nondecreasing, and satisfies \begin{align*} \lim_{t\to -\infty}F_j(t)=0, \qquad \lim_{t\to \infty}F_j(t)=1. \end{align*} Fix $u_j\in(0,1)$ and $\varepsilon>0$. Choose $a,b\in\mathbb{R}$ such that \begin{align*} F_j(a)<u_j< F_j(b). \end{align*} By the intermediate value property for the continuous function $F_j$, there exists $x_j\in[a,b]$ with $F_j(x_j)=u_j$. Endpoint values $0$ and $1$ are obtained as limits as $x_j\to-\infty$ and $x_j\to\infty$. Thus $F_j(\mathbb{R})$ is dense in $[0,1]$. Now let $C$ and $\widetilde C$ be two copulas giving the same representation of $F$. For every $x\in\mathbb{R}^p$, \begin{align*} C(F_1(x_1),\dots,F_p(x_p)) = \widetilde C(F_1(x_1),\dots,F_p(x_p)). \end{align*} Hence $C$ and $\widetilde C$ agree on the dense product \begin{align*} F_1(\mathbb{R})\times\cdots\times F_p(\mathbb{R}). \end{align*} Copulas are joint distribution functions, so they are right-continuous in each coordinate. Given any $u\in[0,1]^p$, choose a sequence $(u_k)_{k=1}^{\infty}$ from the dense product with $u_k\downarrow u$ coordinatewise. Right-continuity gives \begin{align*} C(u)=\lim_{k\to\infty}C(u_k) =\lim_{k\to\infty}\widetilde C(u_k) =\widetilde C(u). \end{align*} Therefore $C=\widetilde C$ on $[0,1]^p$. [/guided] [/step] [step:Verify that composing a copula with marginal distribution functions gives a joint distribution function] Conversely, let $C:[0,1]^p\to[0,1]$ be a $p$-copula and let $F_1,\dots,F_p$ be univariate distribution functions. Define \begin{align*} G: \mathbb{R}^p &\to [0,1],\\ (x_1,\dots,x_p) &\mapsto C(F_1(x_1),\dots,F_p(x_p)). \end{align*} Since each $F_j$ is nondecreasing and $C$ is nondecreasing in each coordinate, $G$ is nondecreasing in each coordinate. Since each $F_j$ is right-continuous and $C$ is right-continuous, $G$ is right-continuous. Let $a_j,b_j\in\mathbb{R}$ satisfy $a_j<b_j$ for every $j$. The $G$-mass of the rectangle \begin{align*} (a_1,b_1]\times\cdots\times(a_p,b_p] \end{align*} is \begin{align*} \sum_{\varepsilon\in\{0,1\}^p} (-1)^{p-\varepsilon_1-\cdots-\varepsilon_p} G(y_{\varepsilon,1},\dots,y_{\varepsilon,p}), \end{align*} where $y_{\varepsilon,j}=b_j$ if $\varepsilon_j=1$ and $y_{\varepsilon,j}=a_j$ if $\varepsilon_j=0$. Substituting the definition of $G$ gives the corresponding $C$-mass of the rectangle \begin{align*} (F_1(a_1),F_1(b_1)]\times\cdots\times(F_p(a_p),F_p(b_p)] \end{align*} inside $[0,1]^p$, which is nonnegative because $C$ is a copula. Thus $G$ has nonnegative rectangle probabilities. Finally, \begin{align*} \lim_{x_1,\dots,x_p\to\infty}G(x_1,\dots,x_p)=C(1,\dots,1)=1, \end{align*} and if $x_j\to-\infty$ in any coordinate, then $F_j(x_j)\to0$, so the groundedness of the copula gives \begin{align*} G(x_1,\dots,x_p)\to0. \end{align*} Therefore $G$ is a joint distribution function on $\mathbb{R}^p$. [/step] [step:Compute the margins of the constructed joint distribution function] Fix $j\in\{1,\dots,p\}$ and $x_j\in\mathbb{R}$. The $j$-th marginal distribution function of $G$ is obtained by sending every coordinate except the $j$-th one to $+\infty$. Since each $F_i$ satisfies $\lim_{t\to\infty}F_i(t)=1$, and since a copula has uniform one-dimensional margins, we get \begin{align*} \lim_{\substack{x_i\to\infty\\ i\ne j}} G(x_1,\dots,x_p) &= C(1,\dots,1,F_j(x_j),1,\dots,1)\\ &=F_j(x_j). \end{align*} Thus the $j$-th marginal distribution function of $G$ is $F_j$. Since $j$ was arbitrary, all prescribed margins are recovered, completing the proof. [/step]

Prerequisites (0/3 completed)

Prerequisites Graph

Interactive dependency map showing how this theorem builds on foundational concepts

Loading dependency graph...

Theorems

Representation Formula

Definitions & Concepts

Explore Further

Continuity Definition Distribution Definition Representation Formula Theorem #39 Covariance Matrix Is Positive Semidefinite probability Rank Correlations for the Bivariate Normal Distribution probability Covariance Stationarity Criterion for the GARCH(1,1) Process probability Tracy-Widom Limit for the Largest Eigenvalue of a Real Wishart Matrix probability Differencing Removes a Simple Unit Root probability Eckart–Young–Mirsky Theorem probability Independence of the Sample Mean and Sample Covariance for a Multivariate Normal Sample probability Kalman Filter Recursion Theorem probability

What brings you to Androma?

Start with a route through the knowledge graph.

Sklar's Theorem (Theorem # 4074)

Discussion

Proof

Prerequisites (0/3 completed)

Prerequisites Graph

Explore Further

Sign in to Androma

Check your inbox

One last step

Sklar's Theorem (Theorem # 4074)

Discussion

Proof

Prerequisites (0/3 completed)

Prerequisites Graph

Explore Further