Attributions & Verification

Track contributions and verify content correctness

Proof

custom_env admin

[step:Prove entropy tensorization on a finite product probability space] [claim:Entropy tensorizes over finite products] Let $\nu=\nu_1\otimes\cdots\otimes\nu_n$ be a finite product of probability measures. For every non-negative integrable $g:E\to[0,\infty)$ with $g\log g\in L^1(\nu)$, one has \begin{align*} \operatorname{Ent}_\nu(g)\leq \sum_{i=1}^n \int_{E_{-i}} \operatorname{Ent}_{\nu_i}(g_{x_{-i}})\,d\nu_{-i}(x_{-i}). \end{align*} [/claim] [proof] It is enough to prove the two-factor inequality and then iterate. Let $(X,\mu)$ and $(Y,\rho)$ be probability spaces, let \begin{align*} h:X\times Y\to[0,\infty) \end{align*} be integrable, and assume $h\log h\in L^1(\mu\otimes\rho)$. Define \begin{align*} a:X\to[0,\infty) \end{align*} by \begin{align*} x\mapsto \int_Y h(x,y)\,d\rho(y). \end{align*} The integrability hypothesis $h\log h\in L^1(\mu\otimes\rho)$ and the lower bound for $s\mapsto s\log s$ ensure that the positive and negative parts occurring below are integrable. Hence the [Fubini theorem](/theorems/513) applies to the entropy decomposition. By the definition of entropy and the product measure, \begin{align*} \operatorname{Ent}_{\mu\otimes\rho}(h)=\int_X \operatorname{Ent}_\rho(h(x,\cdot))\,d\mu(x)+\operatorname{Ent}_\mu(a). \end{align*} It remains to bound the second term by the entropy in the $X$ variable. We use the convexity of the relative entropy integrand. For a probability measure $\mu$, define \begin{align*} \Psi:[0,\infty)\times(0,\infty)\to\mathbb{R} \end{align*} by \begin{align*} (s,r)\mapsto s\log(s/r)-s+r. \end{align*} For fixed $r>0$, take $0\log(0/r)=0$. The function $\Psi$ is jointly convex on $[0,\infty)\times(0,\infty)$. Therefore, for every non-negative $u:X\to[0,\infty)$ with $u\log u\in L^1(\mu)$, \begin{align*} \operatorname{Ent}_\mu(u)=\inf_{r>0}\int_X \Psi(u(x),r)\,d\mu(x). \end{align*} Indeed, the infimum in $r$ is attained at \begin{align*} r=\int_X u\,d\mu. \end{align*} Apply this formula to $a=\int_Y h(\cdot,y)\,d\rho(y)$. Convexity of $\Psi$ and the fact that $\rho$ is a probability measure give, for every integrable map $r:Y\to(0,\infty)$, \begin{align*} \int_X \Psi\left(\int_Y h(x,y)\,d\rho(y),\int_Y r(y)\,d\rho(y)\right)\,d\mu(x)\leq \int_Y\int_X \Psi(h(x,y),r(y))\,d\mu(x)\,d\rho(y). \end{align*} The iterated integral is justified by Tonelli for the non-negative function $\Psi(h(x,y),r(y))$. To pass from the right-hand side to pointwise entropies, choose for each $m\in\mathbb{N}$ a positive [simple function](/page/Simple%20Function) $r_m:Y\to(0,\infty)$ whose values approximate, within $1/m$ on each level set, the scalar minimizers in the variational formula for $\operatorname{Ent}_\mu(h(\cdot,y))$. Monotone approximation of measurable non-negative functions and the variational formula then give \begin{align*} \operatorname{Ent}_\mu(a)\leq \int_Y \operatorname{Ent}_\mu(h(\cdot,y))\,d\rho(y). \end{align*} Substituting this estimate into the entropy decomposition yields \begin{align*} \operatorname{Ent}_{\mu\otimes\rho}(h)\leq \int_X \operatorname{Ent}_\rho(h(x,\cdot))\,d\mu(x)+\int_Y \operatorname{Ent}_\mu(h(\cdot,y))\,d\rho(y). \end{align*} This is the two-factor tensorization inequality. For $n>2$, apply the two-factor inequality to the decomposition $E=(E_1\times\cdots\times E_{n-1})\times E_n$. The first term is then tensorized again over $E_1,\dots,E_{n-1}$. Induction over $n$ gives the asserted finite-product estimate. [/proof] [/step]

custom_env admin

[guided]The point of tensorization is that the entropy of $f^2$ under the product measure can be controlled by entropies in one coordinate at a time. We apply the tensorization estimate to the non-negative function $g:E\to[0,\infty)$ given by $x\mapsto f(x)^2$. Because $f\in C_c^\infty(E)$, the function $g$ is integrable and $g\log g$ is integrable with respect to $\nu$. The tensorization estimate gives \begin{align*} \operatorname{Ent}_\nu(f^2)\leq \sum_{i=1}^n \int_{E_{-i}}\operatorname{Ent}_{\nu_i}\bigl(f_{x_{-i}}^2\bigr)\,d\nu_{-i}(x_{-i}). \end{align*} Now fix an index $i\in\{1,\dots,n\}$ and fix the remaining coordinates $x_{-i}\in E_{-i}$. The function to which we apply the coordinate logarithmic Sobolev inequality is not $f$ on the whole product, but the one-variable slice \begin{align*} f_{x_{-i}}:E_i\to\mathbb{R}. \end{align*} It is given by \begin{align*} x_i\mapsto f(x_1,\dots,x_n). \end{align*} This slice is smooth in the $E_i$ variable. Its support is contained in the projection of $\operatorname{supp} f$ onto $E_i$, hence is compact. Therefore $f_{x_{-i}}\in C_c^\infty(E_i)$, exactly the test class allowed in the hypothesis. The assumed $\operatorname{LSI}(C_i)$ on $(E_i,\nu_i)$ applies to this slice and gives \begin{align*} \operatorname{Ent}_{\nu_i}\bigl(f_{x_{-i}}^2\bigr)\leq 2C_i\int_{E_i}|\nabla_i f_{x_{-i}}(x_i)|^2\,d\nu_i(x_i). \end{align*} By definition of the coordinate gradient, $\nabla_i f_{x_{-i}}(x_i)$ is exactly the gradient of $f$ in the $E_i$ variable at the point $x=(x_1,\dots,x_n)$. Thus \begin{align*} \int_{E_i}|\nabla_i f_{x_{-i}}(x_i)|^2\,d\nu_i(x_i)=\int_{E_i}|\nabla_i f(x_1,\dots,x_n)|^2\,d\nu_i(x_i). \end{align*} We now integrate this inequality over all frozen values $x_{-i}$ with respect to $\nu_{-i}$. Since $\nu=\nu_{-i}\otimes\nu_i$ after the coordinates are ordered with $E_i$ last, Tonelli's theorem applies to the non-negative gradient-square integrand and identifies the iterated integral with the full product integral: \begin{align*} \int_{E_{-i}}\operatorname{Ent}_{\nu_i}\bigl(f_{x_{-i}}^2\bigr)\,d\nu_{-i}(x_{-i})\leq 2C_i\int_E|\nabla_i f(x)|^2\,d\nu(x). \end{align*} Substituting this estimate into the tensorized entropy bound and summing over $i$ gives \begin{align*} \operatorname{Ent}_\nu(f^2)\leq 2\sum_{i=1}^n C_i\int_E|\nabla_i f(x)|^2\,d\nu(x). \end{align*} This is the refined tensorized logarithmic Sobolev inequality, with the individual coordinate constants still visible.[/guided]

custom_env admin

Verification Progress

7 Total Blocks

0 Verified

0% verified

Contributors

admin 7 blocks (0 verified)

Who Can Verify

Areas: Probability & Statistics

Viktor Miykov Admin

Max Vassiliev Global Reviewer

Horia Neagu Global Reviewer

강현욱 Global Reviewer

Demo Testing Global Reviewer

Archie Pennycook Global Reviewer

Quick Actions

Edit Theorem

What brings you to Androma?

Start with a route through the knowledge graph.

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Sign in to Androma

Check your inbox

One last step

Attributions & Verification

Proof

Verification Progress

Contributors

Who Can Verify

Quick Actions

Raw Attribution Data