[step:Prove the Cauchy-Schwarz estimate for one distinguished function]Let $d\ge 1$ be an integer. Let $c_1,\dots,c_d\in G$ be elements such that multiplication by each $c_i$ is a bijection of $G$. Let
\begin{align*}
F:G&\to\mathbb C,\\
x&\mapsto F(x)
\end{align*}
be any function, and for each $i\in\{1,\dots,d\}$ let
\begin{align*}
g_i:G&\to\mathbb C,\\
x&\mapsto g_i(x)
\end{align*}
be a function satisfying $|g_i(x)|\le 1$ for every $x\in G$. We claim that
\begin{align*}
\left|
\mathbb E_{x,r\in G}
F(x)\prod_{i=1}^d g_i(x+c_i r)
\right|
\le
\|F\|_{U^d(G)}.
\end{align*}
We prove the claim by induction on $d$. For $d=1$, since multiplication by $c_1$ is bijective, the map
\begin{align*}
G\times G&\to G\times G,\\
(x,r)&\mapsto (x,x+c_1r)
\end{align*}
is a bijection. Hence
\begin{align*}
\mathbb E_{x,r\in G}F(x)g_1(x+c_1r)
=
\left(\mathbb E_{x\in G}F(x)\right)
\left(\mathbb E_{y\in G}g_1(y)\right).
\end{align*}
Because $|g_1|\le 1$, the second factor has absolute value at most $1$, and therefore
\begin{align*}
\left|
\mathbb E_{x,r\in G}F(x)g_1(x+c_1r)
\right|
\le
\left|\mathbb E_{x\in G}F(x)\right|
=
\|F\|_{U^1(G)}.
\end{align*}
Assume the estimate holds for $d-1$. Define
\begin{align*}
A
:=
\mathbb E_{x,r\in G}
F(x)\prod_{i=1}^d g_i(x+c_i r).
\end{align*}
Make the bijective change of variables $u=x+c_d r$, so that $x=u-c_d r$. Then
\begin{align*}
A
=
\mathbb E_{u,r\in G}
g_d(u)F(u-c_d r)
\prod_{i=1}^{d-1}g_i(u+(c_i-c_d)r).
\end{align*}
Applying Cauchy-Schwarz in the $u$-variable and using $|g_d(u)|\le 1$ gives
\begin{align*}
|A|^2
\le
\mathbb E_{u\in G}
\left|
\mathbb E_{r\in G}
F(u-c_d r)
\prod_{i=1}^{d-1}g_i(u+(c_i-c_d)r)
\right|^2.
\end{align*}
Expanding the square and setting $h=s-r$, $y=u-c_d r$, we obtain
\begin{align*}
|A|^2
\le
\mathbb E_{h\in G}
\left|
\mathbb E_{y,r\in G}
F(y)\overline{F(y-c_dh)}
\prod_{i=1}^{d-1}
g_i(y+c_i r)
\overline{g_i(y+c_i r+(c_i-c_d)h)}
\right|.
\end{align*}
For each fixed $h\in G$, define
\begin{align*}
F_h:G&\to\mathbb C,\\
y&\mapsto F(y)\overline{F(y-c_dh)},
\end{align*}
and, for $i\in\{1,\dots,d-1\}$, define
\begin{align*}
G_{i,h}:G&\to\mathbb C,\\
z&\mapsto g_i(z)\overline{g_i(z+(c_i-c_d)h)}.
\end{align*}
Since $|g_i|\le 1$, each $G_{i,h}$ is bounded in absolute value by $1$. The induction hypothesis applied to $F_h$ and the coefficients $c_1,\dots,c_{d-1}$ gives
\begin{align*}
|A|^2
\le
\mathbb E_{h\in G}\|F_h\|_{U^{d-1}(G)}.
\end{align*}
By [Jensen's inequality](/theorems/1977) for the convex function $t\mapsto t^{2^{d-1}}$ on $[0,\infty)$,
\begin{align*}
|A|^{2^d}
\le
\mathbb E_{h\in G}\|F_h\|_{U^{d-1}(G)}^{2^{d-1}}.
\end{align*}
Expanding the $U^{d-1}(G)$ norm gives
\begin{align*}
|A|^{2^d}
\le
\mathbb E_{h,y,h_1,\dots,h_{d-1}\in G}
\prod_{\omega\in\{0,1\}^{d-1}}
\mathcal C^{|\omega|}F(y+\omega_1h_1+\cdots+\omega_{d-1}h_{d-1})
\\
\cdot
\prod_{\omega\in\{0,1\}^{d-1}}
\mathcal C^{|\omega|+1}F(y+\omega_1h_1+\cdots+\omega_{d-1}h_{d-1}-c_dh).
\end{align*}
Since multiplication by $c_d$ is a bijection of $G$, the substitution $h_d=-c_dh$ preserves normalized counting measure on $G$. The last average is exactly
\begin{align*}
\mathbb E_{y,h_1,\dots,h_d\in G}
\prod_{\omega\in\{0,1\}^{d}}
\mathcal C^{|\omega|}
F(y+\omega_1h_1+\cdots+\omega_dh_d)
=
\|F\|_{U^d(G)}^{2^d}.
\end{align*}
Taking $2^d$-th roots proves the claim.[/step]