[step:Bound each martingale increment by the corresponding bounded difference constant]For $i\in\{1,\dots,n\}$, define the martingale difference $D_i:\Omega\to\mathbb R$ by
\begin{align*}
D_i(\omega):=M_i(\omega)-M_{i-1}(\omega).
\end{align*}
We claim that $D_i$ is almost surely contained in an interval of length at most $c_i$ conditional on $\mathcal F_{i-1}$.
Fix $i$. Let $\mu_j$ denote the law of $X_j$ on $(\mathcal X_j,\mathcal A_j)$. Let $\nu_i$ denote the product probability measure $\mu_{i+1}\otimes\cdots\otimes\mu_n$ on $\mathcal X_{i+1}\times\cdots\times\mathcal X_n$, with the convention that for $i=n$ this is the one-point probability measure on the empty tail. Independence means that the law of $(X_1,\dots,X_n)$ on the product measurable space is $\mu_1\otimes\cdots\otimes\mu_n$. We will use this product-law identity directly to verify conditional expectations, rather than invoking a regular conditional distribution of the unrevealed coordinates.
Define a measurable version $G_i:\mathcal X_1\times\cdots\times\mathcal X_i\to\mathbb R$ of the conditional expectation section as follows. Since $Z\in L^1(\Omega,\mathcal F,\mathbb P)$, the Tonelli theorem applied to $|f|$ under the product law is valid because $|f|$ is nonnegative and measurable. Hence for $(\mu_1\otimes\cdots\otimes\mu_i)$-almost every prefix $(x_1,\dots,x_i)$ the integral of $|f(x_1,\dots,x_i,\cdot)|$ with respect to $\nu_i$ is finite. On this full-measure set define
\begin{align*}
G_i(x_1,\dots,x_i):=\int_{\mathcal X_{i+1}\times\cdots\times\mathcal X_n} f(x_1,\dots,x_i,y_{i+1},\dots,y_n)\, d\nu_i(y_{i+1},\dots,y_n),
\end{align*}
and define $G_i$ to be $0$ on the exceptional null set. For $i=n$, this formula reads $G_n=f$ outside a null set. The function $G_i$ is $\mathcal A_1\otimes\cdots\otimes\mathcal A_i$-measurable by the measurable-parameter form of Tonelli theorem applied to the positive and negative parts of the integrable function $f$.
We verify the conditional-expectation identity by the defining property. Let $H:\Omega\to\mathbb R$ be a bounded $\mathcal F_i$-measurable random variable. By the Doob-Dynkin factorization lemma for $\sigma(X_1,\dots,X_i)$-measurable real random variables, there is a bounded $\mathcal A_1\otimes\cdots\otimes\mathcal A_i$-measurable function $h:\mathcal X_1\times\cdots\times\mathcal X_i\to\mathbb R$ such that
\begin{align*}
H=h(X_1,\dots,X_i)
\end{align*}
almost surely. Since $h$ is bounded and $f$ is integrable under the product law, [Fubini's theorem](/theorems/2961) applies to the integrable function
\begin{align*}
(x_1,\dots,x_n)\mapsto h(x_1,\dots,x_i)f(x_1,\dots,x_n).
\end{align*}
Using the product law of $(X_1,\dots,X_n)$, we have
\begin{align*}
\mathbb E[HZ]
=
\int_{\mathcal X_1\times\cdots\times\mathcal X_n}
h(x_1,\dots,x_i)f(x_1,\dots,x_n)\,d(\mu_1\otimes\cdots\otimes\mu_n)(x_1,\dots,x_n).
\end{align*}
[Fubini's theorem](/theorems/2961) identifies the last integral as
\begin{align*}
\int_{\mathcal X_1\times\cdots\times\mathcal X_i}
h(x_1,\dots,x_i)G_i(x_1,\dots,x_i)\,d(\mu_1\otimes\cdots\otimes\mu_i)(x_1,\dots,x_i).
\end{align*}
By the product law of $(X_1,\dots,X_i)$, this equals
\begin{align*}
\mathbb E[H G_i(X_1,\dots,X_i)].
\end{align*}
Thus $G_i(X_1,\dots,X_i)$ is integrable, $\mathcal F_i$-measurable, and satisfies the defining identity for $\mathbb E[Z\mid\mathcal F_i]$. Hence
\begin{align*}
M_i=G_i(X_1,\dots,X_i)
\end{align*}
almost surely.
By [Fubini's theorem](/theorems/2961) applied to the integrable function $|f|$ under the product law, there is a set $B_i\subset\mathcal X_1\times\cdots\times\mathcal X_{i-1}$ with $(\mu_1\otimes\cdots\otimes\mu_{i-1})(B_i)=1$ such that, for every prefix $x=(x_1,\dots,x_{i-1})\in B_i$, the section $u\mapsto G_i(x_1,\dots,x_{i-1},u)$ is finite for $\mu_i$-almost every $u\in\mathcal X_i$. For such a prefix $x$, define the finite-a.e. one-coordinate section $g_{i,x}:\mathcal X_i\to\mathbb R$ by $g_{i,x}(u):=G_i(x_1,\dots,x_{i-1},u)$ on its full-measure domain.
If $u,v$ belong to this full-measure domain, then the bounded differences hypothesis gives, for every tail point outside a $\nu_i$-null set depending on $u$ and $v$,
\begin{align*}
\left|
f(x_1,\dots,x_{i-1},u,y_{i+1},\dots,y_n)
-
f(x_1,\dots,x_{i-1},v,y_{i+1},\dots,y_n)
\right|
\le c_i.
\end{align*}
Integrating this pointwise inequality with respect to the probability measure $\nu_i$ gives
\begin{align*}
|g_{i,x}(u)-g_{i,x}(v)|
\le c_i.
\end{align*}
Thus the essential range of $g_{i,x}(X_i)$ under $\mu_i$ is contained in an interval of length at most $c_i$. Conditionally on $\mathcal F_{i-1}$, $M_i$ has this essential range and
\begin{align*}
M_{i-1}=\mathbb E[M_i\mid\mathcal F_{i-1}].
\end{align*}
Subtracting $M_{i-1}$ translates the interval, so conditionally on $\mathcal F_{i-1}$, $D_i=M_i-M_{i-1}$ has mean $0$ and is supported, in the essential-range sense, in an interval of length at most $c_i$.[/step]