[guided]The goal is to convert the exponential-moment hypothesis into a tail bound for the average. The standard method is Chernoff's bound: exponentiate the event, use Markov's inequality, and then optimize the exponential parameter.
Define the centered sum $S_n: \Omega \to \mathbb R$ by
\begin{align*}
S_n(\omega) := \sum_{i=1}^n (X_i(\omega)-\mu).
\end{align*}
The event $\{\bar X_n-\mu \ge r\}$ is exactly the event $\{S_n \ge nr\}$. Since $\lambda>0$, the exponential map is increasing, so this event is also $\{e^{\lambda S_n} \ge e^{\lambda nr}\}$. For a parameter $\lambda \in (0,1/b)$, the map $\omega \mapsto e^{\lambda S_n(\omega)}$ is a non-negative random variable, so the [Markov inequality](/theorems/514) applies. We obtain
\begin{align*}
\mathbb P(\bar X_n-\mu \ge r) = \mathbb P(e^{\lambda S_n} \ge e^{\lambda nr}).
\end{align*}
Then Markov's inequality gives
\begin{align*}
\mathbb P(\bar X_n-\mu \ge r) \le e^{-\lambda nr}\mathbb E[e^{\lambda S_n}].
\end{align*}
Now we estimate the expectation. Since $X_1,\dots,X_n$ are independent, the centered variables $X_1-\mu,\dots,X_n-\mu$ are also independent. Therefore the exponential of the sum factors. First,
\begin{align*}
\mathbb E[e^{\lambda S_n}] = \mathbb E\left[\prod_{i=1}^n e^{\lambda(X_i-\mu)}\right].
\end{align*}
By independence of the centered variables, the expectation of the product is the product of the expectations:
\begin{align*}
\mathbb E[e^{\lambda S_n}] = \prod_{i=1}^n \mathbb E[e^{\lambda(X_i-\mu)}].
\end{align*}
The hypothesis applies to this positive $\lambda$ because $\lambda \in (0,1/b)$. Hence
\begin{align*}
\mathbb E[e^{\lambda S_n}] \le \prod_{i=1}^n \exp\left(\frac{\nu^2\lambda^2}{2}\right) = \exp\left(\frac{n\nu^2\lambda^2}{2}\right).
\end{align*}
Substituting this into the Markov estimate gives
\begin{align*}
\mathbb P(\bar X_n-\mu \ge r)
\le \exp\left(-n\lambda r+\frac{n\nu^2\lambda^2}{2}\right).
\end{align*}
We now choose $\lambda$ to make the exponent as negative as possible while respecting $\lambda<1/b$. If $0<r<\nu^2/b$, the unconstrained minimizer $\lambda=r/\nu^2$ lies in $(0,1/b)$. Substituting this value of $\lambda$ gives
\begin{align*}
-n\lambda r+\frac{n\nu^2\lambda^2}{2} = -\frac{nr^2}{2\nu^2}.
\end{align*}
Thus
\begin{align*}
\mathbb P(\bar X_n-\mu \ge r)
\le \exp\left(-\frac{nr^2}{2\nu^2}\right).
\end{align*}
If $r \ge \nu^2/b$, the minimizer $r/\nu^2$ is not necessarily allowed, so we push $\lambda$ up to the boundary of the admissible interval. The hypothesis is stated for $\lambda<1/b$, so we use the bound for $\lambda \in (0,1/b)$ and then take the limit $\lambda \uparrow 1/b$. This gives
\begin{align*}
\mathbb P(\bar X_n-\mu \ge r)
&\le \exp\left(-\frac{nr}{b}+\frac{n\nu^2}{2b^2}\right).
\end{align*}
Because $r \ge \nu^2/b$, we have $\nu^2/b^2 \le r/b$, and hence
\begin{align*}
-\frac{nr}{b}+\frac{n\nu^2}{2b^2} \le -\frac{nr}{2b}.
\end{align*}
Therefore
\begin{align*}
\mathbb P(\bar X_n-\mu \ge r)
\le \exp\left(-\frac{nr}{2b}\right).
\end{align*}
Combining the small-deviation and large-deviation cases gives the single bound
\begin{align*}
\mathbb P(\bar X_n-\mu \ge r)
\le \exp\left(-\frac{n}{2}\min\left\{\frac{r^2}{\nu^2},\frac{r}{b}\right\}\right).
\end{align*}[/guided]