[guided]Fix $x \in X$ and $h > 0$. We want to show that the average $x_h := \frac{1}{h}\int_0^h T(s)x\, d\mathcal{L}^1(s)$ is in the domain of $A$, and that we can compute $Ax_h$ explicitly. To do this, we must evaluate the limit
\begin{align*}
\lim_{r \to 0^+} \frac{T(r)x_h - x_h}{r}
\end{align*}
and show it exists in $X$. The strategy is to apply $T(r)$ inside the integral and use the semigroup property to convert $T(r)T(s) = T(r + s)$ into a translation of the integration variable; the resulting expression then becomes a difference of two integrals over short intervals, whose limit we recognise via the fundamental theorem of calculus for Bochner integrals.
Let $r \in (0, h)$. The bounded operator $T(r) \in \mathcal{L}(X)$ commutes with the Bochner integral, and the semigroup law gives
\begin{align*}
T(r) x_h &= \frac{1}{h} T(r) \int_0^h T(s) x \, d\mathcal{L}^1(s) = \frac{1}{h} \int_0^h T(r + s) x \, d\mathcal{L}^1(s).
\end{align*}
We change variables: let $u = r + s$, so $d\mathcal{L}^1(s) = d\mathcal{L}^1(u)$ and as $s$ ranges over $[0, h]$ the new variable $u$ ranges over $[r, r + h]$. This gives
\begin{align*}
T(r) x_h &= \frac{1}{h} \int_r^{r+h} T(u) x \, d\mathcal{L}^1(u).
\end{align*}
Now we subtract $x_h = \frac{1}{h}\int_0^h T(u) x\, d\mathcal{L}^1(u)$. The two integrals overlap on $[r, h]$ (here is where we use $r < h$, so the interval is non-empty), and the overlap cancels:
\begin{align*}
T(r) x_h - x_h &= \frac{1}{h}\left( \int_h^{r+h} T(u) x \, d\mathcal{L}^1(u) - \int_0^r T(u) x \, d\mathcal{L}^1(u) \right).
\end{align*}
Dividing by $r$ rearranges this into a sum of two short-interval averages:
\begin{align*}
\frac{T(r) x_h - x_h}{r} &= \frac{1}{h}\left( \frac{1}{r}\int_h^{r+h} T(u) x\, d\mathcal{L}^1(u) - \frac{1}{r}\int_0^r T(u) x\, d\mathcal{L}^1(u) \right).
\end{align*}
Each of these is the average of a continuous function $u \mapsto T(u)x$ over a small interval. Since $u \mapsto T(u)x$ is continuous on $[0, \infty)$ (the strong continuity hypothesis), the fundamental theorem of calculus for Bochner integrals yields
\begin{align*}
\lim_{r \to 0^+} \frac{1}{r}\int_h^{r+h} T(u)x\, d\mathcal{L}^1(u) &= T(h)x, & \lim_{r \to 0^+} \frac{1}{r}\int_0^r T(u)x\, d\mathcal{L}^1(u) &= T(0)x = x.
\end{align*}
Therefore the limit defining $A x_h$ exists, which by definition means $x_h \in D(A)$, and
\begin{align*}
A x_h &= \frac{1}{h}(T(h)x - x).
\end{align*}
The key idea is that averaging the orbit smooths it: even when $x \notin D(A)$, the average $x_h$ lies in $D(A)$ because the integral can absorb a derivative via the fundamental theorem of calculus.[/guided]