[step:Derive the right derivative from the mild formula]
Fix $t\in[0,T)$ and let $h>0$ with $t+h\leq T$. The semigroup property gives
\begin{align*}
S(t+h)u_0=S(h)S(t)u_0.
\end{align*}
For the integral term, split the interval $[0,t+h]$ into $[0,t]$ and $[t,t+h]$. On $[0,t]$, use $S(t+h-s)=S(h)S(t-s)$. Thus
\begin{align*}
u(t+h)=S(h)u(t)+\int_t^{t+h}S(t+h-s)f(s)\,d\mathcal{L}^1(s).
\end{align*}
Therefore
\begin{align*}
\frac{u(t+h)-u(t)}{h}
=
\frac{S(h)u(t)-u(t)}{h}
+
\frac{1}{h}\int_t^{t+h}S(t+h-s)f(s)\,d\mathcal{L}^1(s).
\end{align*}
Since $u(t)\in D(A)$, the first term converges to $Au(t)$ in $X$ as $h\downarrow 0$.
For the second term, define the map $F_h:[0,h]\to X$ by $F_h(r)=S(r)f(t+h-r)$. After the change of variables $r=t+h-s$, with $d\mathcal{L}^1(r)=d\mathcal{L}^1(s)$ and $s\in[t,t+h]$ corresponding to $r\in[h,0]$, reversing the interval gives
\begin{align*}
\frac{1}{h}\int_t^{t+h}S(t+h-s)f(s)\,d\mathcal{L}^1(s)
=
\frac{1}{h}\int_0^h S(r)f(t+h-r)\,d\mathcal{L}^1(r).
\end{align*}
Let $I_X:X\to X$ denote the identity map. The integrand converges uniformly to $f(t)$ for $0\leq r\leq h$ as $h\downarrow 0$. Indeed,
\begin{align*}
\|S(r)f(t+h-r)-f(t)\|_X\leq M_T\|f(t+h-r)-f(t)\|_X+\|S(r)f(t)-I_Xf(t)\|_X.
\end{align*}
The first term tends to $0$ uniformly for $0\leq r\leq h$ by continuity of $f$ at $t$, and the second term tends to $0$ uniformly for $0\leq r\leq h$ by strong continuity of $S$ at $0$ applied to the fixed vector $f(t)\in X$. Hence
\begin{align*}
\frac{1}{h}\int_0^h S(r)f(t+h-r)\,d\mathcal{L}^1(r)\to f(t)
\end{align*}
in $X$. Since $u\in C^1([0,T];X)$, the right derivative equals $u'(t)$, so
\begin{align*}
u'(t)=Au(t)+f(t)
\end{align*}
for every $t\in[0,T)$.
[/step]