[step:Match the two mixed derivatives and obtain the identity]Define
\begin{align*}
F:\mathbb R^2\to\mathbb R
\end{align*}
by
\begin{align*}
F(t,s)=f(\exp(tX)\exp(sY)\exp(-tX)).
\end{align*}
For each fixed $t$, the $s$-derivative at $s=0$ is exactly
\begin{align*}
\left.\frac{d}{ds}\right|_{s=0}F(t,s)
=
(\operatorname{Ad}_{\exp(tX)}Y)(f),
\end{align*}
so
\begin{align*}
\operatorname{ad}_X(Y)(f)
=
\left.\frac{d}{dt}\right|_{t=0}
\left.\frac{d}{ds}\right|_{s=0}F(t,s).
\end{align*}
To compare this with the bracket, fix $s$ near $0$ and apply the product rule to the smooth map
\begin{align*}
t\mapsto \exp(tX)\exp(sY)\exp(-tX)
\end{align*}
at $t=0$. More explicitly, write the multiplication map as $m:G\times G\to G$, $m(g,h)=gh$, and define $p_s:G\times G\to G$ by $p_s(g,h)=g\exp(sY)h$. Then the curve above is $p_s(\exp(tX),\exp(-tX))$. Since $\frac{d}{dt}|_{t=0}\exp(tX)=X\in T_eG$ and $\frac{d}{dt}|_{t=0}\exp(-tX)=-X\in T_eG$, the chain rule gives its velocity at $t=0$ as
\begin{align*}
d(p_s)_{(e,e)}(X,-X).
\end{align*}
By linearity of the differential in the product tangent space $T_{(e,e)}(G\times G)\cong T_eG\oplus T_eG$, this is the sum of the contribution from $(X,0)$ and the contribution from $(0,-X)$. After applying $df_{\exp(sY)}$, the first contribution is represented by $t\mapsto f(\exp(tX)\exp(sY))$, and the second contribution is the negative of the contribution represented by $t\mapsto f(\exp(sY)\exp(tX))$. Therefore
\begin{align*}
\left.\frac{d}{dt}\right|_{t=0}F(t,s)
=
\left.\frac{d}{dt}\right|_{t=0}f(\exp(tX)\exp(sY))
-
\left.\frac{d}{dt}\right|_{t=0}f(\exp(sY)\exp(tX)).
\end{align*}
The functions of $(t,s)$ appearing here are smooth near $(0,0)$, because multiplication, inversion, the exponential map, and $f$ are smooth. Hence equality of mixed partial derivatives gives
\begin{align*}
\left.\frac{d}{ds}\right|_{s=0}\left.\frac{d}{dt}\right|_{t=0}f(\exp(tX)\exp(sY))
=
\left.\frac{d}{dt}\right|_{t=0}\left.\frac{d}{ds}\right|_{s=0}f(\exp(tX)\exp(sY)).
\end{align*}
Differentiating the displayed identity for $\partial_tF(0,s)$ with respect to $s$ at $s=0$ and using this equality of mixed partial derivatives gives
\begin{align*}
\left.\frac{d}{dt}\right|_{t=0}
\left.\frac{d}{ds}\right|_{s=0}
f(\exp(tX)\exp(sY)\exp(-tX))
=
[X^L,Y^L]_e(f).
\end{align*}
Combining this with the computation of the infinitesimal adjoint action yields
\begin{align*}
\operatorname{ad}_X(Y)(f)=[X^L,Y^L]_e(f).
\end{align*}
Since this holds for every smooth germ $f$ at $e$, the tangent vectors are equal:
\begin{align*}
\operatorname{ad}_X(Y)=[X^L,Y^L]_e=[X,Y].
\end{align*}
This proves the theorem.[/step]