[guided]We first verify the domain hypothesis. The theorem assumes that the comparison curve $\psi: [0, 1] \to T_pM$ takes values in the maximal domain $\mathcal{D}_p \subseteq T_pM$ on which $\exp_p$ is defined, so that $\sigma := \exp_p \circ \psi$ is well-defined as a curve in $M$. Without this assumption the inequality "$\operatorname{length}(\exp_p \circ \psi) \ge |a|$" is vacuous. Note that $\mathcal{D}_p$ is open in $T_pM$ (by openness of the maximal flow domain of the geodesic spray), and since $\exp_p(a)$ is defined by hypothesis, the line segment $\{ta : t \in [0, 1]\}$ lies in $\mathcal{D}_p$, so the radial comparison curve $\varphi(t) = ta$ is automatically in $\mathcal{D}_p$. We work under this hypothesis throughout.
Now we analyse the velocity of $\sigma$ on the open subinterval $I_k$ (where $\psi$ avoids $0$ and the polar decomposition $\psi = \rho\, \mathbf{v}$ is defined). The chain rule applied to the composition $\sigma = \exp_p \circ \psi$ yields, at each point of differentiability,
\begin{align*}
\dot\sigma(t) = (d\exp_p)_{\psi(t)}(\dot\psi(t)) = (d\exp_p)_{\psi(t)}\bigl(\dot\rho(t)\, \mathbf{v}(t)\bigr) + (d\exp_p)_{\psi(t)}\bigl(\rho(t)\, \dot{\mathbf{v}}(t)\bigr).
\end{align*}
At points where $\psi$ is only piecewise $C^1$, this identity holds on each smooth piece; since the length integral is finitely additive over partitions, the finitely many non-differentiability points pose no issue. The differential $(d\exp_p)_{\psi(t)}: T_{\psi(t)}(T_pM) \cong T_pM \to T_{\sigma(t)}M$ is $\mathbb{R}$-linear, so we may pull the scalars $\dot\rho(t)$ and $\rho(t)$ out:
\begin{align*}
\dot\sigma(t) = \dot\rho(t)\, (d\exp_p)_{\psi(t)}(\mathbf{v}(t)) + \rho(t)\, (d\exp_p)_{\psi(t)}(\dot{\mathbf{v}}(t)). \tag{1}
\end{align*}
Write $A(t) := (d\exp_p)_{\psi(t)}(\mathbf{v}(t))$ for the **radial image** and $B(t) := (d\exp_p)_{\psi(t)}(\dot{\mathbf{v}}(t))$ for the **angular image**. The decomposition (1) reads $\dot\sigma = \dot\rho\, A + \rho\, B$: the velocity of $\sigma$ splits into a radial component (with coefficient $\dot\rho$, the radial speed of $\psi$) and an angular component (with coefficient $\rho$).
To extract a useful bound, we want the inner products $g_{\sigma(t)}(A, A)$, $g_{\sigma(t)}(A, B)$, and $g_{\sigma(t)}(B, B)$. The key input is the [Gauss Lemma — Covariant Form](/theorems/2714), which states: for any $a' \in T_pM$ in the domain of $\exp_p$ and any $u, w \in T_pM$,
\begin{align*}
g_{\exp_p(a')}\bigl((d\exp_p)_{a'}(a'),\, (d\exp_p)_{a'}(w)\bigr) = g_p(a', w).
\end{align*}
Why does this lemma hold? It expresses the fact that $\exp_p$ preserves radial geodesics: the curve $t \mapsto \exp_p(ta')$ is a geodesic of constant speed $|a'|_{g_p}$, and its velocity $(d\exp_p)_{ta'}(a')$ has constant length $|a'|_{g_p}$. The orthogonality of angular directions follows from the variational characterisation of geodesics. We will apply this with $a' = \psi(t) = \rho(t)\, \mathbf{v}(t)$, which lies in $\mathcal{D}_p$ by the domain hypothesis.
**Computing $g_{\sigma(t)}(A, B) = 0$.** Apply the covariant Gauss lemma with $a' = \psi(t)$ and $w = \dot{\mathbf{v}}(t)$:
\begin{align*}
g_{\sigma(t)}\bigl((d\exp_p)_{\psi(t)}(\psi(t)),\, (d\exp_p)_{\psi(t)}(\dot{\mathbf{v}}(t))\bigr) = g_p(\psi(t), \dot{\mathbf{v}}(t)) = \rho(t)\, g_p(\mathbf{v}(t), \dot{\mathbf{v}}(t)) = 0,
\end{align*}
where the last equality uses that $g_p(\mathbf{v}(t), \dot{\mathbf{v}}(t)) = 0$: this follows by differentiating the identity $g_p(\mathbf{v}, \mathbf{v}) = 1$ (which holds because $\mathbf{v}$ is unit by construction). The left-hand side equals $\rho(t)\, g_{\sigma(t)}(A(t), B(t))$ by linearity of $(d\exp_p)$ in the first slot and bilinearity of $g$. Since $\rho(t) > 0$ on $I_k$, we may divide:
\begin{align*}
g_{\sigma(t)}(A(t), B(t)) = 0. \tag{2}
\end{align*}
**Computing $|A(t)|_{g_{\sigma(t)}} = 1$.** Apply the covariant Gauss lemma with $a' = \psi(t)$ and $w = \mathbf{v}(t) = \psi(t)/\rho(t)$:
\begin{align*}
g_{\sigma(t)}\bigl((d\exp_p)_{\psi(t)}(\psi(t)),\, (d\exp_p)_{\psi(t)}(\mathbf{v}(t))\bigr) = g_p(\psi(t), \mathbf{v}(t)) = \rho(t).
\end{align*}
The left-hand side equals $\rho(t)\, g_{\sigma(t)}(A(t), A(t))$ by linearity, so $\rho(t)\, g_{\sigma(t)}(A(t), A(t)) = \rho(t)$, and dividing by $\rho(t) > 0$:
\begin{align*}
g_{\sigma(t)}(A(t), A(t)) = 1, \quad \text{i.e., } |A(t)|_{g_{\sigma(t)}} = 1. \tag{3}
\end{align*}
**The Pythagorean decomposition.** Combining (2) and (3) and expanding the squared norm of (1) using bilinearity of $g_{\sigma(t)}$:
\begin{align*}
|\dot\sigma(t)|_{g_{\sigma(t)}}^2 &= |\dot\rho(t)\, A(t) + \rho(t)\, B(t)|_{g_{\sigma(t)}}^2 \\
&= \dot\rho(t)^2 \, g_{\sigma(t)}(A, A) + 2\dot\rho(t)\rho(t)\, g_{\sigma(t)}(A, B) + \rho(t)^2\, g_{\sigma(t)}(B, B) \\
&= \dot\rho(t)^2 \cdot 1 + 0 + \rho(t)^2 |B(t)|_{g_{\sigma(t)}}^2 \\
&= \dot\rho(t)^2 + \rho(t)^2 |B(t)|_{g_{\sigma(t)}}^2. \tag{4}
\end{align*}
The geometric meaning: on $T_pM$ we are using polar coordinates $(\rho, \mathbf{v})$ adapted to the origin — the radial direction is the line through $\psi(t)$ from $0$, and the angular direction is tangent to the sphere of radius $\rho(t)$. The Gauss lemma says that $\exp_p$ maps these two orthogonal directions on $T_pM$ to two orthogonal directions on $M$, and preserves the metric in the radial direction. Hence the radial speed of $\sigma$ on $M$ exactly equals the radial speed $|\dot\rho|$ of $\psi$ in $T_pM$. The angular term $\rho(t)^2 |B(t)|^2$ is non-negative; it vanishes precisely when $\dot{\mathbf{v}}(t) = 0$ (the unit direction $\mathbf{v}$ is momentarily constant) and so measures how much $\psi$ deviates from a purely radial path.[/guided]