[guided]The central point is that the error equation has no linear dependence on $x$ at $e=0$. We first isolate the three vector fields involved. The nominal closed-loop vector field is $\phi: X_0\to \mathbb{R}^n$, defined by $\phi(x)=f(x,k(x))$. The actual state component in error coordinates is $\psi: X_0\times E_0\to \mathbb{R}^n$, defined by $\psi(x,e)=f(x,k(x+e))$. The error component is $g: X_0\times E_0\to \mathbb{R}^n$, defined by
\begin{align*}
g(x,e)=q(x+e,h(x),k(x+e))-f(x,k(x+e)).
\end{align*}
The exactness assumption says precisely that, when the observer state equals the plant state, the observer reproduces the plant dynamics. In error coordinates, equality of observer and plant state means $e=0$. Therefore, for every $x\in X_0\subset X_{\mathrm{ex}}$,
\begin{align*}
g(x,0)=q(x,h(x),k(x))-f(x,k(x))=0.
\end{align*}
This identity holds on an open neighbourhood of $0$, not merely at the point $0$. Since $g$ is $C^1$, the derivative of the map $x\mapsto g(x,0)$ at $0$ exists, and because that map is identically zero, its derivative is the zero linear map:
\begin{align*}
D_x g_{(0,0)}=0.
\end{align*}
Define $A=D_x\psi_{(0,0)}$, $B=D_e\psi_{(0,0)}$, and $C=D_e g_{(0,0)}$ as linear maps from $\mathbb{R}^n$ to $\mathbb{R}^n$. Thus the Jacobian matrix of the full vector field $(\psi,g)$ at $(0,0)$ acts on $(\xi,\eta)\in\mathbb{R}^n\times\mathbb{R}^n$ by
\begin{align*}
J(\xi,\eta)=(A\xi+B\eta,C\eta).
\end{align*}
This is exactly the block upper triangular separation structure: the linearized error equation does not receive a first-order forcing term from $x$.
Finally, when $e=0$, the state component is exactly the nominal feedback vector field:
\begin{align*}
\psi(x,0)=f(x,k(x))=\phi(x).
\end{align*}
Differentiating this identity with respect to $x$ at $0$ gives
\begin{align*}
D_x\psi_{(0,0)}=D\phi_0.
\end{align*}[/guided]