[step:Apply the local positive commutator estimate to the normalized order-zero operator]
We apply the local positive commutator estimate for scalar order-zero real-principal-type pseudodifferential operators to $L:=QP\in\Psi^0(X)$, microlocally on $U$. The estimate says: if an order-zero scalar operator has real principal symbol $\ell$, if $d\ell\ne 0$ on its characteristic set in a flow box, and if symbols $a,b,g$ of orders $s+1/2,s,s$ satisfy $H_\ell(a^2)=b^2-g^2$ on the characteristic set with $b$ elliptic at the target and $g$ supported in the incoming region, then properly supported quantizations give an estimate
\begin{align*}
\|Bu\|_{H^s} \le C\bigl(\|G u\|_{H^s}+\|E Lu\|_{H^{s+1}}+\|F u\|_{H^{s-\delta}}\bigr)
\end{align*}
for some $\delta>0$. Here $B,G,E,F\in\Psi^0(X)$ are properly supported microlocal cutoffs, $B$ is elliptic on the elliptic set of $b$, $G$ is microsupported where $g$ is supported, $E$ and $F$ are microsupported in $V$, and $C>0$ depends on finitely many symbol seminorms of $L$ and on the chosen cutoffs.
The hypotheses of this estimate hold for $L$. Its principal symbol on $U$ is the real-valued function $p_0$, and $d p_0\ne 0$ on $\Sigma$, so $L$ is of real principal type in the flow box. The previous step constructed symbols $a,b,g$ with
\begin{align*}
H_{p_0}(a^2)=b^2-g^2
\end{align*}
on $\Sigma$, with $b$ elliptic at the target and $g$ supported in the incoming region. The principal commutator calculation is therefore applied to $L$, not to $P$: since $L$ has order $0$ and $A\in\Psi^{s+1/2}(X)$, the commutator has order $2s$, and its principal symbol is governed by $H_{p_0}(a^2)$, up to the sign convention used in the estimate.
The hypotheses give $\|E Lu\|_{H^{s+1}}<\infty$, because $Lu=QPu$ and $Q\in\Psi^{-m}(X)$ maps microlocal $H^{s-m+1}$ regularity to microlocal $H^{s+1}$ regularity. The incoming assumption gives $\|G u\|_{H^s}<\infty$. Therefore the estimate implies $\|Bu\|_{H^s}<\infty$ once the lower-order term $\|F u\|_{H^{s-\delta}}$ is known to be finite.
The external result being used here is precisely the local positive commutator estimate for scalar order-zero real-principal-type pseudodifferential operators, in the form stated in the first paragraph of this step.
[/step]