[step:Remove the auxiliary spatial cutoff by nonstationary phase]It remains to control $A_h((1-q\chi)u_h)$. The factor $\rho(x)$ is contained in the symbol $a(x,\xi)=\rho(x)\psi(\xi)$, so the Schwartz kernel only involves $x\in\operatorname{supp}\rho$. Since $q\chi=1$ on a neighbourhood of $\operatorname{supp}\rho$, choose $\theta:U\to[0,1]$ with $\theta\in C_c^\infty(U)$, $\theta=1$ on a neighbourhood of $\operatorname{supp}\rho$, and $q\chi=1$ on a neighbourhood of $\operatorname{supp}\theta$. Then
\begin{align*}
A_h((1-q\chi)u_h)=A_h((1-q\chi)\theta u_h)+A_h((1-q\chi)(1-\theta)u_h).
\end{align*}
The first term is identically zero because $(1-q\chi)\theta=0$.
For the second term, the spatial supports of $\rho$ and $(1-q\chi)(1-\theta)$ are separated. Thus there exists $\delta>0$ such that
\begin{align*}
|x-y|\geq \delta
\end{align*}
whenever $x\in\operatorname{supp}\rho$ and $y\in\operatorname{supp}(1-\theta)$.
The Schwartz kernel of $A_h(1-q\chi)(1-\theta)$ is
\begin{align*}
K_h(x,y)=(2\pi h)^{-n}\int_{\mathbb{R}^n}e^{i(x-y)\cdot\xi/h}a(x,\xi)(1-q(y)\chi(y))(1-\theta(y))\,d\mathcal{L}^n(\xi).
\end{align*}
Because the quantization in the theorem statement is properly supported, for every compact output set $K_x\Subset U$ there is a compact set $K_y\Subset U$ such that the Schwartz kernel of $A_h$ over $K_x\times U$ is supported in $K_x\times K_y$. Hence, when estimating $A_h((1-q\chi)(1-\theta)u_h)$ on $K_x$, the distribution $u_h$ is applied only to compactly supported test functions in the $y$ variable.
For every compact set $K_x\Subset U$, every pair of multi-indices $\alpha$ and $\gamma$, and every $M\geq0$, repeated integration by parts in $\xi$ with
\begin{align*}
L_{x,y,\xi}=\frac{h}{i|x-y|^2}\sum_{j=1}^n (x_j-y_j)\partial_{\xi_j}
\end{align*}
on the compact set $(K_x\cap\operatorname{supp}\rho)\times K_y$ gives
\begin{align*}
\sup_{x\in K_x,\,y\in K_y}|\partial_x^\alpha\partial_y^\gamma K_h(x,y)|\leq C_{\alpha,\gamma,K_x,M}h^M.
\end{align*}
Here the constants are finite because $|x-y|\geq\delta$ on the support of the amplitude, the $\xi$-support of $a$ is compact, and only finitely many derivatives of $a$, $q\chi$, and $\theta$ occur. Since $(u_h)$ is semiclassically tempered as a distribution on $U$, its action on compactly supported test functions in $K_y$ grows at most polynomially in $h^{-1}$ in local distribution seminorms. Choosing the number of integrations by parts larger than this polynomial loss gives
\begin{align*}
A_h((1-q\chi)(1-\theta)u_h)=O(h^\infty)
\end{align*}
in $C^\infty_{\mathrm{loc}}(U)$, and therefore also
\begin{align*}
A_h((1-q\chi)u_h)=O(h^\infty)
\end{align*}
in $C^\infty_{\mathrm{loc}}(U)$.
Combining this with the previous step gives
\begin{align*}
A_hu_h=A_h(q\chi u_h)+A_h((1-q\chi)u_h)=O(h^\infty)
\end{align*}
in $C^\infty_{\mathrm{loc}}(U)$.[/step]