[guided]We now explain why the summed phase contains exactly the geometry of canonical-relation composition. The composed kernel is obtained by multiplying $K_A(x,y)$ and $K_B(y,z)$ and then integrating out $y$. Therefore the total oscillation is governed by the phase
\begin{align*}
\Phi(x,y,z,\theta,\tau) := \phi_1(x,y,\theta)+\phi_2(y,z,\tau).
\end{align*}
The remaining phase variables are $y$, $\theta$, and $\tau$, because $x$ and $z$ are the external kernel variables.
A point contributes to the associated canonical relation only when the phase is stationary in all internal variables. Stationarity in $\theta$ gives
\begin{align*}
\partial_\theta \phi_1(x,y,\theta)=0,
\end{align*}
so the nondegenerate phase $\phi_1$ produces the point
\begin{align*}
(x,\partial_x\phi_1(x,y,\theta),y,-\partial_y\phi_1(x,y,\theta)) \in C_1.
\end{align*}
Stationarity in $\tau$ gives
\begin{align*}
\partial_\tau \phi_2(y,z,\tau)=0,
\end{align*}
so the nondegenerate phase $\phi_2$ produces the point
\begin{align*}
(y,\partial_y\phi_2(y,z,\tau),z,-\partial_z\phi_2(y,z,\tau)) \in C_2.
\end{align*}
It remains to interpret the $y$-stationarity equation. Since $y$ is the variable being integrated out, stationary phase requires
\begin{align*}
\partial_y \Phi(x,y,z,\theta,\tau)=0.
\end{align*}
By the definition of $\Phi$, this is
\begin{align*}
\partial_y \phi_1(x,y,\theta)+\partial_y \phi_2(y,z,\tau)=0.
\end{align*}
Equivalently,
\begin{align*}
-\partial_y\phi_1(x,y,\theta)=\partial_y\phi_2(y,z,\tau).
\end{align*}
The covector on the $Y$ side of $C_1$ is $-\partial_y\phi_1$, while the covector on the $Y$ side of $C_2$ is $\partial_y\phi_2$. Thus this equation is exactly the condition that the intermediate covectors agree. Consequently, the critical set of $\Phi$ in the variables $(y,\theta,\tau)$ is the local model for
\begin{align*}
C_1 \times_{T^*Y} C_2.
\end{align*}
This is the geometric reason that the product phase parametrizes the composition rather than some unrelated relation.[/guided]