Convergence of Scaled Forward Euler Defect Multipliers to the Continuous Costate (Theorem # 7645)
Theorem
Let $T > 0$ and let $n,m \in \mathbb{N}$. Let $f \in C^2(\mathbb{R}^n \times \mathbb{R}^m;\mathbb{R}^n)$, $L \in C^2(\mathbb{R}^n \times \mathbb{R}^m;\mathbb{R})$, and $\Phi \in C^2(\mathbb{R}^n;\mathbb{R})$. Fix $x_0^* \in \mathbb{R}^n$ and consider the fixed-initial-state, free-terminal-state optimal control problem of minimizing
\begin{align*}
J[x,u]=\int_0^T L(x(t),u(t))\,d\mathcal{L}^1(t)+\Phi(x(T))
\end{align*}
over trajectories $x \in C^1([0,T];\mathbb{R}^n)$ and controls $u \in C([0,T];\mathbb{R}^m)$ subject to
\begin{align*}
\dot{x}(t)=f(x(t),u(t)) \quad \text{for every } t\in[0,T]
\end{align*}
and
\begin{align*}
x(0)=x_0^*.
\end{align*}
Define the Pontryagin Hamiltonian $H: \mathbb{R}^n \times \mathbb{R}^n \times \mathbb{R}^m \to \mathbb{R}$ by
\begin{align*}
H(x,p,u)=L(x,u)+p\cdot f(x,u).
\end{align*}
Suppose that $(x^*,u^*,p^*) \in C^1([0,T];\mathbb{R}^n) \times C([0,T];\mathbb{R}^m) \times C^1([0,T];\mathbb{R}^n)$ is a normal extremal satisfying, for every $t \in [0,T]$,
\begin{align*}
\dot{x}^*(t)=f(x^*(t),u^*(t)).
\end{align*}
\begin{align*}
-\dot{p}^*(t)=\nabla_x H(x^*(t),p^*(t),u^*(t)).
\end{align*}
\begin{align*}
0=\nabla_u H(x^*(t),p^*(t),u^*(t)).
\end{align*}
\begin{align*}
p^*(T)=\nabla\Phi(x^*(T)).
\end{align*}
Assume that there exists $\alpha>0$ such that the symmetric Hessian matrix $\nabla_{uu}^2H(x^*(t),p^*(t),u^*(t))\in\mathbb{R}^{m\times m}$ satisfies
\begin{align*}
\xi\cdot \nabla_{uu}^2H(x^*(t),p^*(t),u^*(t))\xi\ge \alpha |\xi|^2 \quad \text{for every } t\in[0,T] \text{ and every } \xi\in\mathbb{R}^m.
\end{align*}
Assume also that the second variation of $J$ at $(x^*,u^*)$ is coercive on the continuous critical cone for the fixed-initial-state problem.
Let $(\mathcal{T}_h)$ be a family of meshes
\begin{align*}
\mathcal{T}_h=\{0=t_0<t_1<\cdots<t_N=T\},
\end{align*}
where $N=N(h)$ may depend on $h$. Define $h_k=t_{k+1}-t_k$ for $0 \le k \le N-1$ and $h=\max_{0\le k\le N-1} h_k$. Assume that the meshes are quasi-uniform: there exists $\rho>0$, independent of $h$, such that $h_k \ge \rho h$ for all $0 \le k \le N-1$.
For each mesh, consider the normalized forward Euler transcription over variables $x_k\in\mathbb{R}^n$ for $0\le k\le N$ and $u_k\in\mathbb{R}^m$ for $0\le k\le N-1$:
\begin{align*}
\min_{(x_k,u_k)}\left\{\sum_{k=0}^{N-1}h_kL(x_k,u_k)+\Phi(x_N)\right\}
\end{align*}
subject to
\begin{align*}
x_0=x_0^*.
\end{align*}
\begin{align*}
\frac{x_{k+1}-x_k}{h_k}-f(x_k,u_k)=0 \quad \text{for } 0\le k\le N-1.
\end{align*}
Assume that for all sufficiently small $h$ there exists a locally optimal feasible point $(x_{k,h},u_{k,h})$ satisfying
\begin{align*}
\max_{0\le k\le N}|x_{k,h}-x^*(t_k)|+\max_{0\le k\le N-1}|u_{k,h}-u^*(t_k)| \to 0
\end{align*}
as $h \to 0$. Assume that the equality-constraint LICQ holds at $(x_{k,h},u_{k,h})$ for the variables $(x_1,\dots,x_N,u_0,\dots,u_{N-1})$ with $x_0=x_0^*$ fixed, so that the defect multiplier vector is unique.
Let $\lambda_{k,h}\in\mathbb{R}^n$ denote the multiplier attached to the normalized defect constraint
\begin{align*}
\frac{x_{k+1}-x_k}{h_k}-f(x_k,u_k)=0
\end{align*}
in the discrete Lagrangian
\begin{align*}
\mathcal{L}_h=\sum_{k=0}^{N-1}h_kL(x_k,u_k)+\Phi(x_N)+\sum_{k=0}^{N-1}\lambda_{k,h}\cdot\left(\frac{x_{k+1}-x_k}{h_k}-f(x_k,u_k)\right).
\end{align*}
Define the recovered discrete costate $\mu_{k,h}\in\mathbb{R}^n$ by
\begin{align*}
\mu_{k,h}=-h_k^{-1}\lambda_{k,h} \quad \text{for } 0\le k\le N-1.
\end{align*}
Assume the following mesh-independent maximum-norm stability estimate for the discrete adjoint terminal-value equations. For every sufficiently small $h$, define $A_{k,h}\in\mathbb{R}^{n\times n}$ for $1\le k\le N-1$ by
\begin{align*}
A_{k,h}=\nabla_x f(x_{k,h},u_{k,h})^\top.
\end{align*}
For every family of vectors $r_{k,h}\in\mathbb{R}^n$ for $1\le k\le N-1$, every terminal vector $b_h\in\mathbb{R}^n$, and every sequence $z_{k,h}\in\mathbb{R}^n$ for $0\le k\le N-1$ satisfying
\begin{align*}
z_{k-1,h}-z_{k,h}-h_k A_{k,h}z_{k,h}=r_{k,h} \quad \text{for } 1\le k\le N-1,
\end{align*}
and
\begin{align*}
z_{N-1,h}=b_h,
\end{align*}
there exists a constant $C>0$, independent of $h$, $r_{k,h}$, $b_h$, and $z_{k,h}$, such that
\begin{align*}
\max_{0\le k\le N-1}|z_{k,h}|\le C\left(|b_h|+\max_{1\le k\le N-1}|r_{k,h}|\right).
\end{align*}
Then
\begin{align*}
\max_{0\le k\le N-1}\left|-h_k^{-1}\lambda_{k,h}-p^*(t_k)\right|\to0
\end{align*}
as $h\to0$.
Knowledge Status
Analysis
Discussion
This result formalizes convergence of scaled forward euler defect multipliers to the continuous costate by giving precise hypotheses and conclusions for the controlled system under consideration. It is used to justify stability, existence, optimality, or numerical approximation arguments elsewhere in the control theory notes.
Proof
[proofplan]
The proof derives the discrete adjoint equations from the finite-dimensional KKT system and then compares those equations with the sampled continuous costate. The normalization of the defect constraint makes the raw multiplier $\lambda_{k,h}$ have size $h_k$, so the meaningful costate variable is $\mu_{k,h}=-h_k^{-1}\lambda_{k,h}$. Terminal stationarity gives $\mu_{N-1,h}$ close to $p^*(T)$, and the discrete adjoint recursion gives a stable terminal-value problem for the error $z_{k,h}=\mu_{k,h}-p^*(t_k)$. The assumed maximum-norm stability estimate converts the vanishing terminal mismatch and vanishing residual into [uniform convergence](/page/Uniform%20Convergence) of $\mu_{k,h}$ to $p^*(t_k)$.
[/proofplan]
[step:Derive the discrete stationarity equations from KKT]
Fix a sufficiently small mesh parameter $h$ and suppress the dependence of $N$ on $h$. The finite-dimensional variables are $(x_1,\dots,x_N,u_0,\dots,u_{N-1})$, with $x_0=x_0^*$ fixed. Since $f$, $L$, and $\Phi$ are $C^2$, the finite-dimensional objective and all equality-constraint maps in the transcription are $C^1$. Thus no stationarity equation with respect to $x_0$ is required. We use the finite-dimensional equality-constrained KKT necessary condition: at a local minimizer of a $C^1$ equality-constrained problem, LICQ implies existence of Lagrange multipliers satisfying stationarity; moreover, LICQ makes the equality multiplier vector unique. Since the equality-constraint LICQ holds at the locally optimal feasible point $(x_{k,h},u_{k,h})$ for these free variables, this theorem applies and gives a unique defect multiplier vector $\lambda_{k,h}\in\mathbb{R}^n$.
For $0\le k\le N-1$, stationarity with respect to $u_k$ gives
\begin{align*}
0=h_k\nabla_u L(x_{k,h},u_{k,h})-\nabla_u f(x_{k,h},u_{k,h})^\top\lambda_{k,h}.
\end{align*}
Using $\mu_{k,h}=-h_k^{-1}\lambda_{k,h}$, this becomes
\begin{align*}
0=\nabla_u L(x_{k,h},u_{k,h})+\nabla_u f(x_{k,h},u_{k,h})^\top\mu_{k,h}.
\end{align*}
Equivalently,
\begin{align*}
0=\nabla_u H(x_{k,h},\mu_{k,h},u_{k,h}).
\end{align*}
Stationarity with respect to $x_N$ gives
\begin{align*}
0=\nabla\Phi(x_{N,h})+\frac{\lambda_{N-1,h}}{h_{N-1}}.
\end{align*}
Hence
\begin{align*}
\mu_{N-1,h}=\nabla\Phi(x_{N,h}).
\end{align*}
For each $1\le k\le N-1$, stationarity with respect to $x_k$ gives
\begin{align*}
0=h_k\nabla_x L(x_{k,h},u_{k,h})+\frac{\lambda_{k-1,h}}{h_{k-1}}-\frac{\lambda_{k,h}}{h_k}-\nabla_x f(x_{k,h},u_{k,h})^\top\lambda_{k,h}.
\end{align*}
Substituting $\lambda_{j,h}=-h_j\mu_{j,h}$ for $j=k-1,k$ yields
\begin{align*}
\mu_{k-1,h}-\mu_{k,h}-h_k\nabla_x f(x_{k,h},u_{k,h})^\top\mu_{k,h}=h_k\nabla_x L(x_{k,h},u_{k,h}).
\end{align*}
[guided]
We now compute the KKT equations carefully, because the sign and scaling of the multiplier are the whole point of the theorem. The free finite-dimensional variables are $(x_1,\dots,x_N,u_0,\dots,u_{N-1})$; the initial value $x_0=x_0^*$ is fixed data. The defect constraint at index $k$ is the map $G_{k,h}:\mathbb{R}^{nN}\times\mathbb{R}^{mN}\to\mathbb{R}^n$ defined, with $x_0=x_0^*$ held fixed, by
\begin{align*}
G_{k,h}(x_1,\dots,x_N,u_0,\dots,u_{N-1})=\frac{x_{k+1}-x_k}{h_k}-f(x_k,u_k).
\end{align*}
The discrete Lagrangian is the objective plus $\sum_{k=0}^{N-1}\lambda_{k,h}\cdot G_{k,h}$. We use the finite-dimensional equality-constrained KKT necessary condition: for a $C^1$ finite-dimensional equality-constrained problem, LICQ at a local minimizer gives Lagrange multipliers satisfying stationarity, and LICQ also gives uniqueness of the equality multiplier vector. Since LICQ is assumed at the local minimizer for these equality constraints, this result gives a unique defect multiplier vector $\lambda_{k,h}$ and stationarity of $\mathcal{L}_h$ with respect to every free variable.
First vary $u_k$. Only the objective term $h_kL(x_k,u_k)$ and the defect term $\lambda_{k,h}\cdot G_{k,h}$ depend on $u_k$. Differentiating gives
\begin{align*}
0=h_k\nabla_u L(x_{k,h},u_{k,h})-\nabla_u f(x_{k,h},u_{k,h})^\top\lambda_{k,h}.
\end{align*}
The recovered costate is defined by $\mu_{k,h}=-h_k^{-1}\lambda_{k,h}$, so $\lambda_{k,h}=-h_k\mu_{k,h}$. Substitution gives
\begin{align*}
0=h_k\nabla_u L(x_{k,h},u_{k,h})+h_k\nabla_u f(x_{k,h},u_{k,h})^\top\mu_{k,h}.
\end{align*}
Since $h_k>0$, division by $h_k$ gives
\begin{align*}
0=\nabla_u L(x_{k,h},u_{k,h})+\nabla_u f(x_{k,h},u_{k,h})^\top\mu_{k,h}.
\end{align*}
This is exactly $0=\nabla_u H(x_{k,h},\mu_{k,h},u_{k,h})$, because $H(x,p,u)=L(x,u)+p\cdot f(x,u)$.
Next vary the terminal variable $x_N$. The variable $x_N$ appears in $\Phi(x_N)$ and in the last defect $G_{N-1,h}$ through the term $x_N/h_{N-1}$. Therefore stationarity gives
\begin{align*}
0=\nabla\Phi(x_{N,h})+\frac{\lambda_{N-1,h}}{h_{N-1}}.
\end{align*}
Using $\mu_{N-1,h}=-h_{N-1}^{-1}\lambda_{N-1,h}$, this becomes
\begin{align*}
\mu_{N-1,h}=\nabla\Phi(x_{N,h}).
\end{align*}
This is the discrete transversality condition.
Finally fix $1\le k\le N-1$ and vary the interior state $x_k$. This variable appears in three places: in the running cost $h_kL(x_k,u_k)$, in the previous defect $G_{k-1,h}$ through $x_k/h_{k-1}$, and in the current defect $G_{k,h}$ through $-x_k/h_k-f(x_k,u_k)$. Differentiating those three contributions gives
\begin{align*}
0=h_k\nabla_x L(x_{k,h},u_{k,h})+\frac{\lambda_{k-1,h}}{h_{k-1}}-\frac{\lambda_{k,h}}{h_k}-\nabla_x f(x_{k,h},u_{k,h})^\top\lambda_{k,h}.
\end{align*}
Substituting $\lambda_{k-1,h}=-h_{k-1}\mu_{k-1,h}$ and $\lambda_{k,h}=-h_k\mu_{k,h}$ gives
\begin{align*}
0=h_k\nabla_x L(x_{k,h},u_{k,h})-\mu_{k-1,h}+\mu_{k,h}+h_k\nabla_x f(x_{k,h},u_{k,h})^\top\mu_{k,h}.
\end{align*}
Rearranging yields the backward discrete adjoint equation
\begin{align*}
\mu_{k-1,h}-\mu_{k,h}-h_k\nabla_x f(x_{k,h},u_{k,h})^\top\mu_{k,h}=h_k\nabla_x L(x_{k,h},u_{k,h}).
\end{align*}
The appearance of $h_k$ in this recursion is a direct consequence of differentiating the cost and defect at node $k$; no reindexing is being hidden.
[/guided]
[/step]
[step:Show the discrete terminal condition converges to the continuous terminal costate]
From the terminal stationarity equation,
\begin{align*}
\mu_{N-1,h}=\nabla\Phi(x_{N,h}).
\end{align*}
The assumed uniform convergence of the discrete states gives $x_{N,h}\to x^*(T)$. Since $\nabla\Phi:\mathbb{R}^n\to\mathbb{R}^n$ is continuous,
\begin{align*}
\mu_{N-1,h}\to \nabla\Phi(x^*(T)).
\end{align*}
The continuous transversality condition gives $\nabla\Phi(x^*(T))=p^*(T)$, hence
\begin{align*}
\mu_{N-1,h}\to p^*(T).
\end{align*}
Define the terminal error $b_h\in\mathbb{R}^n$ by
\begin{align*}
b_h=\mu_{N-1,h}-p^*(t_{N-1}).
\end{align*}
Then
\begin{align*}
|b_h|\le |\mu_{N-1,h}-p^*(T)|+|p^*(T)-p^*(t_{N-1})|.
\end{align*}
Because $p^*\in C^1([0,T];\mathbb{R}^n)$, the map $p^*$ is uniformly continuous on $[0,T]$. Since $T-t_{N-1}=h_{N-1}\le h\to0$, the second term tends to $0$. Therefore
\begin{align*}
b_h\to0.
\end{align*}
[/step]
[step:Write the costate error as a stable discrete terminal-value problem]
For $0\le k\le N-1$, define the error vector $z_{k,h}\in\mathbb{R}^n$ by
\begin{align*}
z_{k,h}=\mu_{k,h}-p^*(t_k).
\end{align*}
For $1\le k\le N-1$, define
\begin{align*}
A_{k,h}=\nabla_x f(x_{k,h},u_{k,h})^\top.
\end{align*}
Subtracting the sampled continuous costate values from the discrete adjoint equation gives
\begin{align*}
z_{k-1,h}-z_{k,h}-h_kA_{k,h}z_{k,h}=r_{k,h},
\end{align*}
where $r_{k,h}\in\mathbb{R}^n$ is defined by
\begin{align*}
r_{k,h}=h_k\nabla_x L(x_{k,h},u_{k,h})-p^*(t_{k-1})+p^*(t_k)+h_kA_{k,h}p^*(t_k).
\end{align*}
The terminal condition for this error equation is
\begin{align*}
z_{N-1,h}=b_h.
\end{align*}
Thus the error sequence has exactly the form required by the assumed maximum-norm stability estimate.
[/step]
[step:Prove that the residual in the error equation vanishes uniformly]
We prove that
\begin{align*}
\max_{1\le k\le N-1}|r_{k,h}|\to0.
\end{align*}
Define the continuous adjoint vector field $F:[0,T]\to\mathbb{R}^n$ by
\begin{align*}
F(t)=\nabla_x L(x^*(t),u^*(t))+\nabla_x f(x^*(t),u^*(t))^\top p^*(t).
\end{align*}
The continuous adjoint equation is exactly $\dot p^*(t)=-F(t)$ for every $t\in[0,T]$. Since $x^*$, $u^*$, and $p^*$ are continuous and $\nabla_x L$ and $\nabla_x f$ are continuous, $F$ is continuous on the compact interval $[0,T]$ and hence uniformly continuous and bounded.
By the [fundamental theorem of calculus](/theorems/632) applied componentwise to $p^*$,
\begin{align*}
p^*(t_k)-p^*(t_{k-1})=-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s).
\end{align*}
For $1\le k\le N-1$, define $F_{k,h}\in\mathbb{R}^n$ by
\begin{align*}
F_{k,h}=\nabla_x L(x_{k,h},u_{k,h})+\nabla_x f(x_{k,h},u_{k,h})^\top p^*(t_k).
\end{align*}
Then the residual can be written as
\begin{align*}
r_{k,h}=h_kF_{k,h}-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s).
\end{align*}
Let $M_F=\max_{0\le t\le T}|F(t)|$. Define the compact reference set $K_0\subset\mathbb{R}^n\times\mathbb{R}^m$ by
\begin{align*}
K_0=\{(x^*(t),u^*(t)):0\le t\le T\}.
\end{align*}
It is compact because it is the continuous image of $[0,T]$. The assumed uniform convergence of the discrete states and controls implies that there is $R>0$ such that, for all sufficiently small $h$, every $(x_{k,h},u_{k,h})$ lies in the compact set
\begin{align*}
K_R=\{(x,u)\in\mathbb{R}^n\times\mathbb{R}^m:\operatorname{dist}((x,u),K_0)\le R\}.
\end{align*}
On $K_R$, the continuous maps $\nabla_x L$ and $\nabla_x f$ are uniformly continuous, and $p^*$ is bounded on $[0,T]$. Hence
\begin{align*}
\varepsilon_h:=\max_{1\le k\le N-1}|F_{k,h}-F(t_k)|\to0.
\end{align*}
For each $k$,
\begin{align*}
|r_{k,h}|\le h_k|F_{k,h}-F(t_k)|+\left|h_kF(t_k)-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s)\right|.
\end{align*}
The first term is bounded by $h\varepsilon_h$. For the second term, add and subtract $h_{k-1}F(t_k)$ to obtain
\begin{align*}
\left|h_kF(t_k)-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s)\right|\le |h_k-h_{k-1}|M_F+\int_{t_{k-1}}^{t_k}|F(t_k)-F(s)|\,d\mathcal{L}^1(s).
\end{align*}
Since $|h_k-h_{k-1}|\le h$ and $F$ is uniformly continuous, the right-hand side is bounded by
\begin{align*}
hM_F+h\omega_F(h),
\end{align*}
where $\omega_F(h)=\sup\{|F(t)-F(s)|:s,t\in[0,T], |s-t|\le h\}$ and $\omega_F(h)\to0$. Therefore
\begin{align*}
\max_{1\le k\le N-1}|r_{k,h}|\le h\varepsilon_h+hM_F+h\omega_F(h)\to0.
\end{align*}
[guided]
The residual must be compared with the differential equation solved by the continuous costate, not merely bounded term by term. Define
\begin{align*}
F(t)=\nabla_x L(x^*(t),u^*(t))+\nabla_x f(x^*(t),u^*(t))^\top p^*(t).
\end{align*}
The continuous adjoint equation says $-\dot p^*(t)=F(t)$ for every $t\in[0,T]$. This is the consistency input: it identifies the increment of the sampled continuous costate with the integral of the same vector field that appears in the discrete adjoint recursion.
Since $x^*$, $u^*$, and $p^*$ are continuous, and since $\nabla_x L$ and $\nabla_x f$ are continuous, the map $F:[0,T]\to\mathbb{R}^n$ is continuous. Because $[0,T]$ is compact, $F$ is uniformly continuous and bounded. By the fundamental theorem of calculus applied componentwise to $p^*$,
\begin{align*}
p^*(t_k)-p^*(t_{k-1})=\int_{t_{k-1}}^{t_k}\dot p^*(s)\,d\mathcal{L}^1(s)=-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s).
\end{align*}
Now define the discrete sampled vector field $F_{k,h}\in\mathbb{R}^n$ by
\begin{align*}
F_{k,h}=\nabla_x L(x_{k,h},u_{k,h})+\nabla_x f(x_{k,h},u_{k,h})^\top p^*(t_k).
\end{align*}
Using the definition of $r_{k,h}$ and the preceding integral identity gives
\begin{align*}
r_{k,h}=h_kF_{k,h}-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s).
\end{align*}
This formula shows exactly what must be controlled: the discrete vector field must approximate $F(t_k)$, and the integral of $F$ over the previous mesh interval must be approximated by a one-point quadrature term.
Define the compact reference set $K_0\subset\mathbb{R}^n\times\mathbb{R}^m$ by
\begin{align*}
K_0=\{(x^*(t),u^*(t)):0\le t\le T\}.
\end{align*}
This set is compact because it is the continuous image of the compact interval $[0,T]$. Uniform convergence of $(x_{k,h},u_{k,h})$ to $(x^*(t_k),u^*(t_k))$ implies that there is $R>0$ such that all discrete pairs lie, for all sufficiently small $h$, in the compact set
\begin{align*}
K_R=\{(x,u)\in\mathbb{R}^n\times\mathbb{R}^m:\operatorname{dist}((x,u),K_0)\le R\}.
\end{align*}
On $K_R$, the continuous maps $\nabla_x L$ and $\nabla_x f$ are uniformly continuous, and $p^*$ is bounded on $[0,T]$. Therefore
\begin{align*}
\varepsilon_h:=\max_{1\le k\le N-1}|F_{k,h}-F(t_k)|\to0.
\end{align*}
Let $M_F=\max_{0\le t\le T}|F(t)|$. For each $1\le k\le N-1$,
\begin{align*}
|r_{k,h}|\le h_k|F_{k,h}-F(t_k)|+\left|h_kF(t_k)-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s)\right|.
\end{align*}
The first term is at most $h\varepsilon_h$. For the second term, insert $h_{k-1}F(t_k)$, because the integration interval has length $h_{k-1}=t_k-t_{k-1}$:
\begin{align*}
\left|h_kF(t_k)-\int_{t_{k-1}}^{t_k}F(s)\,d\mathcal{L}^1(s)\right|\le |h_k-h_{k-1}|M_F+\int_{t_{k-1}}^{t_k}|F(t_k)-F(s)|\,d\mathcal{L}^1(s).
\end{align*}
Since both $h_k$ and $h_{k-1}$ are bounded by $h$, we have $|h_k-h_{k-1}|\le h$. If
\begin{align*}
\omega_F(h)=\sup\{|F(t)-F(s)|:s,t\in[0,T], |s-t|\le h\},
\end{align*}
then [uniform continuity](/page/Uniform%20Continuity) gives $\omega_F(h)\to0$, and the integral term is at most $h\omega_F(h)$. Consequently
\begin{align*}
\max_{1\le k\le N-1}|r_{k,h}|\le h\varepsilon_h+hM_F+h\omega_F(h)\to0.
\end{align*}
This proves the required uniform residual consistency. The continuous adjoint equation is essential here: it is what turns the difference $p^*(t_k)-p^*(t_{k-1})$ into the integral of the correct vector field.
[/guided]
[/step]
[step:Apply the stability estimate to obtain uniform convergence of the recovered costates]
The sequence $z_{k,h}$ satisfies
\begin{align*}
z_{k-1,h}-z_{k,h}-h_kA_{k,h}z_{k,h}=r_{k,h} \quad \text{for } 1\le k\le N-1
\end{align*}
and
\begin{align*}
z_{N-1,h}=b_h.
\end{align*}
The assumed maximum-norm stability estimate therefore yields
\begin{align*}
\max_{0\le k\le N-1}|z_{k,h}|\le C\left(|b_h|+\max_{1\le k\le N-1}|r_{k,h}|\right).
\end{align*}
The previous steps showed that $b_h\to0$ and $\max_{1\le k\le N-1}|r_{k,h}|\to0$. Hence
\begin{align*}
\max_{0\le k\le N-1}|z_{k,h}|\to0.
\end{align*}
Recalling that $z_{k,h}=\mu_{k,h}-p^*(t_k)$ and $\mu_{k,h}=-h_k^{-1}\lambda_{k,h}$, we obtain
\begin{align*}
\max_{0\le k\le N-1}\left|-h_k^{-1}\lambda_{k,h}-p^*(t_k)\right|\to0.
\end{align*}
This is the desired convergence of the scaled forward Euler defect multipliers to the continuous costate.
[/step]
Prerequisites (0/2 completed)
Prerequisites Graph
Interactive dependency map showing how this theorem builds on foundational concepts
Loading dependency graph...
Theorem
Definition
Current
Requires
Explore Further
Uniform Convergence
Definition
Fundamental Theorem Of Calculus
Theorem #632
Lower Semicontinuity of the Norm
Functional Analysis
Cauchy Condensation Test
Series
Admissible Potentials Are Bounded by Their $c$-Transforms
Analysis
Equivalent Characterisations of Compactness in Metric Spaces
Topology
Term-Wise Integration
Power Series
Rankine-Hugoniot Jump Condition
Partial Differential Equations
Levy Continuity Theorem
Measure Theory
Lyapunov Stability and Local Asymptotic Stability Theorem
Analysis
Analysis
Area