Smooth Parameter Dependence for Characteristic Solutions

Theorem

Edit Issues Pull Requests Attributions Admin

Discussion

No discussion available for this theorem.

Proof

[proofplan] We write the characteristic equations as one autonomous ordinary differential equation on $U \times I$ with a $C^k$ vector field. Since the initial point of the lifted characteristic depends $C^k$ on $(\lambda,\theta)$, smooth dependence for ordinary differential equations gives a common $C^k$ characteristic flow on a small parameter box. The noncharacteristic condition implies that the spatial characteristic map $(s,\theta)\mapsto x_{\lambda_0}(s,\theta)$ has invertible derivative at $(0,\theta_0)$, and continuity preserves this invertibility for nearby $\lambda$. A parameter-dependent inverse function theorem then supplies a $C^k$ inverse $(\lambda,x)\mapsto (s_\lambda(x),\theta_\lambda(x))$, and composing it with the lifted characteristic height $z_\lambda$ gives the desired $C^k$ family of solutions. [/proofplan] [step:Rewrite the characteristic equations as a parameter-dependent initial value problem] Define the lifted vector field \begin{align*} F: U \times I \to \mathbb{R}^{n+1} \end{align*} by $F(x,z)=(a(x,z),b(x,z))$. Since $a$ and $b$ are $C^k$, the map $F$ is $C^k$. Define the initial-data map \begin{align*} Q: \Lambda \times \Theta \to U \times I \end{align*} by $Q(\lambda,\theta)=(Y(\theta),h(\lambda,\theta))$. Because $Y$ and $h$ are $C^k$, the map $Q$ is $C^k$. For each fixed $(\lambda,\theta)$, let $J_{\lambda,\theta}\subset\mathbb{R}$ denote the maximal open interval containing $0$ on which the lifted characteristic curve is defined. The lifted characteristic curve is the solution \begin{align*} \gamma_{\lambda,\theta}: J_{\lambda,\theta} \to U \times I \end{align*} of the ordinary differential equation \begin{align*} \frac{d}{ds}\gamma_{\lambda,\theta}(s)=F(\gamma_{\lambda,\theta}(s)). \end{align*} It satisfies the initial condition \begin{align*} \gamma_{\lambda,\theta}(0)=Q(\lambda,\theta). \end{align*} Writing \begin{align*} \gamma_{\lambda,\theta}(s)=(x_\lambda(s,\theta),z_\lambda(s,\theta)) \end{align*} recovers exactly the characteristic system in the theorem statement. By the $C^k$ smooth-dependence refinement of the [Picard-Lindelof theorem](/theorems/69), applied to the $C^k$ vector field $F$ and the $C^k$ initial-data map $Q$, there exist open neighbourhoods $\Lambda_1 \subset \Lambda$ of $\lambda_0$, $\Theta_1 \subset \Theta$ of $\theta_0$, and an interval $J=(-\varepsilon,\varepsilon)$ with $\varepsilon>0$ such that the solution map $\Gamma: \Lambda_1 \times J \times \Theta_1 \to U \times I$, defined by $\Gamma(\lambda,s,\theta)=\gamma_{\lambda,\theta}(s)$, is $C^k$. The hypotheses are satisfied because $F$ is $C^k$, the initial point $Q(\lambda,\theta)$ depends $C^k$ on $(\lambda,\theta)$, and $Q(\lambda_0,\theta_0)\in U\times I$. Let $X: \Lambda_1 \times J \times \Theta_1 \to U$ be defined by $X(\lambda,s,\theta)=x_\lambda(s,\theta)$, and let $Z: \Lambda_1 \times J \times \Theta_1 \to I$ be defined by $Z(\lambda,s,\theta)=z_\lambda(s,\theta)$. These are the component maps of $\Gamma$. Then $X$ and $Z$ are $C^k$. [guided] The characteristic system is easiest to control if we regard the spatial component $x$ and the unknown value $z$ as a single point in $U \times I$. Define \begin{align*} F: U \times I \to \mathbb{R}^{n+1} \end{align*} by $F(x,z)=(a(x,z),b(x,z))$. This is a $C^k$ map because both component maps $a:U\times I\to\mathbb{R}^n$ and $b:U\times I\to\mathbb{R}$ are $C^k$. The initial condition also depends on parameters. We package it as the map \begin{align*} Q: \Lambda \times \Theta \to U \times I \end{align*} defined by $Q(\lambda,\theta)=(Y(\theta),h(\lambda,\theta))$. The map $Q$ is $C^k$ because $Y$ and $h$ are $C^k$. For fixed $(\lambda,\theta)$, let $J_{\lambda,\theta}\subset\mathbb{R}$ denote the maximal open interval containing $0$ on which the lifted characteristic curve is defined, and define \begin{align*} \gamma_{\lambda,\theta}: J_{\lambda,\theta} \to U \times I \end{align*} to be the solution of \begin{align*} \frac{d}{ds}\gamma_{\lambda,\theta}(s)=F(\gamma_{\lambda,\theta}(s)). \end{align*} It satisfies the initial condition \begin{align*} \gamma_{\lambda,\theta}(0)=Q(\lambda,\theta). \end{align*} If we write $\gamma_{\lambda,\theta}(s)=(x_\lambda(s,\theta),z_\lambda(s,\theta))$, then the single lifted equation is precisely the pair of characteristic equations \begin{align*} \frac{\partial x_\lambda}{\partial s}(s,\theta)=a(x_\lambda(s,\theta),z_\lambda(s,\theta)) \end{align*} and \begin{align*} \frac{\partial z_\lambda}{\partial s}(s,\theta)=b(x_\lambda(s,\theta),z_\lambda(s,\theta)). \end{align*} The initial condition for the spatial component is \begin{align*} x_\lambda(0,\theta)=Y(\theta). \end{align*} The initial condition for the height component is \begin{align*} z_\lambda(0,\theta)=h(\lambda,\theta). \end{align*} We now use the $C^k$ smooth-dependence refinement of the [Picard-Lindelof theorem](/theorems/69). Its hypotheses are satisfied: the vector field $F$ is $C^k$, the initial point $Q(\lambda,\theta)$ depends $C^k$ on $(\lambda,\theta)$, and the initial point $Q(\lambda_0,\theta_0)=(Y(\theta_0),h(\lambda_0,\theta_0))$ lies in the [open set](/page/Open%20Set) $U\times I$. Therefore, after shrinking to neighbourhoods $\Lambda_1$ of $\lambda_0$ and $\Theta_1$ of $\theta_0$, and after choosing an interval $J=(-\varepsilon,\varepsilon)$, the solution exists for all $(\lambda,s,\theta)\in \Lambda_1\times J\times \Theta_1$ and defines a $C^k$ map $\Gamma: \Lambda_1 \times J \times \Theta_1 \to U \times I$ by $\Gamma(\lambda,s,\theta)=\gamma_{\lambda,\theta}(s)$. Taking coordinate projections gives $C^k$ maps $X: \Lambda_1 \times J \times \Theta_1 \to U$ and $Z: \Lambda_1 \times J \times \Theta_1 \to I$ defined by $X(\lambda,s,\theta)=x_\lambda(s,\theta)$ and $Z(\lambda,s,\theta)=z_\lambda(s,\theta)$. This common parameter box is the point of invoking smooth dependence: it gives one domain on which all nearby characteristics are defined and vary $C^k$ in every variable. [/guided] [/step] [step:Use the noncharacteristic condition to invert the spatial characteristic map] For each $\lambda \in \Lambda_1$, define $X_\lambda: J \times \Theta_1 \to U$ by $X_\lambda(s,\theta)=X(\lambda,s,\theta)=x_\lambda(s,\theta)$. At $(s,\theta)=(0,\theta_0)$ and $\lambda=\lambda_0$, the derivative with respect to $(s,\theta)$ has columns \begin{align*} \partial_s X_{\lambda_0}(0,\theta_0) = a(Y(\theta_0),h(\lambda_0,\theta_0)) \end{align*} and \begin{align*} \partial_{\theta_j}X_{\lambda_0}(0,\theta_0) = \partial_{\theta_j}Y(\theta_0), \qquad 1 \leq j \leq n-1. \end{align*} The vectors $\partial_{\theta_1}Y(\theta_0),\dots,\partial_{\theta_{n-1}}Y(\theta_0)$ span the tangent space of the parametrised hypersurface at $y_0$, and $\nu(Y(\theta_0))$ is normal to that tangent space. Since \begin{align*} a(Y(\theta_0),h(\lambda_0,\theta_0)) \cdot \nu(Y(\theta_0)) \neq 0, \end{align*} the vector $a(Y(\theta_0),h(\lambda_0,\theta_0))$ is not tangent to the hypersurface. Hence the $n$ vectors \begin{align*} a(Y(\theta_0),h(\lambda_0,\theta_0)), \quad \partial_{\theta_1}Y(\theta_0), \quad \dots, \quad \partial_{\theta_{n-1}}Y(\theta_0) \end{align*} form a basis of $\mathbb{R}^n$. Therefore \begin{align*} \det D_{(s,\theta)}X_{\lambda_0}(0,\theta_0) \neq 0. \end{align*} Because $X$ is $C^k$ and $k\geq 1$, the map \begin{align*} (\lambda,s,\theta) \mapsto \det D_{(s,\theta)}X(\lambda,s,\theta) \end{align*} is continuous. After shrinking $\Lambda_1$, $J$, and $\Theta_1$, we may assume \begin{align*} \det D_{(s,\theta)}X(\lambda,s,\theta)\neq 0 \end{align*} for every $(\lambda,s,\theta)\in \Lambda_1\times J\times \Theta_1$. [/step] [step:Apply the inverse function theorem with parameters] Define the augmented map $\mathcal{X}: \Lambda_1 \times J \times \Theta_1 \to \Lambda_1\times U$ by $\mathcal{X}(\lambda,s,\theta)=(\lambda,X(\lambda,s,\theta))$. Since $X$ is $C^k$, the map $\mathcal{X}$ is $C^k$. Its derivative at $(\lambda_0,0,\theta_0)$ is block triangular: the derivative in the $\lambda$ variables is the identity on the parameter component, and the derivative in the $(s,\theta)$ variables on the spatial component is $D_{(s,\theta)}X_{\lambda_0}(0,\theta_0)$. Hence $D\mathcal{X}_{(\lambda_0,0,\theta_0)}$ is invertible because $D_{(s,\theta)}X_{\lambda_0}(0,\theta_0)$ is invertible. By the [Inverse Function Theorem](/theorems/51), after shrinking there are open neighbourhoods $\Lambda_2\subset\Lambda_1$ of $\lambda_0$, $J_2\subset J$ of $0$, $\Theta_2\subset\Theta_1$ of $\theta_0$, and an open neighbourhood $W\subset\Lambda_1\times U$ of $(\lambda_0,y_0)$ such that $\mathcal{X}:\Lambda_2\times J_2\times\Theta_2\to W$ is a $C^k$ diffeomorphism onto its image. Since $W$ is open in $\Lambda_1\times U$ and contains $(\lambda_0,y_0)$, we may shrink $\Lambda_2$ and choose an open neighbourhood $V\subset U$ of $y_0$ so that $\Lambda_2\times V\subset W$. On this product neighbourhood, the inverse has the form $(\lambda,x)\mapsto (\lambda,S(\lambda,x))$ because the first component of $\mathcal{X}$ is $\lambda$. Thus $S: \Lambda_2\times V\to J_2\times\Theta_2$ is $C^k$, and $V\subset X_\lambda(J_2\times\Theta_2)$ for every $\lambda\in\Lambda_2$. Thus there exist open neighbourhoods $J_2\subset J$ of $0$, $\Theta_2\subset\Theta_1$ of $\theta_0$, $\Lambda_2\subset\Lambda_1$ of $\lambda_0$, an open neighbourhood $V\subset U$ of $y_0$, and a $C^k$ map $S: \Lambda_2 \times V \to J_2 \times \Theta_2$ defined by $S(\lambda,x)=(s_\lambda(x),\theta_\lambda(x))$, such that, for every $\lambda\in\Lambda_2$, the restricted map \begin{align*} X_\lambda|_{J_2\times\Theta_2}: J_2 \times \Theta_2 \to X_\lambda(J_2\times\Theta_2) \end{align*} is a $C^k$ diffeomorphism onto its open image, $V\subset X_\lambda(J_2\times\Theta_2)$, and \begin{align*} (X_\lambda|_{J_2\times\Theta_2})^{-1}(x)=(s_\lambda(x),\theta_\lambda(x)) \end{align*} for every $(\lambda,x)\in\Lambda_2\times V$. Equivalently, \begin{align*} X(\lambda,s_\lambda(x),\theta_\lambda(x))=x \end{align*} for every $(\lambda,x)\in\Lambda_2\times V$. [guided] We now turn local invertibility for each fixed $\lambda$ into a jointly $C^k$ inverse depending on $\lambda$. The relevant map is $X: \Lambda_1 \times J \times \Theta_1 \to U$, defined by $X(\lambda,s,\theta)=x_\lambda(s,\theta)$. For every point in the shrunk box, the derivative of $X$ with respect to the variables being inverted, namely $(s,\theta)$, is an invertible [linear map](/page/Linear%20Map) \begin{align*} D_{(s,\theta)}X(\lambda,s,\theta): \mathbb{R}\times\mathbb{R}^{n-1}\to \mathbb{R}^n. \end{align*} Rather than citing a separate parameter-dependent theorem, we obtain the uniform statement from the ordinary [Inverse Function Theorem](/theorems/51). Define the augmented map $\mathcal{X}:\Lambda_1\times J\times\Theta_1\to\Lambda_1\times U$ by $\mathcal{X}(\lambda,s,\theta)=(\lambda,X(\lambda,s,\theta))$. This map is $C^k$ because $X$ is $C^k$. Its derivative at $(\lambda_0,0,\theta_0)$ is block triangular: the parameter block is the identity on $\mathbb{R}^m$, and the spatial block in the variables $(s,\theta)$ is $D_{(s,\theta)}X_{\lambda_0}(0,\theta_0)$. Since that spatial block is invertible, the full derivative of $\mathcal{X}$ is invertible. The inverse function theorem therefore gives, after shrinking, a $C^k$ inverse to $\mathcal{X}$ on a neighbourhood of $(\lambda_0,y_0)$ in $\Lambda_1\times U$. Now use the fact that the target neighbourhood is open in $\Lambda_1\times U$ and contains $(\lambda_0,y_0)$. After shrinking $\Lambda_2$ and choosing an open neighbourhood $V\subset U$ of $y_0$, we may ensure $\Lambda_2\times V$ lies inside that target neighbourhood. Because the first component of $\mathcal{X}(\lambda,s,\theta)$ is exactly $\lambda$, its inverse on this product has the form $(\lambda,x)\mapsto (\lambda,S(\lambda,x))$. This gives a single $C^k$ inverse map $S:\Lambda_2\times V\to J_2\times\Theta_2$ defined by $S(\lambda,x)=(s_\lambda(x),\theta_\lambda(x))$. For each fixed $\lambda$, this says that the restriction $X_\lambda|_{J_2\times\Theta_2}$ is a $C^k$ diffeomorphism onto its open image, that $V$ lies inside that image, and that \begin{align*} (X_\lambda|_{J_2\times\Theta_2})^{-1}(x)=(s_\lambda(x),\theta_\lambda(x)) \end{align*} for every $x\in V$. Thus \begin{align*} X(\lambda,s_\lambda(x),\theta_\lambda(x))=x \end{align*} for every $(\lambda,x)\in\Lambda_2\times V$. This is the step that lets us solve for the characteristic label $\theta$ and characteristic time $s$ as $C^k$ functions of the physical point $x$ and the external parameter $\lambda$. [/guided] [/step] [step:Define the solution by pulling back the lifted characteristic height] Define $u: \Lambda_2 \times V \to I$ by $u(\lambda,x)=Z(\lambda,s_\lambda(x),\theta_\lambda(x))$. For each $\lambda\in\Lambda_2$, define $u_\lambda: V \to I$ by $u_\lambda(x)=u(\lambda,x)$. Since $Z$ is $C^k$ and $(\lambda,x)\mapsto (s_\lambda(x),\theta_\lambda(x))$ is $C^k$, the composition defining $u$ is $C^k$. It remains to verify the initial condition. Shrink $\Theta_2$ further, if necessary, so that $Y(\Theta_2)\subset V$. If $\theta\in\Theta_2$, then \begin{align*} X(\lambda,0,\theta)=x_\lambda(0,\theta)=Y(\theta). \end{align*} Since $S(\lambda,\cdot)$ is the inverse of the restricted map $X_\lambda|_{J_2\times\Theta_2}$ on $V$, this gives \begin{align*} s_\lambda(Y(\theta))=0, \qquad \theta_\lambda(Y(\theta))=\theta. \end{align*} Therefore $u_\lambda(Y(\theta))=Z(\lambda,0,\theta)=z_\lambda(0,\theta)=h(\lambda,\theta)$. Thus the constructed family has the prescribed initial data. Finally, we verify the equation. The inverse identity gives \begin{align*} u_\lambda(X(\lambda,s,\theta))=Z(\lambda,s,\theta) \end{align*} for $(\lambda,s,\theta)\in\Lambda_2\times J_2\times\Theta_2$. Since $k\geq 1$, differentiating this identity with respect to $s$ and using the chain rule gives \begin{align*} \nabla u_\lambda(X(\lambda,s,\theta))\cdot \partial_s X(\lambda,s,\theta)=\partial_s Z(\lambda,s,\theta). \end{align*} The characteristic equations identify $\partial_s X(\lambda,s,\theta)$ with $a(X(\lambda,s,\theta),Z(\lambda,s,\theta))$ and $\partial_s Z(\lambda,s,\theta)$ with $b(X(\lambda,s,\theta),Z(\lambda,s,\theta))$. For $x\in V$, set $(s,\theta)=(s_\lambda(x),\theta_\lambda(x))$. Since $X(\lambda,s_\lambda(x),\theta_\lambda(x))=x$ and $Z(\lambda,s_\lambda(x),\theta_\lambda(x))=u_\lambda(x)$, substitution yields \begin{align*} a(x,u_\lambda(x))\cdot \nabla u_\lambda(x)=b(x,u_\lambda(x)). \end{align*} Thus $u_\lambda$ is a classical solution with the prescribed initial data. [guided] The inverse branch is defined only on $\Lambda_2\times V$, so the solution must be defined on that same parameter neighbourhood. Define $u: \Lambda_2 \times V \to I$ by \begin{align*} u(\lambda,x)=Z(\lambda,s_\lambda(x),\theta_\lambda(x)). \end{align*} For each $\lambda\in\Lambda_2$, define $u_\lambda:V\to I$ by $u_\lambda(x)=u(\lambda,x)$. The map $Z$ is $C^k$ from the ODE smooth-dependence step, and the inverse branch $S(\lambda,x)=(s_\lambda(x),\theta_\lambda(x))$ is $C^k$ from the parameter-dependent inverse theorem. Therefore their composition is $C^k$, so $(\lambda,x)\mapsto u_\lambda(x)$ is $C^k$ on $\Lambda_2\times V$. We now check the initial condition. Shrink $\Theta_2$ further, if necessary, so that $Y(\Theta_2)\subset V$. For $\theta\in\Theta_2$, the characteristic initial condition gives $X(\lambda,0,\theta)=x_\lambda(0,\theta)=Y(\theta)$. Since $S(\lambda,\cdot)$ is the chosen inverse branch of $X_\lambda$ on $V$, applying this inverse to $Y(\theta)$ gives $s_\lambda(Y(\theta))=0$ and $\theta_\lambda(Y(\theta))=\theta$. Hence $u_\lambda(Y(\theta))=Z(\lambda,0,\theta)=z_\lambda(0,\theta)=h(\lambda,\theta)$. Finally we verify that $u_\lambda$ satisfies the first-order PDE. The defining identity is \begin{align*} u_\lambda(X(\lambda,s,\theta))=Z(\lambda,s,\theta), \end{align*} because $S$ is the inverse branch of $X_\lambda$ and $u_\lambda$ was defined by composing $Z$ with that inverse branch. Since $k\geq 1$, all functions in this identity are differentiable. Differentiating with respect to the characteristic time $s$ gives \begin{align*} \nabla u_\lambda(X(\lambda,s,\theta))\cdot \partial_s X(\lambda,s,\theta)=\partial_s Z(\lambda,s,\theta). \end{align*} The characteristic system identifies the two $s$-derivatives: \begin{align*} \partial_s X(\lambda,s,\theta)=a(X(\lambda,s,\theta),Z(\lambda,s,\theta)) \end{align*} and \begin{align*} \partial_s Z(\lambda,s,\theta)=b(X(\lambda,s,\theta),Z(\lambda,s,\theta)). \end{align*} For an arbitrary $x\in V$, put $(s,\theta)=(s_\lambda(x),\theta_\lambda(x))$. The inverse identity gives $X(\lambda,s_\lambda(x),\theta_\lambda(x))=x$, and the definition of $u_\lambda$ gives $Z(\lambda,s_\lambda(x),\theta_\lambda(x))=u_\lambda(x)$. Substituting these two identities into the differentiated characteristic identity yields \begin{align*} a(x,u_\lambda(x))\cdot \nabla u_\lambda(x)=b(x,u_\lambda(x)). \end{align*} Thus $u_\lambda$ is a classical solution, it has the prescribed initial data, and the solution family depends $C^k$ on $(\lambda,x)$. [/guided] [/step] [step:Record the common neighbourhood and conclude smooth dependence] The preceding construction used only shrinking of $\Lambda_1$, $\Theta_1$, $J$, and $V$ to $\Lambda_2$, $\Theta_2$, $J_2$, and $V$. Since the lifted solution map $\Gamma: \Lambda_2 \times J_2 \times \Theta_2 \to U \times I$ takes values in $U\times I$, all lifted characteristic curves in the final box remain in $U\times I$. The determinant condition \begin{align*} \det D_{(s,\theta)}X(\lambda,s,\theta)\neq 0 \end{align*} holds throughout the same box by construction, so each spatial characteristic map has invertible Jacobian on that box. We use only the inverse branch on the common neighbourhood $V\subset X_\lambda(J_2\times\Theta_2)$; we do not require the whole fixed box $J_2\times\Theta_2$ to map onto $V$. The map $u: \Lambda_2 \times V \to I$, defined by $u(\lambda,x)=u_\lambda(x)$, is $C^k$, and each $u_\lambda$ satisfies \begin{align*} u_\lambda(Y(\theta))=h(\lambda,\theta) \end{align*} for $\theta\in\Theta_2$. Renaming the shrunk neighbourhoods as $\Lambda$, $\Theta$, and $V$ gives the theorem. [guided] The final point is bookkeeping: all earlier shrinkings must be compatible with the statement of the theorem. We have shrunk the original parameter neighbourhoods and characteristic-time interval to $\Lambda_2$, $\Theta_2$, and $J_2$, and we have chosen one physical neighbourhood $V\subset U$ of $y_0$. The lifted solution map $\Gamma:\Lambda_2\times J_2\times\Theta_2\to U\times I$ still takes values in $U\times I$, so every lifted characteristic used in the construction remains inside the domain where $a$ and $b$ are defined. The same final box also preserves the nondegeneracy needed for inversion. By construction, \begin{align*} \det D_{(s,\theta)}X(\lambda,s,\theta)\neq 0 \end{align*} for every $(\lambda,s,\theta)\in\Lambda_2\times J_2\times\Theta_2$. Thus the spatial characteristic map has invertible Jacobian throughout the parameter box. We only use the inverse branch on the common neighbourhood $V\subset X_\lambda(J_2\times\Theta_2)$, so the proof does not require the fixed box $J_2\times\Theta_2$ to map exactly onto $V$. The map $u:\Lambda_2\times V\to I$, defined by $u(\lambda,x)=u_\lambda(x)$, is $C^k$, and the initial-data verification gives \begin{align*} u_\lambda(Y(\theta))=h(\lambda,\theta) \end{align*} for every $\theta\in\Theta_2$. Renaming $\Lambda_2$, $\Theta_2$, and $V$ as the final neighbourhoods $\Lambda$, $\Theta$, and $V$ gives exactly the smooth-dependence statement. [/guided] [/step]

Prerequisites (0/5 completed)

Prerequisites Graph

Interactive dependency map showing how this theorem builds on foundational concepts

Loading dependency graph...

Theorems

Definitions & Concepts

What brings you to Androma?

Start with a route through the knowledge graph.